1. 22 Sep, 2017 1 commit
  2. 20 Sep, 2017 2 commits
  3. 13 Sep, 2017 1 commit
    • [BACKEND] initial llvm codegen for amdgpu (#402) · 891e226b
      * added initial llvm codegen for amdgpu
      
      * fixed whitespace
      
      * fixed hsaco gen from ir
      
      * fixed targetmachine for rocm and added GetSource for rocm
      
      * fixed whitespace issues
      
      * changed statement to use less than 100 lines
      
      * added intrinsics for workgroup - rocm
      
      * whitespace - newline error fix
      
      * fixed error msg for workitem-workgroup intrinsics
      
      * added llvm ir dump for rocm codegen
      
      * [ROCM] changed codegen to emit proper amdgpu kernel header
      
      * fixed whitespace error
      
      * fixed whitespace error- 2
      
      * fixed AddFunction to not to use extra arg
      
      1. Changed AddFunctionInternal to not to take extra arg for target type
      2. Use Target from CodeGenLLVM to check for AMDGPU target
      
      * fixed whitespaces
      
      * fixed whitespaces 2
      
      * fixed codegen for AMDGPU - now generating valid IR
      
      * fixed codegen depending on code review
      
      * reviewed alignment for amd devices
      
      * added code to dump code object to file
      
      * fixed cpplint errors
      
      * print out IR after pass manager
      
      * added code to dump asm, obj to file and std string
      
      * fixed whitespaces
      
      * Update codegen_amdgpu.cc
      
      * used registry for amdgpu llvm
      
      * Fixed whitespaces
      
      * added code for calling linker
      
      * fixed formatting errors
      
      * added rocm link python interface
      
      * fixed pylint issues and added more body to the function
      
      * added doc string
      
      * added doc string for module
      
      * fixed python code after review, fixed llvm object codegen
      
      * fixed linker to generate code object
      
      * removed dumping to output file and debugging log out
      
      * fixed lint for python code
      
      * added fault check after running linker
      
      * removed print statement in rocm.py
      
      * changed rocm lld linker to raise runtimeerror than emitting error log to stderr
      
      * changed the way linker command line is pass to subprocess.popen
      
      * removed redundant code and reuse tvm utils
      
      * removed commented out code
      
      * removed cloning of unused modules, and put IR into string
      Aditya Atluri committed
  4. 09 Sep, 2017 1 commit
  5. 05 Sep, 2017 1 commit
  6. 01 Sep, 2017 1 commit
  7. 31 Aug, 2017 1 commit
  8. 30 Aug, 2017 1 commit
  9. 28 Aug, 2017 1 commit
  10. 20 Aug, 2017 2 commits
  11. 16 Aug, 2017 1 commit
  12. 08 Aug, 2017 1 commit
  13. 05 Aug, 2017 1 commit
  14. 19 Jul, 2017 2 commits
  15. 18 Jul, 2017 1 commit
    • [API] Prefetch schedule supported (#258) · 01cbc61a
      * prefetch interface added
      
      * prefetch python comments modified. prefetch info data structure maintained.
      
      * start injecting prefetches. first step (domain touch) implemented.
      
      * domain touch tested.
      
      * Prefetch ir_mutator and ir_visitor dispatch registered.
      
      * modify domain touched from passing a func_ref to passing a tensor
      
      * modify domain touched from passing a func_ref to passing a tensor
      
      * modify Tensor copy to Tensor ref
      
      * temp commit for rebase
      
      * debug info removed, typo fixed, ready to rebase
      
      * prefetch flatten test add!
      
      * roll back builtin functions to side effect functions
      
      * lint error fixed!
      
      * add cache line size to storage flatten argument
      
      * forgot modifications add
      
      * change code style to dmlc-like; get rid of can_prove, use manually compute instead
      
      * python lint error fixed
      
      * modify instrinsic name to pass tests
      
      * [TEST] get rid of str(), replace them by accessing attributes
      
      * change map to list comprehension
      
      * redundant numpy import removed
      Jian Weng committed
  16. 16 Jul, 2017 1 commit
  17. 14 Jul, 2017 1 commit
  18. 11 Jul, 2017 1 commit
  19. 08 Jul, 2017 2 commits
  20. 07 Jul, 2017 1 commit
  21. 06 Jul, 2017 2 commits
  22. 27 Jun, 2017 1 commit
  23. 21 Jun, 2017 1 commit
  24. 18 Jun, 2017 2 commits
  25. 16 Jun, 2017 1 commit
  26. 15 Jun, 2017 1 commit
  27. 06 Jun, 2017 1 commit
  28. 04 Jun, 2017 1 commit
  29. 25 May, 2017 2 commits
  30. 21 May, 2017 1 commit
  31. 09 May, 2017 1 commit
  32. 04 May, 2017 1 commit
  33. 30 Apr, 2017 1 commit