1. 02 Feb, 2018 1 commit
  2. 29 Dec, 2017 1 commit
  3. 04 Dec, 2017 1 commit
  4. 12 Oct, 2017 1 commit
  5. 13 Sep, 2017 1 commit
    • [BACKEND] initial llvm codegen for amdgpu (#402) · 891e226b
      * added initial llvm codegen for amdgpu
      
      * fixed whitespace
      
      * fixed hsaco gen from ir
      
      * fixed targetmachine for rocm and added GetSource for rocm
      
      * fixed whitespace issues
      
      * changed statement to use less than 100 lines
      
      * added intrinsics for workgroup - rocm
      
      * whitespace - newline error fix
      
      * fixed error msg for workitem-workgroup intrinsics
      
      * added llvm ir dump for rocm codegen
      
      * [ROCM] changed codegen to emit proper amdgpu kernel header
      
      * fixed whitespace error
      
      * fixed whitespace error- 2
      
      * fixed AddFunction to not to use extra arg
      
      1. Changed AddFunctionInternal to not to take extra arg for target type
      2. Use Target from CodeGenLLVM to check for AMDGPU target
      
      * fixed whitespaces
      
      * fixed whitespaces 2
      
      * fixed codegen for AMDGPU - now generating valid IR
      
      * fixed codegen depending on code review
      
      * reviewed alignment for amd devices
      
      * added code to dump code object to file
      
      * fixed cpplint errors
      
      * print out IR after pass manager
      
      * added code to dump asm, obj to file and std string
      
      * fixed whitespaces
      
      * Update codegen_amdgpu.cc
      
      * used registry for amdgpu llvm
      
      * Fixed whitespaces
      
      * added code for calling linker
      
      * fixed formatting errors
      
      * added rocm link python interface
      
      * fixed pylint issues and added more body to the function
      
      * added doc string
      
      * added doc string for module
      
      * fixed python code after review, fixed llvm object codegen
      
      * fixed linker to generate code object
      
      * removed dumping to output file and debugging log out
      
      * fixed lint for python code
      
      * added fault check after running linker
      
      * removed print statement in rocm.py
      
      * changed rocm lld linker to raise runtimeerror than emitting error log to stderr
      
      * changed the way linker command line is pass to subprocess.popen
      
      * removed redundant code and reuse tvm utils
      
      * removed commented out code
      
      * removed cloning of unused modules, and put IR into string
      Aditya Atluri committed
  6. 28 Aug, 2017 1 commit
  7. 24 Jul, 2017 1 commit
  8. 18 Jul, 2017 1 commit
    • [API] Prefetch schedule supported (#258) · 01cbc61a
      * prefetch interface added
      
      * prefetch python comments modified. prefetch info data structure maintained.
      
      * start injecting prefetches. first step (domain touch) implemented.
      
      * domain touch tested.
      
      * Prefetch ir_mutator and ir_visitor dispatch registered.
      
      * modify domain touched from passing a func_ref to passing a tensor
      
      * modify domain touched from passing a func_ref to passing a tensor
      
      * modify Tensor copy to Tensor ref
      
      * temp commit for rebase
      
      * debug info removed, typo fixed, ready to rebase
      
      * prefetch flatten test add!
      
      * roll back builtin functions to side effect functions
      
      * lint error fixed!
      
      * add cache line size to storage flatten argument
      
      * forgot modifications add
      
      * change code style to dmlc-like; get rid of can_prove, use manually compute instead
      
      * python lint error fixed
      
      * modify instrinsic name to pass tests
      
      * [TEST] get rid of str(), replace them by accessing attributes
      
      * change map to list comprehension
      
      * redundant numpy import removed
      Jian Weng committed
  9. 08 Jul, 2017 1 commit
  10. 06 Jul, 2017 1 commit
  11. 10 May, 2017 1 commit
    • [PASS] Use likely tag & enable LoopPartition by default (#132) · e9debc9b
      * [PASS] Use likely tag & enable LoopPartition by default
      
      * [PASS] Support thread_axis partition
      
      * Take IfThenElse branch method
      
      * [PASS] Insert branch at the innermost thread scope
      
      * [PASS] Select candidates before trying to partition & add test for select
      
      * [PASS] Clean code
      
      * Fix
      
      * Remove print & assert vectorize happens
      ziheng committed
  12. 02 May, 2017 1 commit
  13. 30 Apr, 2017 1 commit
  14. 28 Apr, 2017 1 commit
  15. 18 Apr, 2017 1 commit
  16. 16 Apr, 2017 1 commit
  17. 15 Apr, 2017 1 commit
  18. 09 Apr, 2017 1 commit
  19. 05 Mar, 2017 1 commit
  20. 26 Feb, 2017 1 commit
  21. 24 Feb, 2017 1 commit
  22. 22 Feb, 2017 1 commit
  23. 04 Feb, 2017 1 commit
  24. 02 Feb, 2017 1 commit
  25. 31 Jan, 2017 2 commits