1. 23 Sep, 2017 1 commit
  2. 22 Sep, 2017 1 commit
  3. 13 Sep, 2017 1 commit
    • [BACKEND] initial llvm codegen for amdgpu (#402) · 891e226b
      * added initial llvm codegen for amdgpu
      
      * fixed whitespace
      
      * fixed hsaco gen from ir
      
      * fixed targetmachine for rocm and added GetSource for rocm
      
      * fixed whitespace issues
      
      * changed statement to use less than 100 lines
      
      * added intrinsics for workgroup - rocm
      
      * whitespace - newline error fix
      
      * fixed error msg for workitem-workgroup intrinsics
      
      * added llvm ir dump for rocm codegen
      
      * [ROCM] changed codegen to emit proper amdgpu kernel header
      
      * fixed whitespace error
      
      * fixed whitespace error- 2
      
      * fixed AddFunction to not to use extra arg
      
      1. Changed AddFunctionInternal to not to take extra arg for target type
      2. Use Target from CodeGenLLVM to check for AMDGPU target
      
      * fixed whitespaces
      
      * fixed whitespaces 2
      
      * fixed codegen for AMDGPU - now generating valid IR
      
      * fixed codegen depending on code review
      
      * reviewed alignment for amd devices
      
      * added code to dump code object to file
      
      * fixed cpplint errors
      
      * print out IR after pass manager
      
      * added code to dump asm, obj to file and std string
      
      * fixed whitespaces
      
      * Update codegen_amdgpu.cc
      
      * used registry for amdgpu llvm
      
      * Fixed whitespaces
      
      * added code for calling linker
      
      * fixed formatting errors
      
      * added rocm link python interface
      
      * fixed pylint issues and added more body to the function
      
      * added doc string
      
      * added doc string for module
      
      * fixed python code after review, fixed llvm object codegen
      
      * fixed linker to generate code object
      
      * removed dumping to output file and debugging log out
      
      * fixed lint for python code
      
      * added fault check after running linker
      
      * removed print statement in rocm.py
      
      * changed rocm lld linker to raise runtimeerror than emitting error log to stderr
      
      * changed the way linker command line is pass to subprocess.popen
      
      * removed redundant code and reuse tvm utils
      
      * removed commented out code
      
      * removed cloning of unused modules, and put IR into string
      Aditya Atluri committed
  4. 01 Sep, 2017 1 commit
  5. 28 Aug, 2017 1 commit
  6. 24 Jul, 2017 1 commit
  7. 18 Jul, 2017 1 commit
    • [API] Prefetch schedule supported (#258) · 01cbc61a
      * prefetch interface added
      
      * prefetch python comments modified. prefetch info data structure maintained.
      
      * start injecting prefetches. first step (domain touch) implemented.
      
      * domain touch tested.
      
      * Prefetch ir_mutator and ir_visitor dispatch registered.
      
      * modify domain touched from passing a func_ref to passing a tensor
      
      * modify domain touched from passing a func_ref to passing a tensor
      
      * modify Tensor copy to Tensor ref
      
      * temp commit for rebase
      
      * debug info removed, typo fixed, ready to rebase
      
      * prefetch flatten test add!
      
      * roll back builtin functions to side effect functions
      
      * lint error fixed!
      
      * add cache line size to storage flatten argument
      
      * forgot modifications add
      
      * change code style to dmlc-like; get rid of can_prove, use manually compute instead
      
      * python lint error fixed
      
      * modify instrinsic name to pass tests
      
      * [TEST] get rid of str(), replace them by accessing attributes
      
      * change map to list comprehension
      
      * redundant numpy import removed
      Jian Weng committed
  8. 14 Jul, 2017 1 commit
  9. 06 Jul, 2017 1 commit
  10. 30 Jun, 2017 1 commit
  11. 11 Jun, 2017 1 commit
    • Support for Tuple Inputs of Reducer and ComputeOp (#175) · f467f66e
      * Support for batch ComputeOp
      
      * Support for batch ComputeOp
      
      * Fix CrossThreadReduction
      
      * Fix lint
      
      * Add UpdateArray, remove support for batch reduce
      
      * Tuple input support for reduce
      
      * rfactor works with multiple reducer; support multiple reducers with different types
      
      * Small fix
      
      * Small fix
      
      * Change return type of rfactor to Array<Expr>
      
      * Fix lint
      
      * Improve
      
      * Add tutorial
      
      * Improve tutorial
      
      * Improve tutorial
      ziheng committed
  12. 03 Jun, 2017 1 commit
  13. 02 Jun, 2017 1 commit
  14. 24 May, 2017 1 commit
  15. 22 May, 2017 1 commit
  16. 10 May, 2017 1 commit
    • [PASS] Use likely tag & enable LoopPartition by default (#132) · e9debc9b
      * [PASS] Use likely tag & enable LoopPartition by default
      
      * [PASS] Support thread_axis partition
      
      * Take IfThenElse branch method
      
      * [PASS] Insert branch at the innermost thread scope
      
      * [PASS] Select candidates before trying to partition & add test for select
      
      * [PASS] Clean code
      
      * Fix
      
      * Remove print & assert vectorize happens
      ziheng committed
  17. 09 May, 2017 1 commit
  18. 08 May, 2017 1 commit
  19. 05 May, 2017 1 commit
  20. 02 May, 2017 1 commit
  21. 28 Apr, 2017 1 commit
  22. 26 Apr, 2017 1 commit
    • [LANG] CommReducer (#103) · 26d91985
      * [LANG] CommReducer
      
      * Reorganize c_api
      
      * Remove InitValue and Combine; refactor Functor
      
      * Make CommReducer an Expr
      
      * Make comm_reducer type independent
      
      * Make CommReducerNode a Node
      
      * Small fix
      
      * Refine
      
      * Refine front api; add integration testcases for min/max
      
      * Fix python
      
      * Refine
      
      * Fix lint and add example
      ziheng committed
  23. 22 Apr, 2017 1 commit
  24. 18 Apr, 2017 1 commit
  25. 16 Apr, 2017 1 commit
  26. 15 Apr, 2017 1 commit
  27. 11 Apr, 2017 1 commit
  28. 09 Apr, 2017 1 commit
  29. 01 Apr, 2017 1 commit
  30. 29 Mar, 2017 1 commit
  31. 26 Mar, 2017 1 commit
  32. 05 Mar, 2017 1 commit
  33. 03 Mar, 2017 1 commit
  34. 27 Feb, 2017 1 commit
  35. 26 Feb, 2017 1 commit
  36. 24 Feb, 2017 1 commit
  37. 20 Feb, 2017 1 commit
  38. 17 Feb, 2017 1 commit
  39. 14 Feb, 2017 1 commit
  40. 09 Feb, 2017 1 commit