- 26 Sep, 2017 4 commits
-
-
* Move target to tvm; rename convolution as conv2d * Fix * Fix
ziheng committed -
* add squeeze * should be squeeze
Xingjian Shi committed -
Tianqi Chen committed
-
Tianqi Chen committed
-
- 25 Sep, 2017 6 commits
-
-
Salem Derisavi committed
-
* [TOPI]add reshape * fix problems * fix lint * try to add concatenate * fix lint and error * fix doc * fix error * try to add split * fix lint * fix error * fix lint
Xingjian Shi committed -
Tianqi Chen committed
-
Tianqi Chen committed
-
Tianqi Chen committed
-
* [RUNTIME] Minimum graph runtime * update docs
Tianqi Chen committed
-
- 24 Sep, 2017 4 commits
-
-
Leyuan Wang committed
-
Yuwei HU committed
-
Tianqi Chen committed
-
* conv2d layout change and packing added for the last workload * packing added for other workloads * conv2d added packing for first workload * fix pylint error
Leyuan Wang committed
-
- 23 Sep, 2017 6 commits
-
-
Yuwei HU committed
-
Tianqi Chen committed
-
Tianqi Chen committed
-
Tianqi Chen committed
-
* migrate global_avg_pool, fully_connected * fix pylint * enable fusion of pooling schedule * rename fc->dense, enable fusion * improve dense schedule * unified global pool
Yuwei HU committed -
* [TEST] rfactor+ewise, cite rfactor paper * include all authors via abbrv * [TOPI] Add transpose * fix lint
Tianqi Chen committed
-
- 22 Sep, 2017 4 commits
-
-
Tianqi Chen committed
-
* [INTRIN] Enable pow * rename pow->power * fix
Tianqi Chen committed -
Tianqi Chen committed
-
Tianqi Chen committed
-
- 21 Sep, 2017 2 commits
-
-
vsooda committed
-
Tianqi Chen committed
-
- 20 Sep, 2017 2 commits
-
-
Tianqi Chen committed
-
* [CODEGEN] Redo CodegenLLVM. * Add remarks about origin of the pass Properly acknowledge related projects * Fix and expression
Tianqi Chen committed
-
- 19 Sep, 2017 1 commit
-
-
Tianqi Chen committed
-
- 18 Sep, 2017 5 commits
-
-
* [METAL] use 32bit indexing for metal until we have a bound adapted pass * fix lint
Tianqi Chen committed -
Tianqi Chen committed
-
Xingjian Shi committed
-
* [RPC] Expose module handle * not include handle
Tianqi Chen committed -
* [RPC] Include rpc session info into context * add type checker in return converison
Tianqi Chen committed
-
- 17 Sep, 2017 2 commits
-
-
* [PASS] Fix intrinsic lowering with fma and other intrin * relax rtol for sqrt
Tianqi Chen committed -
* add binary broadacst * fix testing * revise testing threshold
Xingjian Shi committed
-
- 14 Sep, 2017 1 commit
-
-
Aditya Atluri committed
-
- 13 Sep, 2017 2 commits
-
-
* added initial llvm codegen for amdgpu * fixed whitespace * fixed hsaco gen from ir * fixed targetmachine for rocm and added GetSource for rocm * fixed whitespace issues * changed statement to use less than 100 lines * added intrinsics for workgroup - rocm * whitespace - newline error fix * fixed error msg for workitem-workgroup intrinsics * added llvm ir dump for rocm codegen * [ROCM] changed codegen to emit proper amdgpu kernel header * fixed whitespace error * fixed whitespace error- 2 * fixed AddFunction to not to use extra arg 1. Changed AddFunctionInternal to not to take extra arg for target type 2. Use Target from CodeGenLLVM to check for AMDGPU target * fixed whitespaces * fixed whitespaces 2 * fixed codegen for AMDGPU - now generating valid IR * fixed codegen depending on code review * reviewed alignment for amd devices * added code to dump code object to file * fixed cpplint errors * print out IR after pass manager * added code to dump asm, obj to file and std string * fixed whitespaces * Update codegen_amdgpu.cc * used registry for amdgpu llvm * Fixed whitespaces * added code for calling linker * fixed formatting errors * added rocm link python interface * fixed pylint issues and added more body to the function * added doc string * added doc string for module * fixed python code after review, fixed llvm object codegen * fixed linker to generate code object * removed dumping to output file and debugging log out * fixed lint for python code * added fault check after running linker * removed print statement in rocm.py * changed rocm lld linker to raise runtimeerror than emitting error log to stderr * changed the way linker command line is pass to subprocess.popen * removed redundant code and reuse tvm utils * removed commented out code * removed cloning of unused modules, and put IR into string
Aditya Atluri committed -
Tianqi Chen committed
-
- 12 Sep, 2017 1 commit
-
-
Leyuan Wang committed
-