- 23 Sep, 2017 1 commit
-
-
* [TEST] rfactor+ewise, cite rfactor paper * include all authors via abbrv * [TOPI] Add transpose * fix lint
Tianqi Chen committed
-
- 22 Sep, 2017 1 commit
-
-
* [INTRIN] Enable pow * rename pow->power * fix
Tianqi Chen committed
-
- 17 Sep, 2017 1 commit
-
-
* [PASS] Fix intrinsic lowering with fma and other intrin * relax rtol for sqrt
Tianqi Chen committed
-
- 13 Sep, 2017 1 commit
-
-
* added initial llvm codegen for amdgpu * fixed whitespace * fixed hsaco gen from ir * fixed targetmachine for rocm and added GetSource for rocm * fixed whitespace issues * changed statement to use less than 100 lines * added intrinsics for workgroup - rocm * whitespace - newline error fix * fixed error msg for workitem-workgroup intrinsics * added llvm ir dump for rocm codegen * [ROCM] changed codegen to emit proper amdgpu kernel header * fixed whitespace error * fixed whitespace error- 2 * fixed AddFunction to not to use extra arg 1. Changed AddFunctionInternal to not to take extra arg for target type 2. Use Target from CodeGenLLVM to check for AMDGPU target * fixed whitespaces * fixed whitespaces 2 * fixed codegen for AMDGPU - now generating valid IR * fixed codegen depending on code review * reviewed alignment for amd devices * added code to dump code object to file * fixed cpplint errors * print out IR after pass manager * added code to dump asm, obj to file and std string * fixed whitespaces * Update codegen_amdgpu.cc * used registry for amdgpu llvm * Fixed whitespaces * added code for calling linker * fixed formatting errors * added rocm link python interface * fixed pylint issues and added more body to the function * added doc string * added doc string for module * fixed python code after review, fixed llvm object codegen * fixed linker to generate code object * removed dumping to output file and debugging log out * fixed lint for python code * added fault check after running linker * removed print statement in rocm.py * changed rocm lld linker to raise runtimeerror than emitting error log to stderr * changed the way linker command line is pass to subprocess.popen * removed redundant code and reuse tvm utils * removed commented out code * removed cloning of unused modules, and put IR into string
Aditya Atluri committed
-
- 12 Sep, 2017 1 commit
-
-
* [RUNTIME] Enable extension type to PackedFunc. * More comments
Tianqi Chen committed
-
- 11 Sep, 2017 1 commit
-
-
* [RUNTIME][RPC] Enable remote linking of device code. * fix build
Tianqi Chen committed
-
- 08 Sep, 2017 1 commit
-
-
* improved conv2d for last group of workloads * conv2d_nchw improved on 14_256_256 and 56_64_128
Leyuan Wang committed
-
- 07 Sep, 2017 1 commit
-
-
* [SCHEDULE] Enahance cache_write to enable layout change. * more tests
Tianqi Chen committed
-
- 05 Sep, 2017 1 commit
-
-
Tianqi Chen committed
-
- 03 Sep, 2017 2 commits
-
-
Tianqi Chen committed
-
Tianqi Chen committed
-
- 01 Sep, 2017 1 commit
-
-
Tianqi Chen committed
-
- 30 Aug, 2017 3 commits
-
-
Tianqi Chen committed
-
* [SCHEDULE][PASS] support storage_align of certain axis * fix lint
Tianqi Chen committed -
Tianqi Chen committed
-
- 28 Aug, 2017 1 commit
-
-
* [CODEGEN] NVPTX backend. * Fix pylint * use fix
Tianqi Chen committed
-
- 26 Aug, 2017 1 commit
-
-
* v2: runtime support for rocm * fixed coding space errors * removed kROCM from c_runtime_api.h
Aditya Atluri committed
-
- 24 Aug, 2017 1 commit
-
-
Tianqi Chen committed
-
- 22 Aug, 2017 1 commit
-
-
Tianqi Chen committed
-
- 20 Aug, 2017 1 commit
-
-
* [BUILD][LLVM] Support LLVM mainline 5.0 6.0 * Reduce parallelism
Tianqi Chen committed
-
- 16 Aug, 2017 1 commit
-
-
Tianqi Chen committed
-
- 15 Aug, 2017 3 commits
-
-
* [Contrib] CuDNN v7 Support * Add test
ziheng committed -
* [TOPI] Move ewise.h -> elemwise.h * fix test
Tianqi Chen committed -
[TOPI] Add broadcast and reduce operators
Xingjian Shi committed
-
- 13 Aug, 2017 2 commits
-
-
Tianqi Chen committed
-
Tianqi Chen committed
-
- 12 Aug, 2017 1 commit
-
-
Tianqi Chen committed
-
- 11 Aug, 2017 1 commit
-
-
* [PASS][FIX] Fix LiftAttrScope with if * [PASS] Fix on proc sync * fix
Tianqi Chen committed
-
- 10 Aug, 2017 1 commit
-
-
* [TEST] Upgrade gpu docker to cudnn7 * fx
Tianqi Chen committed
-
- 09 Aug, 2017 1 commit
-
-
Tianqi Chen committed
-
- 08 Aug, 2017 3 commits
-
-
Tianqi Chen committed
-
* [tvm4j] RPC Server * [tvm4j] fix recursively function calling; connect to proxy server; osx rename .so to .dylib * [tvm4j] test case for proxy connection; thread pool for serving
Yizhi Liu committed -
* [RUNTIME][PASS] Allow declare vector type array * fix bcast * [BUFFER] Enable vload/store function in buffer * ok
Tianqi Chen committed
-
- 07 Aug, 2017 1 commit
-
-
* [NNPACK] Add nnpack.convolution * Add instrinsic * Fix lint
ziheng committed
-
- 05 Aug, 2017 1 commit
-
-
Tianqi Chen committed
-
- 04 Aug, 2017 1 commit
-
-
Tianqi Chen committed
-
- 03 Aug, 2017 2 commits
-
-
* [PASS] Refactor thread storage sync to a common visitor * Fix the sync scope check behavior
Tianqi Chen committed -
Tianqi Chen committed
-
- 01 Aug, 2017 1 commit
-
-
* [SCHEDULE] Fix fuse node order * Make fuse order consistent with split
Tianqi Chen committed
-
- 31 Jul, 2017 1 commit
-
-
William Moses committed
-