- 22 Sep, 2017 3 commits
-
-
* [INTRIN] Enable pow * rename pow->power * fix
Tianqi Chen committed
-
Tianqi Chen committed
-
Tianqi Chen committed
-
- 21 Sep, 2017 2 commits
-
-
vsooda committed
-
Tianqi Chen committed
-
- 20 Sep, 2017 2 commits
-
-
Tianqi Chen committed
-
* [CODEGEN] Redo CodegenLLVM. * Add remarks about origin of the pass Properly acknowledge related projects * Fix and expression
Tianqi Chen committed
-
- 19 Sep, 2017 1 commit
-
-
Tianqi Chen committed
-
- 18 Sep, 2017 5 commits
-
-
* [METAL] use 32bit indexing for metal until we have a bound adapted pass * fix lint
Tianqi Chen committed
-
Tianqi Chen committed
-
Xingjian Shi committed
-
* [RPC] Expose module handle * not include handle
Tianqi Chen committed
-
* [RPC] Include rpc session info into context * add type checker in return conversion
Tianqi Chen committed
-
- 17 Sep, 2017 2 commits
-
-
* [PASS] Fix intrinsic lowering with fma and other intrin * relax rtol for sqrt
Tianqi Chen committed
-
* add binary broadcast * fix testing * revise testing threshold
Xingjian Shi committed
-
- 14 Sep, 2017 1 commit
-
-
Aditya Atluri committed
-
- 13 Sep, 2017 2 commits
-
-
* added initial llvm codegen for amdgpu * fixed whitespace * fixed hsaco gen from ir * fixed targetmachine for rocm and added GetSource for rocm * fixed whitespace issues * changed statement to use less than 100 lines * added intrinsics for workgroup - rocm * whitespace - newline error fix * fixed error msg for workitem-workgroup intrinsics * added llvm ir dump for rocm codegen * [ROCM] changed codegen to emit proper amdgpu kernel header * fixed whitespace error * fixed whitespace error- 2 * fixed AddFunction to not to use extra arg 1. Changed AddFunctionInternal to not to take extra arg for target type 2. Use Target from CodeGenLLVM to check for AMDGPU target * fixed whitespaces * fixed whitespaces 2 * fixed codegen for AMDGPU - now generating valid IR * fixed codegen depending on code review * reviewed alignment for amd devices * added code to dump code object to file * fixed cpplint errors * print out IR after pass manager * added code to dump asm, obj to file and std string * fixed whitespaces * Update codegen_amdgpu.cc * used registry for amdgpu llvm * Fixed whitespaces * added code for calling linker * fixed formatting errors * added rocm link python interface * fixed pylint issues and added more body to the function * added doc string * added doc string for module * fixed python code after review, fixed llvm object codegen * fixed linker to generate code object * removed dumping to output file and debugging log out * fixed lint for python code * added fault check after running linker * removed print statement in rocm.py * changed rocm lld linker to raise runtimeerror than emitting error log to stderr * changed the way linker command line is pass to subprocess.popen * removed redundant code and reuse tvm utils * removed commented out code * removed cloning of unused modules, and put IR into string
Aditya Atluri committed
-
Tianqi Chen committed
-
- 12 Sep, 2017 4 commits
-
-
Leyuan Wang committed
-
Clarify confusing error message for unmatched context
Shuai Yuan committed
-
* rename the nchw and pass the unit test; going to do it for nhwc depthwise * bug with fusion * nchw works fine; nhwc float32 problem remains * still cannot bind them together * fusion works * syntax fix * all bugs fixed; test cases pass * minor fix on nn.h * back wrt input * backward wrt input nhwc; only test case in recipe * test case for depthwise back wrt input * test case for depthwise backward wrt weight * tags * minor fixes * pylint test; add arch=3.7 * modify scheduler * better backward depthwise w.r.t weight scheduler * updated scheduler * test_topi_depthwise_conv2d_back_input.py and test_topi_depthwise_conv2d_back_weight.py success * all test cases wrt input pass * update * new test cases and scheduler * not working 1 and 2 * good wrt weight, bad wrt input * test cases added * remove tf lines * minor fix * compute arch changed * remove compile hook * minor change * pylint * fix the float for python case * fix cases for python3 case * except for memoize * fix most; memoize still wrong * memoize added * unexpected layout cases added for scheduler * error message layout other than NHWC added * improve padding * fix as pr requests * remove dilate in backward wrt weight
wetliu committed
-
* [RUNTIME] Enable extension type to PackedFunc. * More comments
Tianqi Chen committed
-
- 11 Sep, 2017 2 commits
-
-
* [DOCS] Add prerequisites about zlib1g-dev Add prerequisites about zlib1g-dev. It occurs `/usr/bin/ld: cannot find -lz` without zlib1g-dev. * Add prerequisites about python-setuptools Add prerequisites about python-setuptools. Otherwise, it will fail when executing `python setup install --user` command. * [DOCS] Add prerequisites about python-dev Add installation prerequisites about python-dev. Otherwise, it will fail with `SystemError: Cannot compile 'Python.h'. Perhaps you need to install python-dev|python-devel.` when executing `python setup install --user`.
Shuai Yuan committed
-
* [RUNTIME][RPC] Enable remote linking of device code. * fix build
Tianqi Chen committed
-
- 10 Sep, 2017 1 commit
-
-
Yizhi Liu committed
-
- 09 Sep, 2017 3 commits
-
-
Yizhi Liu committed
-
Tianqi Chen committed
-
Tianqi Chen committed
-
- 08 Sep, 2017 1 commit
-
-
* improved conv2d for last group of workloads * conv2d_nchw improved on 14_256_256 and 56_64_128
Leyuan Wang committed
-
- 07 Sep, 2017 2 commits
-
-
* [SCHEDULE] Enhance cache_write to enable layout change. * more tests
Tianqi Chen committed
-
Fix markdown syntax error (code shifts out of markdown-code box).
Shuai Yuan committed
-
- 06 Sep, 2017 2 commits
-
-
Tianqi Chen committed
-
* relu activation migrated to topi * reviews addressed * relu compute deleted * conv2d_nchw updated * resnet18 hand tuned schedule added * pylint error fixed * one more workload test for conv2d_nchw * conv2d schedule subfunctions added for different patterns * reviews addressed
Leyuan Wang committed
-
- 05 Sep, 2017 4 commits
-
-
Tianqi Chen committed
-
* [TEST] Add memoize to save test data * Update comment * mark py version
Tianqi Chen committed
-
Tianqi Chen committed
-
* [SETUP] Always use relpath for setup * [CMAKE] Fix cmake llvm build
Tianqi Chen committed
-
- 04 Sep, 2017 1 commit
-
-
ziheng committed
-
- 03 Sep, 2017 2 commits
-
-
* CPU Schedule for raspberry pi * Update * Update * Add topi.target * Refactor * Update * Make python3 happy * Improve * Improve * Improve * Use get_const_int
ziheng committed
-
Tianqi Chen committed
-