- 14 Nov, 2017 1 commit
-
-
Tianqi Chen committed
-
- 13 Nov, 2017 1 commit
-
-
Tianqi Chen committed
-
- 12 Nov, 2017 1 commit
-
-
Tianqi Chen committed
-
- 22 Oct, 2017 2 commits
-
-
Tianqi Chen committed
-
Wei Chen committed
-
- 19 Oct, 2017 1 commit
-
-
Use `object.__eq__` (the default object identity comparison) as the default implementation of same_as. This should be fine because `EqualOp` and `NotEqualOp` are pure Python objects, so `object.__eq__` is sufficient.
Wei Chen committed
-
- 17 Oct, 2017 1 commit
-
-
* [PYTHON] Improve equal sugar * fix comment
Tianqi Chen committed
-
- 14 Oct, 2017 1 commit
-
-
* [TVM] Introduce target generic dispatch system * fix target warning
Tianqi Chen committed
-
- 13 Oct, 2017 2 commits
-
-
* Add same_as to NodeBase
  1. Most classes inherited from NodeBase (Schedule, Stage, etc.) keep the convenience of using '==' for object identity, which is the right behavior for non-Expr classes.
  2. Subclasses of ExprOp now create an EQ expression when '==' is used. `__nonzero__` and `__bool__` in EQ and NE are a compromise, because object identity semantics is still useful in some cases, such as unit tests. For instance:
  ````
  assert a == b
  ````
  "a == b" creates an EQ expression, and assert then calls `__nonzero__` on the resulting expression. `Expr.__nonzero__` throws an exception since it prohibits evaluating IR expressions. A more complex case:
  ````
  assert a in b # b is dict
  ````
  This calls `__eq__` on a and all keys of b, then `__bool__` on the resulting expression. This could not easily be done with same_as.
* Retain __hash__ from NodeBase in Python3
Wei Chen committed
-
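The behavior described in the same_as commit above can be sketched roughly as follows. This is a minimal standalone illustration, not the project's actual code: the `handle` attribute, the `Var` class, and the simplified `EQ` class are hypothetical stand-ins, and only the names `NodeBase`, `same_as`, and `EQ` come from the commit message.

```python
class NodeBase:
    """Hypothetical stand-in: a node identified by an opaque handle."""

    def __init__(self, handle):
        self.handle = handle

    def same_as(self, other):
        # Object identity: two wrappers are the same if they share a handle.
        return isinstance(other, NodeBase) and self.handle == other.handle

    def __eq__(self, other):
        # Non-expression classes keep '==' as an identity check.
        return self.same_as(other)

    def __ne__(self, other):
        return not self.same_as(other)

    def __hash__(self):
        return hash(self.handle)


class EQ:
    """Hypothetical, simplified EQ expression produced by '==' on expressions."""

    def __init__(self, a, b):
        self.a, self.b = a, b

    def __bool__(self):
        # The compromise: truth-testing falls back to identity semantics.
        return self.a.same_as(self.b)

    __nonzero__ = __bool__  # Python 2 spelling of the same hook


class Var(NodeBase):
    """Hypothetical expression class: '==' builds an EQ node."""

    def __eq__(self, other):
        return EQ(self, other)

    # Overriding __eq__ sets __hash__ to None in Python 3,
    # so it has to be retained explicitly.
    __hash__ = NodeBase.__hash__


x, y = Var("x"), Var("y")
expr = x == y          # an EQ object, not a bool
assert x == x          # assert truth-tests the EQ via __bool__
lookup = {x: 1}
assert Var("x") in lookup  # dict lookup uses __hash__, then __eq__ and __bool__
```

In this sketch, non-expression nodes compare directly by identity, while expression nodes only fall back to identity when the resulting EQ object is truth-tested, which is what lets `assert` and dict membership keep working.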
Hu Shiwen committed
-
- 12 Oct, 2017 2 commits
-
-
also improve comment and unit test
陳煒 committed
-
* [RUNTIME] Enable ext_dev type for quick plugin of device * [TEST] Update testcase to cover all computation
Tianqi Chen committed
-
- 11 Oct, 2017 1 commit
-
-
* [PASS] copy intrin * update comment thanks to derisavi
Tianqi Chen committed
-
- 10 Oct, 2017 1 commit
-
-
* [ARITH] Improve detect linear equation * fix doc
Tianqi Chen committed
-
- 05 Oct, 2017 1 commit
-
-
Tianqi Chen committed
-
- 04 Oct, 2017 1 commit
-
-
Tianqi Chen committed
-
- 01 Oct, 2017 1 commit
-
-
* [LANG] Support for Bitwise Operation * Add test
ziheng committed
-
- 25 Sep, 2017 1 commit
-
-
* [RUNTIME] Minimum graph runtime * update docs
Tianqi Chen committed
-
- 17 Sep, 2017 1 commit
-
-
* [PASS] Fix intrinsic lowering with fma and other intrin * relax rtol for sqrt
Tianqi Chen committed
-
- 13 Sep, 2017 1 commit
-
-
* added initial llvm codegen for amdgpu
* fixed whitespace
* fixed hsaco gen from ir
* fixed targetmachine for rocm and added GetSource for rocm
* fixed whitespace issues
* changed statement to use less than 100 lines
* added intrinsics for workgroup - rocm
* whitespace - newline error fix
* fixed error msg for workitem-workgroup intrinsics
* added llvm ir dump for rocm codegen
* [ROCM] changed codegen to emit proper amdgpu kernel header
* fixed whitespace error
* fixed whitespace error- 2
* fixed AddFunction to not to use extra arg
  1. Changed AddFunctionInternal to not to take extra arg for target type
  2. Use Target from CodeGenLLVM to check for AMDGPU target
* fixed whitespaces
* fixed whitespaces 2
* fixed codegen for AMDGPU - now generating valid IR
* fixed codegen depending on code review
* reviewed alignment for amd devices
* added code to dump code object to file
* fixed cpplint errors
* print out IR after pass manager
* added code to dump asm, obj to file and std string
* fixed whitespaces
* Update codegen_amdgpu.cc
* used registry for amdgpu llvm
* Fixed whitespaces
* added code for calling linker
* fixed formatting errors
* added rocm link python interface
* fixed pylint issues and added more body to the function
* added doc string
* added doc string for module
* fixed python code after review, fixed llvm object codegen
* fixed linker to generate code object
* removed dumping to output file and debugging log out
* fixed lint for python code
* added fault check after running linker
* removed print statement in rocm.py
* changed rocm lld linker to raise runtimeerror than emitting error log to stderr
* changed the way linker command line is pass to subprocess.popen
* removed redundant code and reuse tvm utils
* removed commented out code
* removed cloning of unused modules, and put IR into string
Aditya Atluri committed
-
- 12 Sep, 2017 1 commit
-
-
* [RUNTIME] Enable extension type to PackedFunc. * More comments
Tianqi Chen committed
-
- 11 Sep, 2017 1 commit
-
-
* [RUNTIME][RPC] Enable remote linking of device code. * fix build
Tianqi Chen committed
-
- 08 Sep, 2017 1 commit
-
-
* improved conv2d for last group of workloads * conv2d_nchw improved on 14_256_256 and 56_64_128
Leyuan Wang committed
-
- 07 Sep, 2017 1 commit
-
-
* [SCHEDULE] Enhance cache_write to enable layout change. * more tests
Tianqi Chen committed
-
- 05 Sep, 2017 1 commit
-
-
Tianqi Chen committed
-
- 03 Sep, 2017 2 commits
-
-
Tianqi Chen committed
-
Tianqi Chen committed
-
- 01 Sep, 2017 1 commit
-
-
Tianqi Chen committed
-
- 30 Aug, 2017 3 commits
-
-
Tianqi Chen committed
-
* [SCHEDULE][PASS] support storage_align of certain axis * fix lint
Tianqi Chen committed
-
Tianqi Chen committed
-
- 28 Aug, 2017 1 commit
-
-
* [CODEGEN] NVPTX backend. * Fix pylint * use fix
Tianqi Chen committed
-
- 26 Aug, 2017 1 commit
-
-
* v2: runtime support for rocm * fixed coding space errors * removed kROCM from c_runtime_api.h
Aditya Atluri committed
-
- 16 Aug, 2017 1 commit
-
-
Tianqi Chen committed
-
- 15 Aug, 2017 1 commit
-
-
[TOPI] Add broadcast and reduce operators
Xingjian Shi committed
-
- 13 Aug, 2017 2 commits
-
-
Tianqi Chen committed
-
Tianqi Chen committed
-
- 12 Aug, 2017 1 commit
-
-
Tianqi Chen committed
-
- 11 Aug, 2017 1 commit
-
-
* [PASS][FIX] Fix LiftAttrScope with if * [PASS] Fix on proc sync * fix
Tianqi Chen committed
-
- 10 Aug, 2017 1 commit
-
-
* [TEST] Upgrade gpu docker to cudnn7 * fx
Tianqi Chen committed
-