- 23 Sep, 2017 1 commit
-
-
* [TEST] rfactor+ewise, cite rfactor paper * include all authors via abbrv * [TOPI] Add transpose * fix lint
Tianqi Chen committed
-
- 22 Sep, 2017 1 commit
-
-
* [INTRIN] Enable pow * rename pow->power * fix
Tianqi Chen committed
-
- 13 Sep, 2017 1 commit
-
-
* added initial llvm codegen for amdgpu * fixed whitespace * fixed hsaco gen from ir * fixed targetmachine for rocm and added GetSource for rocm * fixed whitespace issues * changed statement to use less than 100 lines * added intrinsics for workgroup - rocm * whitespace - newline error fix * fixed error msg for workitem-workgroup intrinsics * added llvm ir dump for rocm codegen * [ROCM] changed codegen to emit proper amdgpu kernel header * fixed whitespace error * fixed whitespace error- 2 * fixed AddFunction to not to use extra arg 1. Changed AddFunctionInternal to not to take extra arg for target type 2. Use Target from CodeGenLLVM to check for AMDGPU target * fixed whitespaces * fixed whitespaces 2 * fixed codegen for AMDGPU - now generating valid IR * fixed codegen depending on code review * reviewed alignment for amd devices * added code to dump code object to file * fixed cpplint errors * print out IR after pass manager * added code to dump asm, obj to file and std string * fixed whitespaces * Update codegen_amdgpu.cc * used registry for amdgpu llvm * Fixed whitespaces * added code for calling linker * fixed formatting errors * added rocm link python interface * fixed pylint issues and added more body to the function * added doc string * added doc string for module * fixed python code after review, fixed llvm object codegen * fixed linker to generate code object * removed dumping to output file and debugging log out * fixed lint for python code * added fault check after running linker * removed print statement in rocm.py * changed rocm lld linker to raise runtimeerror than emitting error log to stderr * changed the way linker command line is pass to subprocess.popen * removed redundant code and reuse tvm utils * removed commented out code * removed cloning of unused modules, and put IR into string
Aditya Atluri committed
-
- 01 Sep, 2017 1 commit
-
-
Tianqi Chen committed
-
- 28 Aug, 2017 1 commit
-
-
* [CODEGEN] NVPTX backend. * Fix pylint * use fix
Tianqi Chen committed
-
- 24 Jul, 2017 1 commit
-
-
* [STORAGE][BUFFER] Support access ptr for clear access pattern. * fix lint
Tianqi Chen committed
-
- 18 Jul, 2017 1 commit
-
-
* prefetch interface added * prefetch python comments modified. prefetch info data structure maintained. * start injecting prefetches. first step (domain touch) implemented. * domain touch tested. * Prefetch ir_mutator and ir_visitor dispatch registered. * modify domain touched from passing a func_ref to passing a tensor * modify domain touched from passing a func_ref to passing a tensor * modify Tensor copy to Tensor ref * temp commit for rebase * debug info removed, typo fixed, ready to rebase * prefetch flatten test add! * roll back builtin functions to side effect functions * lint error fixed! * add cache line size to storage flatten argument * forgot modifications add * change code style to dmlc-like; get rid of can_prove, use manually compute instead * python lint error fixed * modify instrinsic name to pass tests * [TEST] get rid of str(), replace them by accessing attributes * change map to list comprehension * redundant numpy import removed
Jian Weng committed
-
- 14 Jul, 2017 1 commit
-
-
* [REFACTOR] collections->container, RPC returns func, time_evaluator returns struct * fix executor
Tianqi Chen committed
-
- 06 Jul, 2017 1 commit
-
-
* [CODEGEN/PASS] add restricted, alignment option * fix lint * Fix the alloca
Tianqi Chen committed
-
- 30 Jun, 2017 1 commit
-
-
* Add NNPack Fully Connected Inference * Add NNPack fully_connected_output * Fix lint * Fix
ziheng committed
-
- 11 Jun, 2017 1 commit
-
-
* Support for batch ComputeOp * Support for batch ComputeOp * Fix CrossThreadReduction * Fix lint * Add UpdateArray, remove support for batch reduce * Tuple input support for reduce * rfactor works with multiple reducer; support multiple reducers with different types * Small fix * Small fix * Change return type of rfactor to Array<Expr> * Fix lint * Improve * Add tutorial * Improve tutorial * Improve tutorial
ziheng committed
-
- 03 Jun, 2017 1 commit
-
-
* [PERF/TIMER] Add buildin timing logic * fix lint
Tianqi Chen committed
-
- 02 Jun, 2017 1 commit
-
-
Tianqi Chen committed
-
- 24 May, 2017 1 commit
-
-
Tianqi Chen committed
-
- 22 May, 2017 1 commit
-
-
Tianqi Chen committed
-
- 10 May, 2017 1 commit
-
-
* [PASS] Use likely tag & enable LoopPartition by default * [PASS] Support thread_axis partition * Take IfThenElse branch method * [PASS] Insert branch at the innermost thread scope * [PASS] Select candidates before trying to partition & add test for select * [PASS] Clean code * Fix * Remove print & assert vectorize happens
ziheng committed
-
- 09 May, 2017 1 commit
-
-
* [MODULE/DSO] Support pack everything into one shared library. * fix osx load
Tianqi Chen committed
-
- 08 May, 2017 1 commit
-
-
Tianqi Chen committed
-
- 05 May, 2017 1 commit
-
-
* [CONTRIB/BLAS] Add CBLAS Example to contrib * Update makefile
Tianqi Chen committed
-
- 02 May, 2017 1 commit
-
-
* [CODEGEN/RUNTIME] Metal support, runtime improvement. * Fix case when no device is available
Tianqi Chen committed
-
- 28 Apr, 2017 1 commit
-
-
Tianqi Chen committed
-
- 26 Apr, 2017 1 commit
-
-
* [LANG] CommReducer * Reorganize c_api * Remove InitValue and Combine; refactor Functor * Make CommReducer an Expr * Make comm_reducer type independent * Make CommReducerNode a Node * Small fix * Refine * Refine front api; add integration testcases for min/max * Fix python * Refine * Fix lint and add example
ziheng committed
-
- 22 Apr, 2017 1 commit
-
-
* [LANG/CODEGEN] Intrinsics and Extern Math * fix lint
Tianqi Chen committed
-
- 18 Apr, 2017 1 commit
-
-
Tianqi Chen committed
-
- 16 Apr, 2017 1 commit
-
-
* [DOC] API doc organization. * remove breathe for now
Tianqi Chen committed
-
- 15 Apr, 2017 1 commit
-
-
* [DOC] Initial doc system * Migrate API * Update docs
Tianqi Chen committed
-
- 11 Apr, 2017 1 commit
-
-
Tianqi Chen committed
-
- 09 Apr, 2017 1 commit
-
-
* [SCHEDULE] Add group, refactor thread bind api. * fix doc * fix g++-4.8 * More testscase * Remove graph context from fix pt analysis
Tianqi Chen committed
-
- 01 Apr, 2017 1 commit
-
-
* [LANG/GPU] Cross Thread Reduction. * Fix doxygen error * Upgrade verilog testcase to new one
Tianqi Chen committed
-
- 29 Mar, 2017 1 commit
-
-
Tianqi Chen committed
-
- 26 Mar, 2017 1 commit
-
-
* [CODEGEN] Refactor common codegen, Verilog Codegen * fix make * fix mk * update enable signal * change function name to at neg edge * Move test to correct place
Tianqi Chen committed
-
- 05 Mar, 2017 1 commit
-
-
* [IterVar/REFACTOR] Add types to IterVar * [ARITH/REFACTOR] Move IntSet to include * [REFACTOR/OP] Move Op detail to seperate folder. * fix test
Tianqi Chen committed
-
- 03 Mar, 2017 1 commit
-
-
Tianqi Chen committed
-
- 27 Feb, 2017 1 commit
-
-
Tianqi Chen committed
-
- 26 Feb, 2017 1 commit
-
-
Tianqi Chen committed
-
- 24 Feb, 2017 1 commit
-
-
Tianqi Chen committed
-
- 20 Feb, 2017 1 commit
-
-
Tianqi Chen committed
-
- 17 Feb, 2017 1 commit
-
-
Tianqi Chen committed
-
- 14 Feb, 2017 1 commit
-
-
Tianqi Chen committed
-
- 09 Feb, 2017 1 commit
-
-
Tianqi Chen committed
-