- 02 Feb, 2018 1 commit
-
-
* [BACKEND] Vulkan Runtime and SPIRV Codegen
* fix doc
Tianqi Chen committed
-
- 29 Dec, 2017 1 commit
-
-
* [SCHEDULE] enable partition const loop with build flag (#719)
* enable partition loop with build flag
* add a testcase, and modify LoopPartition related cases
* add document for split_const_loop
xqdan committed
-
- 04 Dec, 2017 1 commit
-
-
* Support rank-0 tensor
* fix lint
Tianqi Chen committed
-
- 12 Oct, 2017 1 commit
-
-
* [RUNTIME] Enable ext_dev type for quick plugin of device
* [TEST] Update testcase to cover all computation
Tianqi Chen committed
-
- 13 Sep, 2017 1 commit
-
-
* added initial llvm codegen for amdgpu
* fixed whitespace
* fixed hsaco gen from ir
* fixed targetmachine for rocm and added GetSource for rocm
* fixed whitespace issues
* changed statement to use less than 100 lines
* added intrinsics for workgroup - rocm
* whitespace - newline error fix
* fixed error msg for workitem-workgroup intrinsics
* added llvm ir dump for rocm codegen
* [ROCM] changed codegen to emit proper amdgpu kernel header
* fixed whitespace error
* fixed whitespace error- 2
* fixed AddFunction to not to use extra arg
  1. Changed AddFunctionInternal to not to take extra arg for target type
  2. Use Target from CodeGenLLVM to check for AMDGPU target
* fixed whitespaces
* fixed whitespaces 2
* fixed codegen for AMDGPU - now generating valid IR
* fixed codegen depending on code review
* reviewed alignment for amd devices
* added code to dump code object to file
* fixed cpplint errors
* print out IR after pass manager
* added code to dump asm, obj to file and std string
* fixed whitespaces
* Update codegen_amdgpu.cc
* used registry for amdgpu llvm
* Fixed whitespaces
* added code for calling linker
* fixed formatting errors
* added rocm link python interface
* fixed pylint issues and added more body to the function
* added doc string
* added doc string for module
* fixed python code after review, fixed llvm object codegen
* fixed linker to generate code object
* removed dumping to output file and debugging log out
* fixed lint for python code
* added fault check after running linker
* removed print statement in rocm.py
* changed rocm lld linker to raise runtimeerror than emitting error log to stderr
* changed the way linker command line is pass to subprocess.popen
* removed redundant code and reuse tvm utils
* removed commented out code
* removed cloning of unused modules, and put IR into string
Aditya Atluri committed
-
- 28 Aug, 2017 1 commit
-
-
* [CODEGEN] NVPTX backend.
* Fix pylint
* use fix
Tianqi Chen committed
-
- 24 Jul, 2017 1 commit
-
-
* [STORAGE][BUFFER] Support access ptr for clear access pattern.
* fix lint
Tianqi Chen committed
-
- 18 Jul, 2017 1 commit
-
-
* prefetch interface added
* prefetch python comments modified. prefetch info data structure maintained.
* start injecting prefetches. first step (domain touch) implemented.
* domain touch tested.
* Prefetch ir_mutator and ir_visitor dispatch registered.
* modify domain touched from passing a func_ref to passing a tensor
* modify domain touched from passing a func_ref to passing a tensor
* modify Tensor copy to Tensor ref
* temp commit for rebase
* debug info removed, typo fixed, ready to rebase
* prefetch flatten test add!
* roll back builtin functions to side effect functions
* lint error fixed!
* add cache line size to storage flatten argument
* forgot modifications add
* change code style to dmlc-like; get rid of can_prove, use manually compute instead
* python lint error fixed
* modify intrinsic name to pass tests
* [TEST] get rid of str(), replace them by accessing attributes
* change map to list comprehension
* redundant numpy import removed
Jian Weng committed
-
- 08 Jul, 2017 1 commit
-
-
* [RUNTIME] Add workspace pool
* fix doc
* fix the free list
* avoid zero size
Tianqi Chen committed
-
- 06 Jul, 2017 1 commit
-
-
* [CODEGEN/PASS] add restricted, alignment option
* fix lint
* Fix the alloca
Tianqi Chen committed
-
- 10 May, 2017 1 commit
-
-
* [PASS] Use likely tag & enable LoopPartition by default
* [PASS] Support thread_axis partition
* Take IfThenElse branch method
* [PASS] Insert branch at the innermost thread scope
* [PASS] Select candidates before trying to partition & add test for select
* [PASS] Clean code
* Fix
* Remove print & assert vectorize happens
ziheng committed
-
- 02 May, 2017 1 commit
-
-
* [CODEGEN/RUNTIME] Metal support, runtime improvement.
* Fix case when no device is available
Tianqi Chen committed
-
- 30 Apr, 2017 1 commit
-
-
* [CODEGEN/PASS] Improve callpacked lowering, allow pass array callback.
* fix cython
Tianqi Chen committed
-
- 28 Apr, 2017 1 commit
-
-
Tianqi Chen committed
-
- 18 Apr, 2017 1 commit
-
-
Tianqi Chen committed
-
- 16 Apr, 2017 1 commit
-
-
* [DOC] API doc organization.
* remove breathe for now
Tianqi Chen committed
-
- 15 Apr, 2017 1 commit
-
-
* [DOC] Initial doc system
* Migrate API
* Update docs
Tianqi Chen committed
-
- 09 Apr, 2017 1 commit
-
-
* [SCHEDULE] Add group, refactor thread bind api.
* fix doc
* fix g++-4.8
* More testcases
* Remove graph context from fix pt analysis
Tianqi Chen committed
-
- 05 Mar, 2017 1 commit
-
-
* [IterVar/REFACTOR] Add types to IterVar
* [ARITH/REFACTOR] Move IntSet to include
* [REFACTOR/OP] Move Op detail to separate folder.
* fix test
Tianqi Chen committed
-
- 26 Feb, 2017 1 commit
-
-
Tianqi Chen committed
-
- 24 Feb, 2017 1 commit
-
-
Tianqi Chen committed
-
- 22 Feb, 2017 1 commit
-
-
* [LLVM] Initial support for codegen LLVM.
* Fix the naming issue of codegen
Tianqi Chen committed
-
- 04 Feb, 2017 1 commit
-
-
Tianqi Chen committed
-
- 02 Feb, 2017 1 commit
-
-
Tianqi Chen committed
-
- 31 Jan, 2017 2 commits
-
-
[TEST/PYTHON] Add unittest folder, add a build pipeline. Rename Buffer.ptr to Buffer.data to be consistent with Array. (#29)
Tianqi Chen committed
-
* [CODEGEN] CUDA/OPENCL pipeline complete
* Hide TVMType by str in frontend
Tianqi Chen committed
-