- 19 Jan, 2018 3 commits
-
-
* fix upsampling output shape * simplify expr in get_const_tuple
masahi committed -
* Add Mali target support to tvm.target.create * Add Mali target support in codegen
Jammy Zhou committed -
The type of parameter options should be a str list.
solin319 committed
-
- 16 Jan, 2018 5 commits
-
-
* add basic x86 schedules * parallelize & vectorize batchnorm + relu * fuse conv into bn + relu * move rc loop to outer * add nhwc conv * change weight layout to hwcf * conv + bn + relu fusion for nhwc conv * fix conv_nhwc schedule when no fusion * clean up default parallel schedules * simplify elemwise parallel * fix elemwise parallel for batch == 1 * update nhwc conv test * fix and add comment * fix lint * remove redundant import * remove default multithreading for some ops * remove default multithreading for global pool
masahi committed -
Lianmin Zheng committed
-
Xingjian Shi committed
-
* add schedule for ARM Mali GPU * fix lint * fix lint
Lianmin Zheng committed -
* support more argument type in depthwise_conv2d * mark all pointer as 'restrict' & fix vector conversion for opencl
Lianmin Zheng committed
-
- 15 Jan, 2018 2 commits
-
-
try to fix fix
Xingjian Shi committed -
Aman committed
-
- 12 Jan, 2018 1 commit
-
-
* [LLVM] Enable same target option in JITModule * not set mcpu explicitly
Tianqi Chen committed
-
- 11 Jan, 2018 2 commits
-
-
* add upsampling cpu op * add upsampling gpu schedule * add doc for upsampling op add more doc * cleanup upsampling test * add doc * fix lint * fix lint * fix lint * remove unused import * remove skimage dependency * remove skimage import * remove schedule_upsampling
masahi committed -
Yuwei Hu committed
-
- 10 Jan, 2018 1 commit
-
-
Tianqi Chen committed
-
- 09 Jan, 2018 2 commits
-
-
* small fixs on docs * add IR output after parallelization
Yida Wang committed -
* [PASS] Improve loop partition to remove un-necessary warning. * fix comment
Tianqi Chen committed
-
- 08 Jan, 2018 2 commits
-
-
* Improve opt_gemm tutorial * Addressed comments
yuruofeifei committed -
* [PASS] StorageRewrite Fold Inplace op storage when possible * update comment to fix typos
Tianqi Chen committed
-
- 07 Jan, 2018 1 commit
-
-
* [SCHEDULE]enable partition const loop with build flag (#719) * enable partition loop with build flag * add a testcase, and modify LoopPartition related cases * * add document for split_const_loop * [IRbuild]Support automatically Name Loop Variable in IRBuilder (#719) * add idx_num in class * using typical index [i, j, k] first, then i_suffix * keep inputs names * fix lint * improve comment of name * fix lint * [SCHEDULE]Improve bound deduce for loop partition (#743) * add divided checking when deducing * related testcase * fix * * transform LE and GE first * remove is_equal * modify testcase for edge cases checking * * fix comment * * fix lint * * apply transformation form LT -> LE, GT -> GE * * fix lint * simplify code and testcase * add negative co-efficient case * More complicated cases * add testcase * simplify testcase * comment case for now * fix testcase
xqdan committed
-
- 04 Jan, 2018 2 commits
-
-
* [CODEGEN] use charp for voidp * fx
Tianqi Chen committed -
Yizhi Liu committed
-
- 03 Jan, 2018 3 commits
-
-
* rocblas integration * fix include * fix lint
masahi committed -
* modified schedule_dataflow_rewrite.cc to fix losing tensor problem * modified schedule_dataflow_rewrite.cc for lint scan * modified schedule_dataflow_rewrite.cc for lint scan * using tensor's value_index to index output of stage op
libing4752 committed -
* [CODEGEN] update codegen for vector operation * update comment, fix for metal * fix some bugs in codegen * use 'restrict' in every argument * fix * fix
Lianmin Zheng committed
-
- 02 Jan, 2018 1 commit
-
-
* add cublas support * integrate cublas to topi dense * add cublas error check * minor fix * fix lint * remove topi import from contrib unittest
masahi committed
-
- 31 Dec, 2017 2 commits
-
-
* [SCHEDULE]enable partition const loop with build flag (#719) * enable partition loop with build flag * add a testcase, and modify LoopPartition related cases * * add document for split_const_loop * [IRbuild]Support automatically Name Loop Variable in IRBuilder (#719) * add idx_num in class * using typical index [i, j, k] first, then i_suffix * keep inputs names * fix lint * improve comment of name * fix lint
xqdan committed -
Tianqi Chen committed
-
- 29 Dec, 2017 3 commits
-
-
* [SCHEDULE]enable partition const loop with build flag (#719) * enable partition loop with build flag * add a testcase, and modify LoopPartition related cases * * add document for split_const_loop
xqdan committed -
* use cudnn findalgo to choose the best algo * fix lint
masahi committed -
* when there is no intrin func, using body for initialization. For issue 714. * Refine code per review comments, and add a test case. * Fix lint issues. * Re-organize the tensorize test cases, and add a new case for none-reset mode. * Fix a typo. * Delete the unit case because merged it into test_schedule_tensorize.py already.
kun-zh committed
-
- 27 Dec, 2017 3 commits
-
-
* when there is no intrin func, using body for initialization. For issue 714. * Refine code per review comments, and add a test case. * Fix lint issues.
kun-zh committed -
* support dim-0 tensor in topi ops revert transform * revert
Xingjian Shi committed -
* add target.libs to target str representation * integrate cudnn into topi cuda * append target.libs to target.options
masahi committed
-
- 26 Dec, 2017 2 commits
-
-
* add extern schedule for miopen * fix comment * optionally dispatch to miopen from topi * fix lint * check if current target is None * use generic dispatch for rocm conv2d * fix lint * fix workspace bug * remove blank line * remove blank line * remove blank line
masahi committed -
Tianqi Chen committed
-
- 25 Dec, 2017 1 commit
-
-
* add x86_64 target * add binary dense operator * rebase * improve schedule * remove x86 target * improve schedule
Yuwei Hu committed
-
- 24 Dec, 2017 3 commits
-
-
* fist working miopen support * do FindFwdAlgo during build time * fix lint * update doc string * import topi after checking if rocm is enabled * add miopen namespace * fixed descriptor overwrite bug * add use_miopen option * fix lint * better miopen option handling * fix typo * fix options handling
masahi committed -
* [CODEGEN] update codegen for vector operation * update comment, fix for metal
Lianmin Zheng committed -
Tianqi Chen committed
-
- 23 Dec, 2017 1 commit
-
-
* Make duplicated function name checker working * Fix dependency checking problem for reducer condition (#712); add test * Fix dependency checking problem for reducer condition (#712); add test * Specify R to be computed inlined
Cody Hao Yu committed
-