- 19 Jan, 2018 1 commit
-
-
* fix upsampling output shape
* simplify expr in get_const_tuple
masahi committed
-
- 16 Jan, 2018 5 commits
-
-
* add basic x86 schedules
* parallelize & vectorize batchnorm + relu
* fuse conv into bn + relu
* move rc loop to outer
* add nhwc conv
* change weight layout to hwcf
* conv + bn + relu fusion for nhwc conv
* fix conv_nhwc schedule when no fusion
* clean up default parallel schedules
* simplify elemwise parallel
* fix elemwise parallel for batch == 1
* update nhwc conv test
* fix and add comment
* fix lint
* remove redundant import
* remove default multithreading for some ops
* remove default multithreading for global pool
masahi committed
-
Lianmin Zheng committed
-
Xingjian Shi committed
-
* add schedule for ARM Mali GPU
* fix lint
* fix lint
Lianmin Zheng committed
-
* support more argument type in depthwise_conv2d
* mark all pointer as 'restrict' & fix vector conversion for opencl
Lianmin Zheng committed
-
- 15 Jan, 2018 1 commit
-
-
try to fix fix
Xingjian Shi committed
-
- 12 Jan, 2018 1 commit
-
-
* [LLVM] Enable same target option in JITModule
* not set mcpu explicitly
Tianqi Chen committed
-
- 11 Jan, 2018 1 commit
-
-
* add upsampling cpu op
* add upsampling gpu schedule
* add doc for upsampling op add more doc
* cleanup upsampling test
* add doc
* fix lint
* fix lint
* fix lint
* remove unused import
* remove skimage dependency
* remove skimage import
* remove schedule_upsampling
masahi committed
-
- 04 Jan, 2018 1 commit
-
-
Yizhi Liu committed
-
- 03 Jan, 2018 1 commit
-
-
* rocblas integration
* fix include
* fix lint
masahi committed
-
- 02 Jan, 2018 1 commit
-
-
* add cublas support
* integrate cublas to topi dense
* add cublas error check
* minor fix
* fix lint
* remove topi import from contrib unittest
masahi committed
-
- 29 Dec, 2017 1 commit
-
-
* use cudnn findalgo to choose the best algo
* fix lint
masahi committed
-
- 27 Dec, 2017 2 commits
-
-
* support dim-0 tensor in topi ops revert transform
* revert
Xingjian Shi committed
-
* add target.libs to target str representation
* integrate cudnn into topi cuda
* append target.libs to target.options
masahi committed
-
- 26 Dec, 2017 1 commit
-
-
* add extern schedule for miopen
* fix comment
* optionally dispatch to miopen from topi
* fix lint
* check if current target is None
* use generic dispatch for rocm conv2d
* fix lint
* fix workspace bug
* remove blank line
* remove blank line
* remove blank line
masahi committed
-
- 25 Dec, 2017 1 commit
-
-
* add x86_64 target
* add binary dense operator
* rebase
* improve schedule
* remove x86 target
* improve schedule
Yuwei Hu committed
-
- 04 Dec, 2017 1 commit
-
-
* Support rank-0 tensor
* fix lint
Tianqi Chen committed
-
- 27 Nov, 2017 1 commit
-
-
ziheng committed
-
- 25 Nov, 2017 1 commit
-
-
* [PASS] Allow compact checking when strides is available
* remove assert compact
Tianqi Chen committed
-
- 19 Nov, 2017 1 commit
-
-
* scheduler tweaked for super resolution perf
* conv2d_transpose schedule error fixed
* nnvm issue #239 fixed
Leyuan Wang committed
-
- 16 Nov, 2017 1 commit
-
-
* scheduler tweaked for super resolution perf
* lint error fixed
* lint error fixed
* conv2d_transpose schedule error fixed
Leyuan Wang committed
-
- 14 Nov, 2017 2 commits
-
-
* [TOPI] Add out_dtype argument for conv2d; Add x86 schedules
* Fix
* Fix lint
* Fix
ziheng committed
-
* conv2d perf improved for conv2d_56_64_128, test name added to differentiate workloads
* fix lint error
Leyuan Wang committed
-
- 13 Nov, 2017 1 commit
-
-
* Make the tvm conda package build with in-place source and use cmake from conda.
* Add a package for topi.
abergeron committed
-
- 09 Nov, 2017 1 commit
-
-
Yizhi Liu committed
-
- 08 Nov, 2017 1 commit
-
-
Leyuan Wang committed
-
- 06 Nov, 2017 1 commit
-
-
Yuwei Hu committed
-
- 03 Nov, 2017 1 commit
-
-
Yuwei Hu committed
-
- 30 Oct, 2017 1 commit
-
-
Leyuan Wang committed
-
- 27 Oct, 2017 1 commit
-
-
Tianqi Chen committed
-
- 26 Oct, 2017 1 commit
-
-
masahi committed
-
- 25 Oct, 2017 1 commit
-
-
Yuwei Hu committed
-
- 23 Oct, 2017 1 commit
-
-
* update topi/cuda schedules to use target.max_num_threads
* allow num_thread to be larger than cuda.max_num_threads
* remove get_max_num_threads and make it inline
masahi committed
-
- 22 Oct, 2017 1 commit
-
-
Tianqi Chen committed
-
- 15 Oct, 2017 1 commit
-
-
Tianqi Chen committed
-
- 14 Oct, 2017 2 commits
-
-
* [TVM] Introduce target generic dispatch system
* fix target warning
Tianqi Chen committed
-
masahi committed
-
- 13 Oct, 2017 2 commits