- 22 Aug, 2017 3 commits
-
-
Tianqi Chen committed
-
* softmax migrated and test added * pylint error fixed * pylint error fixed
Leyuan Wang committed -
Tianqi Chen committed
-
- 21 Aug, 2017 2 commits
-
-
* modify depthconv padding * fix shared memory overflow in depthconv schedule
Yuwei HU committed -
* update installation guide of windows * update installation doc of windows
Xingjian Shi committed
-
- 20 Aug, 2017 6 commits
-
-
Tianqi Chen committed
-
ziheng committed
-
Add install prerequisites of customized building
Shuai Yuan committed -
* [BUILD][LLVM] Support LLVM mainline 5.0 6.0 * Reduce parallelism
Tianqi Chen committed -
ziheng committed
-
Aditya Atluri committed
-
- 19 Aug, 2017 1 commit
-
-
Yuwei HU committed
-
- 18 Aug, 2017 2 commits
-
-
* conv_nchw parameter updated to the one which generates mobilenet benchmarks, doc typo fixed * removed unused variables
Leyuan Wang committed -
Yuwei HU committed
-
- 17 Aug, 2017 6 commits
-
-
Haichen Shen committed
-
Tianqi Chen committed
-
Tianqi Chen committed
-
Tianqi Chen committed
-
William Moses committed
-
William Moses committed
-
- 16 Aug, 2017 3 commits
-
-
Tianqi Chen committed
-
* [NNPack] Support for threadpool * fix lint * fix lint * Use static class function
ziheng committed -
* rename the nchw and pass the unit test; going to do it for nhwc depthwise * bug with fusion * nchw works fine; nhwc float32 problem remains * still cannot bind them together * fusion works * syntax fix * all bugs fixed; test cases pass * minor fix on nn.h
wetliu committed
-
- 15 Aug, 2017 11 commits
-
-
* [Contrib] CuDNN v7 Support * Add test
ziheng committed -
Tianqi Chen committed
-
Tianqi Chen committed
-
Tianqi Chen committed
-
* __init__ updated * pull request updated * build_module added * typo fixed * another typo fixed * conv2d gpu scheduler for two layouts moved to tvm * changes made according to CR * conv2d_nchw formating updated, conv2d_hwcn tests updated * lint error fixed * element wise operator schedule fusing fixed for conv2d * conv2d_nchw topi test added, all resnet workloads now pass * conv compute lint error fixed * fixed python 3 compatibility problem * conv2d tensor input support added, test typo fixed, ir_pass.Simplify changed to util.get_const_int * fixed channel numer < 4 error, also made sure other splitting factor woudn't be 0
Leyuan Wang committed -
* [TOPI] Add ops compute Remove 'compute' and add assert for safety Add document fix lint fix softmax * fix batch norm
ziheng committed -
Tianqi Chen committed
-
Yuwei HU committed
-
* [TOPI] Move ewise.h -> elemwise.h * fix test
Tianqi Chen committed -
[TOPI] Add broadcast and reduce operators
Xingjian Shi committed -
Tianqi Chen committed
-
- 14 Aug, 2017 5 commits
-
-
Nicolas Vasilache committed
-
* add dilation operators * fix pylint * dilate testcases success * n-D tensor dilation * support arbitrary dimension
Yuwei HU committed -
* [DOC] Include TOPI in doxygen * update
Tianqi Chen committed -
* __init__ updated * pull request updated * build_module added * typo fixed * another typo fixed * conv2d gpu scheduler for two layouts moved to tvm * changes made according to CR * conv2d_nchw formating updated, conv2d_hwcn tests updated * lint error fixed * element wise operator schedule fusing fixed for conv2d * conv2d_nchw topi test added, all resnet workloads now pass * conv compute lint error fixed * fixed python 3 compatibility problem * conv2d tensor input support added, test typo fixed, ir_pass.Simplify changed to util.get_const_int
Leyuan Wang committed -
* [TOPI] Move topi.nn.util to topi.util * update the path
Tianqi Chen committed
-
- 13 Aug, 2017 1 commit
-
-
* [WIP] C++ topi contributions Summary: This diff implements C++ topi contributions for: - relu with parametrix threshold - pad with generic padBefore / padAfter specification - matmult with transposes - conv2d_nchw, conv2d_hwcn with runtime constant padding and strides - depthwise_conv2d_nchw with runtime constant padding and strides - group_conv2d_ngchw with runtime constant padding and strides - broadcast_to a broadcastable shape - broadcast_bop where bop is an usual binary op (+ - * / %) Convolution padding is implemented using the pad operation. To avoid extra memory consumption, it is generally recommended to inline the padding with the autoinliner. Unfortunately in its current form the elemwise checks are too restrictive to allow inlining. So this diff also proposes an extension to LHS injective (i.e. no reduction axis in the current IR design) Test Plan: Tested in C++ testsuite in a separate repository, I am looking for suggestions to quickly spin up some tests for tvm. Reviewers: tqchen Subscribers: Tasks: Tags: Blame Revision: * Review + Lint + GSG C++
Nicolas Vasilache committed
-