- 23 Mar, 2020 8 commits
-
-
* relay Node::make to constructor * patternwildcard * Address comments
Zhi committed -
Tianqi Chen committed
-
* Improve TF control flow major logic
* Pass mod into operator convert function
* Fix LoopBound
* Add more control flow tests
* Add two test cases for stridedslice
* Fix docstring
* Fix lint
* Fix import
* Fix test assert
* Minor fix conv3d
* Add more comments
* Fix for dilation2d
* Change newly added atan
* Change newly added unravel
Yao Wang committed -
[Bugfix] Fixed bug where shifting by out-of-bounds value results in no compute code being emitted. (#5115)
* Fixed bug where shifting by out-of-bounds RHS values causes LLVM to generate no code. Added regression testcase
* Updated testcase to be more precise.
* Fixed testcase
pankratz committed -
* isfinite doc update
* isfinite expr
* isfinite expr
* isfinite schedule reg
* isfinite python binding
* isfinite python binding
* relay register isfinite
* isfinite type relation
* intrin isfinite
* topi isfinite
* testcase topi isfinite
* tf frontend isfinite
* tf frontend isfinite testcase
* test case relay isfinite
* small fixes
* test forward tf isfinite
* test cases injective for cuda
* remove float16 test case
* add support for isinf
* remove unwanted import
* fix conflict
Mahesh Ambule committed -
* first cut unravel_index
* merge fixes
* change rates to dilations
* unravel_index op relay, topi, mxnet, tf
* doc changes
* small changes
* remove empty unravel and argwhere attrs
* remove empty unravel and argwhere attrs
Mahesh Ambule committed -
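For readers unfamiliar with the operator named in the commit above, here is a minimal pure-Python sketch of what an `unravel_index` operation computes. It assumes the usual row-major (NumPy-style) convention; the commit message itself does not spell out the semantics, and this is not the TVM implementation. The function and parameter names are illustrative only.

```python
def unravel_index(flat_index, shape):
    """Convert a flat row-major index into per-axis coordinates.

    Illustrative sketch only (assumes NumPy-style row-major layout);
    not the Relay/TOPI implementation referenced in the commit.
    """
    if flat_index < 0:
        raise ValueError("flat_index must be non-negative")
    coords = []
    remaining = flat_index
    # Walk the axes from innermost to outermost, peeling off one
    # coordinate per axis with a modulo and an integer division.
    for dim in reversed(shape):
        coords.append(remaining % dim)
        remaining //= dim
    # Anything left over means the index did not fit in the shape.
    if remaining != 0:
        raise ValueError("flat_index is out of range for the given shape")
    return tuple(reversed(coords))
```

Under that assumption, `unravel_index(22, (7, 6))` yields the coordinate pair for row 3, column 4, since 22 = 3 * 6 + 4.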
* [DOCS] Cleanup docs before rebuild * Ask doxygen to generate svg to minimize the file size
Tianqi Chen committed -
Neo Chien committed
-
- 22 Mar, 2020 5 commits
-
-
- This allows emitting vectorized loads/stores for CUDA math intrinsics.
- A few intrinsics should be lowered as CUDAMath ones, not CUDAFastMath ones.
- Fixed the code block indentation.
Wei Pan committed -
Haichen Shen committed
-
* Fix up the final pieces * Tweak build.rs
Jared Roesch committed -
Tianqi Chen committed
-
* [DOCS] include a tarball of docs during deployment * [DOCS] Add a short security faq
Tianqi Chen committed
-
- 21 Mar, 2020 3 commits
-
-
* [KERAS]Conv3d support added * Keras conv3d testcase added
Samuel committed -
* Update relay docs * any -> py:func * make clean
Zhi committed -
Yagna Srinath Reddy Battula committed
-
- 20 Mar, 2020 8 commits
-
-
* add Atan/Atan2 op * fix bug and testing
Andrew Liu committed -
masahi committed
-
* bind constant to subgraphs * con -> constant
Zhi committed -
* fix CompilerAttrs * retrigger ci
Zhi committed -
* [TOPI][OP] Use Thrust sort for argsort and topk
  The current GPU sort implementation (odd-even transposition sort) is too slow when the number of elements is large. This PR introduces a Thrust implementation of sort, which is much faster. Note that this change requires CMake 3.8 or later, since nvcc is needed to compile Thrust code.
* cmake: make CUDA optional
* allow .cu files to be added to the repository
* pylint fix and cleanup
* require cmake 3.8 only when thrust is enabled
* fix nvcc compiler error when passing -pthread
* add missing include
* add USE_THRUST option in config.cmake
* retrigger CI
* retrigger CI
MORITA Kazutaka committed -
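To illustrate why the commit above calls odd-even transposition sort slow for large inputs, here is a sequential Python sketch of the algorithm. It is not the GPU kernel from the repository; the function name is hypothetical. On a GPU the compare-and-swap steps within one phase are independent and can run in parallel, but the algorithm still needs up to n phases for n elements, so the total work grows quadratically.

```python
def odd_even_transposition_sort(values):
    """Sort a sequence with odd-even transposition sort.

    Sequential sketch for illustration only; a GPU version would run the
    compare-and-swap steps of each phase in parallel.
    """
    data = list(values)
    n = len(data)
    # n phases are sufficient to sort n elements.
    for phase in range(n):
        # Even phases compare pairs (0,1), (2,3), ...;
        # odd phases compare pairs (1,2), (3,4), ...
        start = phase % 2
        for i in range(start, n - 1, 2):
            if data[i] > data[i + 1]:
                data[i], data[i + 1] = data[i + 1], data[i]
    return data
```

Because each phase touches about n/2 pairs and n phases are required, the sketch performs on the order of n² comparisons, which is consistent with the commit's motivation for switching to a library sort.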
* Temporary fix to the stack overflow issue in autotvm task extraction * fix lint * fix graph tuner test
Haichen Shen committed -
yongfeng-nv committed
-
As part of the unified IR refactor, this PR refactors the target codegen to use IRModule containing tir::PrimFuncs. To break the refactor into several steps without breaking the codebase, we built a conversion pass that converts Array<LoweredFunc> into IRModule. Follow-up refactors will gradually move more passes onto IRModule until all passes are covered. Then we can remove the redundant concepts such as LoweredFunc.
Tianqi Chen committed
-
- 19 Mar, 2020 4 commits
-
-
* fix minor lint issue
* add conv3d and adaptive avg pool3d conversion with test
* fix max pool handling
* add batch norm 3d test
* add resnet 3d test
* add more conv3d test
* clean up batch norm test
* add note on disabling inception v3 test
* add more tests
* add more tests
* fix names
masahi committed -
* [DOC] Add doc for Relay op strategy * update * address more comments * update * update
Haichen Shen committed -
* [Relay][Frontend][ONNX] operator support: NonZero * update * Solve the build fail * solve the build fail * Replace ctx_list with tvm.cpu()
Neo Chien committed -
* [ConvertLayout] Support QNN ops.
* Changing layouts to C.
* Fixing dilation.
* Empty commit.
Co-authored-by: Ubuntu <ubuntu@ip-172-31-53-55.us-west-2.compute.internal>
Animesh Jain committed
-
- 18 Mar, 2020 7 commits
-
-
Tianqi Chen committed
-
* add missing upcast to avgpool * add avg pool test
masahi committed -
Change-Id: I015b2c66a50b64d0eb2e9efe336f6c18ea1fdc67
lhutton1 committed -
Samuel committed
-
* [CODEGEN][OPENCL] Explicitly cast min/max operands * retrigger CI
MORITA Kazutaka committed -
Zhi committed
-
Zhi committed
-
- 17 Mar, 2020 4 commits
-
-
Tianqi Chen committed
-
Tianqi Chen committed
-
* refactor relay python * revert relay/ir/*.py to relay * Address comments * remove direct access to analysis and transform namespace
Zhi committed -
* update docs for dilation 2d
* dilation2d compute
* dilation2d register
* dilation2d rel compute
* dilation2d strategy
* dilation2d attrs
* dilation2d generic schedule
* dilation2d tf frontend support
* dilation2d tf frontend test case
* dilation2d test cases
* pylint fixes
* add exception for cuda target
* Update docstring
* Update docstring
* change rates to dilations
* removed unused param
* merge master
* Update nn.py
* Update nn.py
Mahesh Ambule committed
-
- 16 Mar, 2020 1 commit
-
-
* [TFLITE]DepthToSpace and SpaceToDepth op parser support * DepthToSpace and SpaceToDepth testcases * Review comments fixed
Samuel committed
-