- 28 Mar, 2020 3 commits
-
-
* [NODE][IR] Introduce StructuralEqual Infra for the Unified IR. This PR introduces a new way to handle structural equality for both TIR and relay nodes in an extensive way. - Each object can now register an optional SEqualReduce function, which describes how to reduce its structural equality to another instance into equality of the children. - Optionally, the object can choose to allow remapping of vars(e.g. function parameters) by calling DefEqual - We implemented a non-recursive structural equality checker that recursively traverses the objects and does the structural equality checking. This PR also fixes a few potential problems in previous relay's AlphaEqual. - In particular, the new structural equality relation will be communicative. - It is can be dangerous to use same_as relation to quickly check equality, demonstrated by the following case. (%x, %y) are shared vars between two functions. - function0: fn (%x, %y) { %x + %y } - function1: fn (%y, %x) { %x + %y } The new structural equal is intented to supersede AlphaEqual and AttrsEqual. Follow-up PRs should be performed to redirect the existing usages, and removes the corresponding implementation. * Update the rule to distinguish between graph node and non-graph nodes. * Refactor the test cases to use structural equal. * address comments * Mark more relay::Expr as graph node, fix a testcase issue(was bug that was not caught by previous alpha equal) * Remove unrelated comment * Fix file comment * Address review comment * Relax condition to fit flaky case
Tianqi Chen committed -
* Fixed conv transpose parsing. * small format change. * Chage test module names. * Simplified test syntax.
Josh Fromm committed -
* Adding support for QNN subtract op * Fixing typo. * Fixing typo. * Fixing lint. * Addressing review comments. * Renaming variables as per convention and renamed QnnBinaryOpTypes -> QnnBinaryOpType * Renaming QnnBinaryOpType to QnnBinaryOpTensorType which now takes the index you want to extract to make the code more readable. * Fixing lint. * Moving common code to macro. * Fixing alignment. * Fixing typo. * Fixing lint. * Renaming method to pass CI.
shoubhik committed
-
- 27 Mar, 2020 3 commits
-
-
* [TOPI][Tensor Core] Optimization of CNNs on Tensor Core #6004 * update conv2d test * # pylint: dense_tensorcore.py * modify * modify conv2d * modify the unclear comment,add shape assertion in conv2d compute,combine general gemm intrinsic * add shape assertion in conv2d compute, combine general gemm intrinsic Co-authored-by: libaihong <libaihong@inspur.com> Co-authored-by: libaihong <61525430+libaihong@users.noreply.github.com>
Shawn-Inspur committed -
'fannotate' in the annotate_target pass was designated as static. This meant that if you use the pass to annotate more than one codegen, its value is not updated when the target changes resulting in incorrect annotation. Change-Id: Ib4f3af5cfbef44f29771818219755198ac313a0e
mbaret committed -
* Support TupleGetItemNode in body of pattern only * Add bn_relu test case for MergeComposite with TupleGetItem * formatting * TupleGetItemNode::make -> TupleGetItem()
Trevor Morris committed
-
- 26 Mar, 2020 6 commits
-
-
* crt error handling * Review comments fixed
Samuel committed -
* register for fast_exp and fast_tanh * Add unit test for fast math * Add unit test for op fast math * Add unit test for op fast math * Add unit tests to guard registering topi schedule for Relay fast_exp and fast_tanh * Fix ident * Fix the indent * Add fast_tanh in the test_fastmath of topi tests
Selo1412 committed -
* [Doc] TVM release process * fix tag * remove things not apply
Yizhi Liu committed -
* [RELAY] Added an AnnotatedRegionSet utility class In many of the passes involved in graph partitioning, we need to extract and manipulate annotated regions. This class simplifies the extraction of regions from a relay expression containing region begin and end annotations as well as providing utility functions to query these regions and merge them. Co-authored-by: Ramana Radhakrishnan <ramana.radhakrishnan@arm.com> Change-Id: Ia912fea0b99f64b6a7197aa6da2347e58f469fbb * Rename fix * Update MakeRegions * Fix __init__ * Indentation * Code style * Remove 'Region' from docs * Overload [] to get region * Use src/dest for MergeRegions * Simplify merge * Tidy const loop vars
mbaret committed -
* [Strategy][ARM CPU] Low the plevel of contrib spatial pack of depthwise convolution * address comments
Zhao Wu committed -
Ruizhe Zhao committed
-
- 25 Mar, 2020 2 commits
-
-
Wuwei Lin committed
-
* [TE][Schedule] Duplicate likely nodes removed * [1] Test case added * [2] Lint error fixed * [3] Review comments handled * [4] Review comments handled
ANSHUMAN TRIPATHY committed
-
- 24 Mar, 2020 8 commits
-
-
* [Torch] Add support for max_pool1d * add test * fix line-too-long * remove wrapper class
Wang Yucheng committed -
* Fix conv2d conversion for group conv * add more comment for clarification
masahi committed -
* [REFACTOR][TIR] Introduce PrimFuncPass. - Introduce PrimFuncPass - Convert one pass to the unified Pass API. * Address comments * Fix comments
Tianqi Chen committed -
windclarion committed
-
Tianqi Chen committed
-
* save * gradient.rly * fix * NOT WORKING: gradient cell pass * test gradient pass * fixed basic call ops * more tests * fix bug * transform calls to one ones_like zero zero_like * maintenance stuff * fix linting * linting * linting * throw default * remove unrelated changes * import gradent.rly in pass * comment * linting * remove changes to test files * move gradient_cell.cc to transforms * revert change * update files with new commits * type * wrapper function to main outermost function type * fix linting * fix unsigned and signed int comparison * review * GetConstructor definition in module and change op comparison * update node instantiations * increase code readability Co-authored-by: Marisa Kirisame <lolisa@marisa.moe>
Andrew Liu committed -
Jared Roesch committed
-
Existing test cases cover this fix. In addition I have added an assert to make sure that the data_min is always less than equal to data_max.
shoubhik committed
-
- 23 Mar, 2020 9 commits
-
-
* add argsort_nms_thrust * consider valid count in thrust nms sort * make thrust optional * typo * typo * fix pylint * address some of the comments * address more comments * fix lint * address more comments * address more comments
Leyuan Wang committed -
* relay Node::make to constructor * patternwildcard * Address comments
Zhi committed -
Tianqi Chen committed
-
* Improve TF control flow major logic * Pass mod into operator convert function * Fix LoopBound * Add more control flow tests * Add two test cases for stridedslice * Fix docstring * Fix lint * Fix import * Fix test assert * Minor fix conv3d * Add more comments * Fix for dilation2d * Change newly added atan * Change newly added unravel
Yao Wang committed -
[Bugfix] Fixed bug where shifting by out-of-bounds value results in no compute code being emitted. (#5115) * Fixed bug where shifting by out-of-bounds RHS values results in LLVM to codegen nothing. Added regression testcase * Updated testcase to be more precise. * Fixed testcase
pankratz committed -
* isfinite doc update * isfinit expr * isfinit expr * isfinite schedule reg * isfinite python binding * isfinite python binding * relay register isfinite * isfinite type relation * intrin isfinite * topi isfinite * testcase topi isfinite * tf frontend isfinite * tf frontend isfinite testcase * test case relay isfinite * small fixes * test forward tf isfinite * test cases injective for cuda * remove float16 test case * add support for isinf * remove unwanted import * fix conflict
Mahesh Ambule committed -
* first cut unravel_index * merge fixes * change rates to dilations * unravel_index op relay, topi, mxnet, tf * doc changes * small changes * remove empty unravel and argwhere attrs * remove empty unravel and argwhere attrs
Mahesh Ambule committed -
* [DOCS] Cleanup docs before rebuild * Ask doxygen to generate svg to minimize the file size
Tianqi Chen committed -
Neo Chien committed
-
- 22 Mar, 2020 5 commits
-
-
- This allows to emit vectorized loads/stores for CUDA math intrinsics. - A few intrinsics should be lowered as CUDAMath not CUDAFastMath ones. - Fixed the code block identation.
Wei Pan committed -
Haichen Shen committed
-
* Fix up the final pieces * Tweak build.rs
Jared Roesch committed -
Tianqi Chen committed
-
* [DOCS] include a tarball of docs during deployment * [DOCS] Add a short security faq
Tianqi Chen committed
-
- 21 Mar, 2020 3 commits
-
-
* [KERAS]Conv3d support added * Keras conv3d testcase added
Samuel committed -
* Update relay docs * any -> py:func * make clean
Zhi committed -
Yagna Srinath Reddy Battula committed
-
- 20 Mar, 2020 1 commit
-
-
* add Atan/Atan2 op * fix bug and testing
Andrew Liu committed
-