- 02 Apr, 2020 9 commits
-
-
- Reduce CI docs task log size. - Update the relation to halide to the latest state.
Tianqi Chen committed -
* [PYTORCH]AvgPool3d, MaxPool3d and Squeeze op support * Testcases added * review comments
Samuel committed -
- Migrate LowerTVMBultin - Migrate inferFragment, LowerThreadAllreduce - Migrate ThreadSync - Refactor target::Build to directly take IRModule. - Remove un-used legacy functions.
Tianqi Chen committed -
Co-authored-by: Siyuan Feng <hzfengsy@sjtu.edu.cn> This PR introduces BufferLoad/Store to TIR. The new nodes will replace Provide and Call with Tensor arguments in the subsequent refactors.
Tianqi Chen committed -
Haozheng Fan committed
-
* expose runtime::String to Python * kExternalSymbol -> kGlobalSymbol
Zhi committed -
* [Frontend][Torch] Simplify operator input handling * [Frontend][Torch] Allow user supplied input names to override graph inputs * Fix pylint issues * Updates from code review feedback * Fix tutorial to use shape list input * Disable intermittent test failure in topi vision test
Jeremy Johnson committed -
Tianqi Chen committed
-
* [REFACTOR][TIR] Introduce ExprDeepEqual, Remove IRDeepCompare This PR introduces ExprDeepEqual which reuses the StructuralEqual infra. We migrated the usecases of ir_pass::Equal to ExprDeepEqual and StructuralEqual. * Address comments
Tianqi Chen committed
-
- 01 Apr, 2020 6 commits
-
-
* [RELAY] Fixed issues with MergeCompilerRegions This PR addresses a few outstanding issues with the implementation of MergeCompilerRegions. In particular, it now handles TupleGetItem nodes properly and other minor bugs related to region merging have been fixed. Change-Id: I07783afc56183a6f798a510209f23b0a5f252255 * Fixed issue using pre-merged regions Change-Id: I0a844ac59bda1089ae0c67cef52f0b0c7ab2cbd7 * Removed some debugging logic Change-Id: Ib6f2eede6f38bbb270073eb8d4c4dc19f60832c6 * Remove default annotations Change-Id: I9b7696a51c95871491cbea33c40f92ec327e417f * Annotate default 'if's Change-Id: I0098bd1bf6788dd6366810dcefa84f1ebbffaab0 * Clang format Change-Id: I944365cd3080a97a9261f643a8f1efa5a63cf82b * Use src/dest in merge Change-Id: Ie43113492bda8f1ce63eaf9615cb645bb9e2ee86 * Fixed partition test Change-Id: I46f9e349b1a813a9140f7e4f8a2241687e2df73b * Removed comments Change-Id: I309afdd1951d7e796e41d13788aa487707e0ac4c
mbaret committed -
MORITA Kazutaka committed
-
* [PYTORCH]Dropouts And InstanceNorm support added * Review comments fixed
Samuel committed -
* [BUGFIX]bugfix in tensorflow space_to_batch_nd * Test case added
Samuel committed -
* [PYTORCH]Activations for pytorch * Review comments updated
Samuel committed -
* [TIR][TRANSFORM] Migrate LowerIntrin * LowerDeviceStorageAccessInfo * Migrate LowerWarpMemory
Tianqi Chen committed
-
- 31 Mar, 2020 5 commits
-
-
* [RELAY] Re-wrote the Graph Partitioner to support multiple outputs Input : A Relay module that have functions with disjoint annotated regions using compiler_begin and compiler_end. There could be multiple outputs. Output : A Relay module with global functions for such disjoint annotated regions with calls inserted at the respective location Dependencies : AnnotatedRegionSet Utility class. Methodology : 1) The AnnotatedRegionSet utility class is able to construct a collection of nodes that are bound by a give annotation -- here we use compiler_begin and compiler_end 2) Initially, for each function in the module AnnotatedRegionSets are populated. 3) Then, Vistor pass is traversed until a compiler_end node is encountered that belongs to a "region". 4) When the first compiler_end of a given annotated region is found, a function is formed and inserted. a) if the region has multiple outputs, a Tuple node (capturing all outputs) is returned. 5) Thereafter, if we encounter an another output of the same annotated region, it is important to note that the function is already formed. Therefore, it will lookup the function and add a TupleGetItemNode is inserted. a) We will use the location index of "rets" of each "Region" of AnnotatedRegionSet as TupleGetItemNode index. 6) Therefore, functions will be created for all annotated regions. The name for each global function is created using "Region" id and the compiler name. Change-Id: I1372f02a845b6d3da03b561763e03a378dca263c * [RELAY] Re-wrote the Graph Partitioner to support multiple outputs *removed the expected use-case as we are taking broken-down PR approach *code style fixes *some trivial one liners * [RELAY] Re-wrote the Graph Partitioner to support multiple outputs *fixed an implicit copy to a move * [RELAY] Re-wrote the Graph Partitioner to support multiple outputs *code style changes for comments *renamed test case multiple outputs --> mixed single multiple outputs Since the existing test case checks for both single and multiple output scenarios *added a new test case with conv2d + batch_norm *some var name changes in the test * [RELAY] Re-wrote the Graph Partitioner to support multiple outputs *rebased
manupa-arm committed -
* [Torch] Add support for split * fix * fix test class
Wang Yucheng committed -
* [KERAS]Pool3d support added * Keras pool3d testcase added
Samuel committed -
* refactor * path udpate
Thierry Moreau committed -
* [TE] reverse-mode autodiff without any optimization Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com> * address review comments * add comments and retrigger CI * move unittest to debug ci * move test back and add seed Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>
Yizhi Liu committed
-
- 30 Mar, 2020 10 commits
-
-
* [VTA] Set the correct type for synchronize * Fix the legacy API * Temporary remove the structural equal
Tianqi Chen committed -
* fix miopen convolutions * fix overly long lines
Thomas Viehmann committed -
* [DOCS] Point docs to the ASF site. We have migrated the main docs to the ASF site, which will be periodically updated using the docs generated by the CI. Points the docs to the ASF version. * [CI] Improve the docs generation script
Tianqi Chen committed -
* [RELAY] Add MergeCompilerRegions pass This pass is part of the flow to support creating compiler regions with multiple outputs. It should be called after AnnotateTarget and will merge together regions that share the same target to create larger compiler regions that can be off-loaded to external codegens. This pass implements an algorithm to ensure that during the merging, no data dependency issues are created. See the tests for an example of this case. Co-authored-by: Ramana Radhakrishnan <ramana.radhakrishnan@arm.com> Co-authored-by: Manupa Karunaratne <manupa.karunaratne@arm.com> Change-Id: Ibd99083564608d888482f57c5080109f3eefec88 * [RELAY] Annotate compiler_ends on each edge This alters the behaviour of the AnnotateTarget pass to enforce the property that all compiler annotations exist along a single data flow edge. Specifically, this means they should have exactly one parent and one child. Change-Id: I0e74803a77767f4f377d17755a13a74a30909797 * Fix comment * Rebase *Node::make * Moved block outside for loop * Code style * Update make API * Remove comment * Remove redundant 'else's * Make one line * Fix comment * RefWrite * Fix merge ordering * Add the RFC example as a test * [FIX] Fixed merging behaviour in AnnotateRegionSet Deleting items from a list while iterating it seems to result in undefined behaviour which sometimes segfaults. This makes sure all the item deletion happens separately. * Added checks * Move comment * Update comments
mbaret committed -
* [TFLITE]TOP_K op parser support * Testcase updated
Samuel committed -
* test file for static link added * rename files * Fixed static linking issue * cleanup * changed to dynamic and static demo * MISRA-C static and dynamic test * cleanup * cleanup * Update README.md * cleanup headers * update readme
Mehrdad Hessar committed -
* [CI] Improve VTA build message and scripts. * Use absolute path to set the env var
Tianqi Chen committed -
lfengad committed
-
* [DOCS] Various sphinx related fix. - Use :ref: for reference. - Use :py:class: to refer to API docs. - Update installation guide to also refer to the download page. - Only move html contents in doxygen. * Address review comments * Update wording
Tianqi Chen committed -
Zhi committed
-
- 29 Mar, 2020 4 commits
-
-
Zhi committed
-
Zhi committed
-
* [BUILD] Fix VTA build in CI * [CI] Move build configuration to shell scripts
Tianqi Chen committed -
Thierry Moreau committed
-
- 28 Mar, 2020 4 commits
-
-
* [NODE][IR] Introduce StructuralHash for the Unified IR. This PR introduces a new way to handle structural hash for the unified IR. - Each object can now register an optional SEqualHash function, which describes how to reduce its structural equality to sequence of hash values. - Optionally, the object can choose to allow labeling of vars(e.g. function parameters) by calling DefHash - We implemented a non-recursive structural hasher that maintains its own stack to traverse te IR. This PR also improves the hash value property from the previous relay's hash utility. In particular, the graph node mode hashs a DAG differently from a tree by attaching an unique occurence index to each graph node. In all of the test cases so far, structural_hash is consistent with structural_equal. - if structrual(x, y) then structural_hash(x) == structural_hash(y) - if structural_hash(x) == structural_hash(y) then highly likely structural_equal(x, y) - hash no collison is found in our testcases. Ideally we should work on automatically generating these functions in the future. * Fix cases for EnvFunc and Array dims * fix testcase * Update src/node/structural_hash.cc Co-Authored-By: 雾雨魔理沙 <lolisa@marisa.moe> Co-authored-by: 雾雨魔理沙 <lolisa@marisa.moe>
Tianqi Chen committed -
* [NODE][IR] Introduce StructuralEqual Infra for the Unified IR. This PR introduces a new way to handle structural equality for both TIR and relay nodes in an extensive way. - Each object can now register an optional SEqualReduce function, which describes how to reduce its structural equality to another instance into equality of the children. - Optionally, the object can choose to allow remapping of vars(e.g. function parameters) by calling DefEqual - We implemented a non-recursive structural equality checker that recursively traverses the objects and does the structural equality checking. This PR also fixes a few potential problems in previous relay's AlphaEqual. - In particular, the new structural equality relation will be communicative. - It is can be dangerous to use same_as relation to quickly check equality, demonstrated by the following case. (%x, %y) are shared vars between two functions. - function0: fn (%x, %y) { %x + %y } - function1: fn (%y, %x) { %x + %y } The new structural equal is intented to supersede AlphaEqual and AttrsEqual. Follow-up PRs should be performed to redirect the existing usages, and removes the corresponding implementation. * Update the rule to distinguish between graph node and non-graph nodes. * Refactor the test cases to use structural equal. * address comments * Mark more relay::Expr as graph node, fix a testcase issue(was bug that was not caught by previous alpha equal) * Remove unrelated comment * Fix file comment * Address review comment * Relax condition to fit flaky case
Tianqi Chen committed -
* Fixed conv transpose parsing. * small format change. * Chage test module names. * Simplified test syntax.
Josh Fromm committed -
* Adding support for QNN subtract op * Fixing typo. * Fixing typo. * Fixing lint. * Addressing review comments. * Renaming variables as per convention and renamed QnnBinaryOpTypes -> QnnBinaryOpType * Renaming QnnBinaryOpType to QnnBinaryOpTensorType which now takes the index you want to extract to make the code more readable. * Fixing lint. * Moving common code to macro. * Fixing alignment. * Fixing typo. * Fixing lint. * Renaming method to pass CI.
shoubhik committed
-
- 27 Mar, 2020 1 commit
-
-
* Support TupleGetItemNode in body of pattern only * Add bn_relu test case for MergeComposite with TupleGetItem * formatting * TupleGetItemNode::make -> TupleGetItem()
Trevor Morris committed
-
- 26 Mar, 2020 1 commit
-
-
* register for fast_exp and fast_tanh * Add unit test for fast math * Add unit test for op fast math * Add unit test for op fast math * Add unit tests to guard registering topi schedule for Relay fast_exp and fast_tanh * Fix ident * Fix the indent * Add fast_tanh in the test_fastmath of topi tests
Selo1412 committed
-