Commits · 6ecfaaff1daa40fddaab6d7b17d0563e2b318930 · wenyuanbo / tic

10 Apr, 2020 5 commits

Adding support for TFLite QnnSub operator. (#5230) · 6ecfaaff
shoubhik committed 4 years ago

6ecfaaff Browse Directory

[NODE] General serialzation of leaf objects into bytes. (#5299) · 029388f5

This PR refactors the serialization mechanism to support general
serialization of leaf objects into bytes.

The new feature superceded the original GetGlobalKey feature for singletons.
Added serialization support for runtime::String.

committed 4 years ago

029388f5 Browse Directory

[TENSORFLOW]reduce ops updated (#5180) · 99c4f9d5
Samuel committed 4 years ago

99c4f9d5 Browse Directory

Create loops according to storage scope and thread hierarchies (#5190) · 3d09e64d

* Set IterVar index to 0 for local thread bound IterVars.

* Lint fix

* Use rank instead of scope name to predicate.  Add tests.

* Handle cases other than local/threadIdx.

* Turn warp to the old behavior.

* Modify test to cover global/blockIdx.

* Fix a typo.

* Update test_te_schedule_ops.py with more testing coverage in test_local_stage_predicate; remove test_schedule_schedule_ops.py which was added by mistake.

committed 4 years ago

3d09e64d Browse Directory

[CI] Temporary disable CRT test (#5297) · a4321e03
Tianqi Chen committed 4 years ago

a4321e03 Browse Directory

09 Apr, 2020 1 commit

[BUGFIX] Fix CRT static test bug (#5293) · 4e05b47e

* [CI][DOCS] Make sure to refresh the cython part

* [BUGFIX] Fix CRT static test bug

* Fix demo_static

* resolve review comment

committed 4 years ago

4e05b47e Browse Directory

08 Apr, 2020 3 commits

[BUGFIX][IR] Fix String SEqual (#5275) · ea063888
```
* fix String SEqual

* retrigger ci
```
Zhi committed 4 years ago
ea063888 Browse Directory
[PYTORCH]celu, gelu, selu activations (#5263) · 989b4819
Samuel committed 4 years ago

989b4819 Browse Directory

[RELAY][BYOC] Add support for composite functions in BYOC (#5261) · d2de35eb

* [RELAY] Add 'check' functions to MergeComposite

Currently, MergeComposite can only perform structural
matches. This patch introduces the ability to specify
a 'check' function alongside the pattern which can include
custom logic to determine whether an extracted pattern
should be merged.

For example, if you only want to merge 'NHWC' convolutions,
you can specify a 'check' function which queries the
data_layout value of the extracted pattern (see the test).

Change-Id: I9337ce39f10997051a286d888be38ed0d410d340

* [RELAY] Reformat merge_composite.cc

Run clang-format on merge_composite.cc

Change-Id: I1736bff798cc6d93e57519b08ab3362869098779

* [RELAY][BYOC] Support composite functions in AnnotateTarget

This patch introduces support to annotate composite functions
in the AnnotateTarget pass. In order for a composite function
to be annotated, you should name it according to the style:

{codegen}.{name}
eg. dnnl.add_relu

Change-Id: I74d6c0b506153d866f6d1feb203b32dad59f2871

committed 4 years ago

d2de35eb Browse Directory

07 Apr, 2020 7 commits

[RUNTIME] Implement TVMDSOOp(TensorFlow custom op) for TVM runtime (#4459) · 53a4ad35

* Add implementation of TVMDSOOp

* feat: Update cmake script to work with c++11 and in-repo build

* feat: Use libtvm as oplib dependency

* fix: Add missing link dependency to libtvm

* feat: Update tf tvmdso op by review comments

* fix: Update with pr comments

* fix: Fix lint

* feat: Add test script and fix gpu shape

* feat: Add test script and fix gpu shape

* fix: Conditional build tftvm op for gpu

* fix: Conditional build tftvm op for gpu

* fix: Fix pylint of tf_op module.py

* fix: Fix pylint of tf_op module.py

* feat: Conditional enable gpu test for tftvm op

* feat: Conditional enable gpu test for tftvm op

* feat: Add tf_tvmdsoop test script as an app test

* fix: Fix gpu/cpu enabled check on tvm in test script

* fix: Make tf tvmdso op test script runnable with pytest

* remove unused test script test_tfop_module.py

* fix: Remove pushd & popd in tfdsoop test script

* fix: Upgrade tftvmop use python3 to find TensorFlow

* fix: Upgrade tftvmop use python3 to find TensorFlow

* fix: Change target_link_options to target_link_libraries

* fix: Add tftvmop build script's c++ option

* fix: Add tvm library path to tf op test library path

* fix: Debug ci build for tftvm dso op

* fix: Fix cmake error and skip tfop test

* fix: Fix typo and indentation issues

* feat: Use TF list input op def

* fix: Fix style and unexpected changes

Co-authored-by: baoxinqi <baoxinqi@4paradigm.com>
Co-authored-by: Chen Dihao <chendihao@4paradigm.com>
Co-authored-by: wrongtest <wrongtest@4paradigm.com>

committed 4 years ago

53a4ad35 Browse Directory

[RUNTIME] Quick fix PackedFunc String passing (#5266) · 2942278a
Tianqi Chen committed 4 years ago

2942278a Browse Directory

[uTVM][Runtime] Introduce Virtual Memory Allocator to CRT (#5124) · e11a6092

* initial crt_memory and memory leak fix in graph_runtime

Change-Id: I0f79f909a04d1c677aabb80f202f0612c5ce7f2a

* fix memory leak

Change-Id: I37104c09e28112b1974fa2b064c809d0a8d686c3

* clean up

Change-Id: I039b12015a1d56c8f4120867cd5a5292da34f3e3

* implement vrealloc

Change-Id: I35800470bcbfcf96652494f359711cb4c2d34398

* allocate from stack memory for most of the variables

Change-Id: I72071289843fff4031c0df8796868a0b9fbc57ee

* allocate from stack memory for all of the variables

Change-Id: I32dba85ac1660c77f51c2d0d8ab6436ed0c01c74

* lint

Change-Id: If12cd240685d7791fc60bc0cfb66389cdc186b73

* lint

Change-Id: I7c9d90c11b60b8edda2427ebd189ebe535af2100

* facilitate the growth of TVM_CRT_MAX_NDIM

Change-Id: I939fa43027a5c7529c5c7c6bd8d6e6beb91b7581

* extend test coverage of vmalloc

Change-Id: Ie4ff6b64fdfe6810836cf8fd44dace82a20c4581

* lint

Change-Id: Ibf3c06619ef296df5c49f3945cb6428777781d69

* move logging.h to src

* fix an error in macOS

* remove logging.h

* use cflags for gcc

* fix compilation error

committed 4 years ago

e11a6092 Browse Directory

[Relay][OP] Add fast_erf implementation (#5241) · f5b02fdb
```
* add fast erf

* doc

* lint

* fix

* fix indent
```
Haichen Shen committed 4 years ago
f5b02fdb Browse Directory
[TIR] Fix perf regression of tir refactor (#5258) · 869b718a
Tianqi Chen committed 4 years ago

869b718a Browse Directory
[Pytorch]layernorm bug fix and testcase updated (#5257) · 8df97ff6
Samuel committed 4 years ago

8df97ff6 Browse Directory
[TFLITE]Hard Swish & MobilnetV3 model testing (#5239) · 608e9458
```
* [TFLITE]Hard Swish & MobilnetV3 model testing

* CI Failure addressed
```
Samuel committed 4 years ago
608e9458 Browse Directory

06 Apr, 2020 5 commits
- [PYTORCH]LayerNorm support added (#5249) · 0cc26614
  Samuel committed 4 years ago
  
  0cc26614 Browse Directory
- [RUNTIME] Enable auto conversion from str to runtime::String in PackedFunc, move… · 5e50f476
```
[RUNTIME] Enable auto conversion from str to runtime::String in PackedFunc, move dtype related handling to data_type.h (#5251)
```
  Tianqi Chen committed 4 years ago
  5e50f476 Browse Directory
- fix lower_warp_memory (#5247) · f31df01e
  Tang, Shizhi committed 4 years ago
  
  f31df01e Browse Directory
- [CI] Update MxNet to 1.6.0 with MKL (#5240) · 41b8fd1e
  Haichen Shen committed 4 years ago
  
  41b8fd1e Browse Directory
- [Runtime][Contrib] Support cudnn softmax (#5214) · 799ff356
  Haichen Shen committed 4 years ago
  
  799ff356 Browse Directory
05 Apr, 2020 3 commits

[Relay][Topi][AutoTVM] Winograd support for Conv3D (#5186) · 02eb1833

* Functional conv3d winograd working.

* Formatted python code.

* registered conv3d winograd compute and started adding relay without_weight_transform operator.

* Add topi testing for conv3d winograd.

* Format file.

* small tweak to unrolling to prevent build sticking.

* Refactoring convolution ops in relay.

* Refactored relay convolutions.

* Bug fixes.

* Fixed static bug in convolution.

* Added conv3d alter op layout and related support.

* Bug fixes and testing done.

* Fix a few autotvm bugs.

* Drop silly debug print.

* Removed debug_skip_region.

* Add variant of conv3d_winograd that doesn't transform depth.

* initial infrastructure done for depthless conv.

* Fix no_depth schedule bugs.

* automatic topi switching between depth and depthless winograd.

* Fixed bug in schedule.

* lint fixes.

* Removed indents in convolution.cc

* missed a few indents oops.

* fixed flop count.

* One more small tweak.

* Change kernel pack inner axes order.

* Style changes.

* Comment fixes.

committed 4 years ago

02eb1833 Browse Directory

[Relay][ADT]Static Tensor Array (#5103) · b5352ee2

* Add other static tensor array ops

* Add tensor array get data

* Minor refactor

* Fix pylint

* Update docstring

* Make get data more generic

* Improve test

* Improve split test

* Improve get data

* Minor fix

* Further improvement for static shape

* Improve shape parsing

* Unify get_static_name

committed 4 years ago

b5352ee2 Browse Directory

[REFACTOR][TIR] Migrate all low-level passes to the Pass Manager. (#5233) · e63e08fe

* [REFACTOR][TIR] Migrate all low-level passes to the Pass Manager.

This PR migrates the tvm.lower to return IRModule of PrimFuncs
instead of the LoweredFuncs.

* Remove LoweredFunc.

committed 4 years ago

e63e08fe Browse Directory

04 Apr, 2020 2 commits
- [ONNX]Pool3d & upsample3d op support (#5135) · fd9ce583
```
* [ONNX]Pool3d and Upsample3d op updated

* Pool3d and Upsample3d testcase

* Review comments fixed

* Review comments
```
  Samuel committed 4 years ago
  fd9ce583 Browse Directory
- [TE] Support mixing normal and cross-thread reduction (#5193) · b41f4e55
```
* Support mixing normal and cross-thread reduction

* minor improvements
```
  Tang, Shizhi committed 4 years ago
  b41f4e55 Browse Directory
03 Apr, 2020 5 commits

[REFACTOR][TIR] Migrate most of low-level build to use the Pass Manager. (#5225) · 75e936e1

* [REFACTOR][TIR] Migrate most of low-level build to use the Pass Manager.

- SplitHostDevice
- ThreadSync
- BindDevice
- LowerThreadAllreduce
- Provide a temp fix for printing IRModule with PrimFunc before the formal text printer.

* Address comments, fix tests.

* Fix relay tests

* Explicit move

committed 4 years ago

75e936e1 Browse Directory

[PYTHON] Make IntImm more like an integer (#5232) · 9b274cbb
Tianqi Chen committed 4 years ago

9b274cbb Browse Directory

[RELAY] Non-recursive Graph Vistor and Rewriter (#4886) · 7de8a539

* First pass a defining a non-recursive Graph Vistor and Rewriter

autoformat

remove a currently empty test until testing is solidfied

* Make CalcDep from Dead Code Elimination non-recursive

* Partially working, not passing all tests yet

passes tests when disabling GetExprRefCount, I think I have a bug in visit counting

fix GetExprRefCount

Fix a subtle bug with nested recursive/non-recursive scopes

* Refactor

* improve comments

* respond to review comments on comments

* Fix a problem with default recursion for dataflow nodes

mark DataflowVisitor methods as override

* implement ScopeMutator

* convert forward_rewrite to ScopeMutator, remove DataflowMutator

* rewrite ExprRewriter and convert fast_math to use it

* switch BiasAddSimplifier to ExprRewriter

fix a clang warning

fix cpp lint

fix doc param error

* respond to review comments

* fix a typo in the iterative looping

* add a regression test for GetExprRefCount issue

* Normalize naming

* fix lint

* First pass a defining a non-recursive Graph Vistor and Rewriter

autoformat

remove a currently empty test until testing is solidfied

* Make CalcDep from Dead Code Elimination non-recursive

* Partially working, not passing all tests yet

passes tests when disabling GetExprRefCount, I think I have a bug in visit counting

fix GetExprRefCount

Fix a subtle bug with nested recursive/non-recursive scopes

* Refactor

* improve comments

* respond to review comments on comments

* Fix a problem with default recursion for dataflow nodes

mark DataflowVisitor methods as override

* implement ScopeMutator

* convert forward_rewrite to ScopeMutator, remove DataflowMutator

* rewrite ExprRewriter and convert fast_math to use it

* switch BiasAddSimplifier to ExprRewriter

fix a clang warning

fix cpp lint

fix doc param error

* respond to review comments

* fix a typo in the iterative looping

* add a regression test for GetExprRefCount issue

* Normalize naming

* fix lint

* respond to review comments

committed 4 years ago

7de8a539 Browse Directory

[KERAS]Upsample3d & ZeroPadding3d op (#5125) · b796c13c

* [KERAS]upsampling3d and zeropadding3d op

* [KERAS]upsampling3d and zeropadding3d test case

* Review comments updated

committed 4 years ago

b796c13c Browse Directory

[CodeGen][CUDA] Fix bugs (#5209) · 316ce055

- Support vectorized casts

- It is incorrect to extract elements from int8x4 with

   0x000000ff & (x >> i * 8)

  as this value is of type int in C/C++. If this expression
  is used for sign extensions, the sign bit will be wrong.
  Simply use C style casts instead and sign bits will just work.

Signed-off-by: Wei Pan <weip@nvidia.com>

committed 4 years ago

316ce055 Browse Directory

02 Apr, 2020 9 commits

[REFACTOR] tvm.hybrid -> te.hybrid (#5223) · 6e1cd825

Rationale: The current hybrid module is more aligned with the te part.
We might consider add a new varient of hybrid script that support the unified IR later.
This refactor paves for the potential later changes.

committed 4 years ago

6e1cd825 Browse Directory

[DOCS] Misc docs improvements (#5222) · 62b3195b
```
- Reduce CI docs task log size.
- Update the relation to halide to the latest state.
```
Tianqi Chen committed 4 years ago
62b3195b Browse Directory
[PYTORCH]AvgPool3d, MaxPool3d and Squeeze op support (#5220) · db535f45
```
* [PYTORCH]AvgPool3d, MaxPool3d and Squeeze op support

* Testcases added

* review comments
```
Samuel committed 4 years ago
db535f45 Browse Directory

[REFACTOR][TIR] Migrate low-level pass functions to Pass Manager, (#5213) · 44bffdb3

- Migrate LowerTVMBultin
- Migrate inferFragment, LowerThreadAllreduce
- Migrate ThreadSync
- Refactor target::Build to directly take IRModule.
- Remove un-used legacy functions.

committed 4 years ago

44bffdb3 Browse Directory

[TIR] Introduce BufferLoad/Store (#5205) · 88d2f34b

Co-authored-by: Siyuan Feng <hzfengsy@sjtu.edu.cn>

This PR introduces BufferLoad/Store to TIR. The new nodes will replace
Provide and Call with Tensor arguments in the subsequent refactors.

committed 4 years ago

88d2f34b Browse Directory

[TIR][PASS] dtype rewrite for indexing variables (#5092) · 4e5c5843
Haozheng Fan committed 4 years ago

4e5c5843 Browse Directory
[REFACTOR][IR] kExternalSymbol -> kGlobalSymbol (#5211) · d2f9af78
```
* expose runtime::String to Python

* kExternalSymbol -> kGlobalSymbol
```
Zhi committed 4 years ago
d2f9af78 Browse Directory

[Frontend][Torch] Fix up graph input handling (#5204) · 03cbf78e

* [Frontend][Torch] Simplify operator input handling

* [Frontend][Torch] Allow user supplied input names to override graph inputs

* Fix pylint issues

* Updates from code review feedback

* Fix tutorial to use shape list input

* Disable intermittent test failure in topi vision test

committed 4 years ago

03cbf78e Browse Directory

[DOCS] Reduce artifcats generated by sphinx gallery (#5208) · 5b857d3c
Tianqi Chen committed 4 years ago

5b857d3c Browse Directory