Commits · 5b37d4c15378e872c279ca5edbcb077d1a5fd20b · wenyuanbo / tic

11 Apr, 2020 5 commits

[PYTORCH]Abs, Arange, Softplus ops (#5295) · 5b37d4c1
```
* [PYTHON]Abs, Arange, Softplus ops

* Review comments updated
```
Samuel committed Apr 11, 2020
5b37d4c1 Browse Files

[LLVM] Fix generation of LLVM intrinsics (#5282) · 403929f9

* [LLVM] Fix generation of LLVM intrinsics

The type list in the call to llvm::Intrinsic::getDeclaration is not
the intrinsic's signature, it's the list of overloaded types. Without
this fix, the updated unit test would cause the following error:

TVMError: LLVM module verification failed with the following errors:
Intrinsic name not mangled correctly for type arguments! Should be:
llvm.ctlz.i32
i32 (i32, i1)* @llvm.ctlz.i32.i1

Special handling for llvm.prefetch, sig matching for overloaded ints only

The prefetch intrinsic returns void in LLVM, while it returns i32 in TVM.
This case needs to be handled specially, because rule-based intrinsic
translation would cause invalid LLVM type to be created.

Do the signature matching only for overloaded intrinsics. It's not needed
for non-overloaded ones, so this can save a bit of compile-time.

* Include intrinsic name in the error message

* Fix number of arguments for llvm.fmuladd and llvm.pow

committed Apr 10, 2020

403929f9 Browse Files

[BYOC] Add example of Composite + Annotate for DNNL fused op (#5272) · 3616ebee
```
* merge change from dev branch

* fix string issue

* bring comanic's change back
```
masahi committed Apr 11, 2020
3616ebee Browse Files
[Frontend][TensorFlow]Improve TensorFlow Static Shape Tensor Array (#5243) · 4b27cd14
```
* Support TF Frontend Static TensorArray

* Fix pylint

* Fix lint

* Move get_tensor_array_shape into prelude

* Fix lint

* Fix common
```
Yao Wang committed Apr 11, 2020
4b27cd14 Browse Files

[RUNTIME] Introduce RValue reference(move) support to TypedPackedFunc (#5271) · b72dd9d9

* [RUNTIME] Introduce RValue reference(move) support to TypedPackedFunc

This PR introduces RValue reference support the PackedFunc calling convention to address the above issue.
Specifically, when an argument is a r-value reference, we will use a assign a different type code(`kObjectRValueRefArg`),
and pass `Object**`  (the address to the Object pointer) instead through the values array.
The callee can choose to move out this Object pointer and set the original Object pointer from the caller side to be nullptr.

We also add an experimental move support to the python side(marked as _move so to indicate the dev nature).
This enhancement will enable copy on write optimizations through out the TVM stack.

* Address review comments

* fix compilation

committed Apr 10, 2020

b72dd9d9 Browse Files

10 Apr, 2020 18 commits

[RELAY][FRONTEND][CAFFE2] add Mul and ConvTranspose operator (#5302) · 575d5369
Huacong Yang committed Apr 10, 2020

575d5369 Browse Files

[BYOC] Refine AnnotateTarget and MergeCompilerRegion Passes (#5277) · f506c8b1

* add target to region

* refactor annotate_target

* Make all unit test working

* quick fix

* enable BN, unit test failed

* Fix vm test, unit test. Refactor annotate_target a bit.

* quick fix fusion

* revert fusion change

* style fix

* Refactor merge region pass

* format

* minor fix

* Skip e2e test

* lint

* support AnnotateTarget multiple runs

* Add HasAttr and revert DNNL codegen

* address comment

Co-authored-by: Zhi Chen <chzhi@amazon.com>

committed Apr 10, 2020

f506c8b1 Browse Files

[CI] Fix the hexagon string (#5304) · 5795539c
Tianqi Chen committed Apr 10, 2020

5795539c Browse Files

[Arith] linear system and equation solver (#5171) · e21f2682

* [arith] linear system and equation solver

Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>

* avoid constructing analyzer every time

* generate random test cases and address comments

Co-authored-by: Sergei Grechanik <sergei.grechanik@gmail.com>

* rename linear_system to int_constraints

* add comments and use random seed

* message for reporting failure with seed

* add SEqualReduce to IntConstraints; allow variables & ranges to be None

Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>
Co-authored-by: Sergei Grechanik <sergei.grechanik@gmail.com>

committed Apr 10, 2020

e21f2682 Browse Files

[PYTORCH]Repeat, Reciprocal & Reshape Op support (#5280) · b236565e
Samuel committed Apr 11, 2020

b236565e Browse Files
[FRONTEND][TENSORFLOW] Fix gather_nd indices (#5279) · 0d1babce
```
* [FRONTEND][TENSORFLOW] Fix gather_nd indices

* retrigger CI
```
MORITA Kazutaka committed Apr 10, 2020
0d1babce Browse Files
Update device_annotation.cc (#5291) · 00014e20
weiliangweiliang committed Apr 10, 2020

00014e20 Browse Files

[REFACTOR][IR] Move to runtime::String (#5276) · 5da361d3

* Use runtime::String

* move string to tvm namespace

* add const char* constructor

* implicit cast from std::string

committed Apr 10, 2020

5da361d3 Browse Files

[NDArray] Set shape_ in NDArray::FromDLPack (#5301) · 48082358
hlu1 committed Apr 10, 2020

48082358 Browse Files

[RUNTIME] Initial implementation of Hexagon runtime support (#5252) · 02d3a59b

* [RUNTIME] Initial implementation of Hexagon runtime support

This is only the TVM runtime. The FastRPC libraries, simulator driver,
etc. will be provided in subsequent commits.

* Fix pylint complaints

* Fix some more pylint complaints

* Add link to the Hexagon SDK website

* Extract VTCM marker into a common variable

* Implement device->device memory copy

* Disable unsigned PDs by default

* Ensure that --hvx_length is present in sim_args if HVX is enabled

* Remove the line about clang from README.md

Apparently things work with libstdc++.

* Mention to set USE_RPC=OFF when building libtvm_runtime.so for Hexagon

* Remember to use codegen_hvx in validate_hvx_length

* Add a line about minimum version of LLVM

committed Apr 10, 2020

02d3a59b Browse Files

[BYOC] Refine DNNL Codegen (#5288) · f0f03647
```
* Improve DNNL

* Add bind params

* trigger ci
```
Cody Yu committed Apr 10, 2020
f0f03647 Browse Files
Adding support for TFLite QnnSub operator. (#5230) · 6ecfaaff
shoubhik committed Apr 09, 2020

6ecfaaff Browse Files

[NODE] General serialzation of leaf objects into bytes. (#5299) · 029388f5

This PR refactors the serialization mechanism to support general
serialization of leaf objects into bytes.

The new feature superceded the original GetGlobalKey feature for singletons.
Added serialization support for runtime::String.

committed Apr 09, 2020

029388f5 Browse Files

Legalize - Use Non-recursive Rewriter. (#5296) · 7d670b04
```
* Legalize - Use Non-recursive Rewriter.

* Cleanup.
```
Animesh Jain committed Apr 09, 2020
7d670b04 Browse Files
[Node] Provide guide to user who has difficulty register SEqualReduce (#5300) · 2b968204
Yizhi Liu committed Apr 09, 2020

2b968204 Browse Files
[TENSORFLOW]reduce ops updated (#5180) · 99c4f9d5
Samuel committed Apr 09, 2020

99c4f9d5 Browse Files

Create loops according to storage scope and thread hierarchies (#5190) · 3d09e64d

* Set IterVar index to 0 for local thread bound IterVars.

* Lint fix

* Use rank instead of scope name to predicate.  Add tests.

* Handle cases other than local/threadIdx.

* Turn warp to the old behavior.

* Modify test to cover global/blockIdx.

* Fix a typo.

* Update test_te_schedule_ops.py with more testing coverage in test_local_stage_predicate; remove test_schedule_schedule_ops.py which was added by mistake.

committed Apr 09, 2020

3d09e64d Browse Files

[CI] Temporary disable CRT test (#5297) · a4321e03
Tianqi Chen committed Apr 09, 2020

a4321e03 Browse Files

09 Apr, 2020 1 commit

[BUGFIX] Fix CRT static test bug (#5293) · 4e05b47e

* [CI][DOCS] Make sure to refresh the cython part

* [BUGFIX] Fix CRT static test bug

* Fix demo_static

* resolve review comment

committed Apr 09, 2020

4e05b47e Browse Files

08 Apr, 2020 6 commits

[BUGFIX][IR] Fix String SEqual (#5275) · ea063888
```
* fix String SEqual

* retrigger ci
```
Zhi committed Apr 08, 2020
ea063888 Browse Files
update compiler version in docs (#5281) · d430d528
Luis Vega committed Apr 08, 2020

d430d528 Browse Files
[LINT] Remove scalalint from lint deps (#5269) · 89da63e2
Haichen Shen committed Apr 07, 2020

89da63e2 Browse Files
[LLVM] Include Support/Host.h for declaration of getDefaultTargetTriple (#5268) · e9c90b72
```
In newer versions of LLVM, this header is no longer included by one of
the already included headers in llvm_common.h, so include it explicitly.
```
Krzysztof Parzyszek committed Apr 07, 2020
e9c90b72 Browse Files
[PYTORCH]celu, gelu, selu activations (#5263) · 989b4819
Samuel committed Apr 08, 2020

989b4819 Browse Files

[RELAY][BYOC] Add support for composite functions in BYOC (#5261) · d2de35eb

* [RELAY] Add 'check' functions to MergeComposite

Currently, MergeComposite can only perform structural
matches. This patch introduces the ability to specify
a 'check' function alongside the pattern which can include
custom logic to determine whether an extracted pattern
should be merged.

For example, if you only want to merge 'NHWC' convolutions,
you can specify a 'check' function which queries the
data_layout value of the extracted pattern (see the test).

Change-Id: I9337ce39f10997051a286d888be38ed0d410d340

* [RELAY] Reformat merge_composite.cc

Run clang-format on merge_composite.cc

Change-Id: I1736bff798cc6d93e57519b08ab3362869098779

* [RELAY][BYOC] Support composite functions in AnnotateTarget

This patch introduces support to annotate composite functions
in the AnnotateTarget pass. In order for a composite function
to be annotated, you should name it according to the style:

{codegen}.{name}
eg. dnnl.add_relu

Change-Id: I74d6c0b506153d866f6d1feb203b32dad59f2871

committed Apr 08, 2020

d2de35eb Browse Files

07 Apr, 2020 10 commits

[RUNTIME] Implement TVMDSOOp(TensorFlow custom op) for TVM runtime (#4459) · 53a4ad35

* Add implementation of TVMDSOOp

* feat: Update cmake script to work with c++11 and in-repo build

* feat: Use libtvm as oplib dependency

* fix: Add missing link dependency to libtvm

* feat: Update tf tvmdso op by review comments

* fix: Update with pr comments

* fix: Fix lint

* feat: Add test script and fix gpu shape

* feat: Add test script and fix gpu shape

* fix: Conditional build tftvm op for gpu

* fix: Conditional build tftvm op for gpu

* fix: Fix pylint of tf_op module.py

* fix: Fix pylint of tf_op module.py

* feat: Conditional enable gpu test for tftvm op

* feat: Conditional enable gpu test for tftvm op

* feat: Add tf_tvmdsoop test script as an app test

* fix: Fix gpu/cpu enabled check on tvm in test script

* fix: Make tf tvmdso op test script runnable with pytest

* remove unused test script test_tfop_module.py

* fix: Remove pushd & popd in tfdsoop test script

* fix: Upgrade tftvmop use python3 to find TensorFlow

* fix: Upgrade tftvmop use python3 to find TensorFlow

* fix: Change target_link_options to target_link_libraries

* fix: Add tftvmop build script's c++ option

* fix: Add tvm library path to tf op test library path

* fix: Debug ci build for tftvm dso op

* fix: Fix cmake error and skip tfop test

* fix: Fix typo and indentation issues

* feat: Use TF list input op def

* fix: Fix style and unexpected changes

Co-authored-by: baoxinqi <baoxinqi@4paradigm.com>
Co-authored-by: Chen Dihao <chendihao@4paradigm.com>
Co-authored-by: wrongtest <wrongtest@4paradigm.com>

committed Apr 07, 2020

53a4ad35 Browse Files

[LLVM] Do not use x86_vcvtph2ps_256 intrinsic with LLVM 11+ (#5267) · 4e007632
```
This intrinsic was removed in LLVM 11.
```
Krzysztof Parzyszek committed Apr 07, 2020
4e007632 Browse Files
[RUNTIME] Quick fix PackedFunc String passing (#5266) · 2942278a
Tianqi Chen committed Apr 07, 2020

2942278a Browse Files

[LLVM] Use llvm::ElementCount with LLVM 11+ when creating vectors (#5265) · df8a6f3b

LLVM 11 added support for scalable vectors, and now the number of
elements in a vector is represented by a llvm::ElementCount class,
not just a number.

committed Apr 07, 2020

df8a6f3b Browse Files

[LLVM] Use llvm::Align with LLVM 11+ to avoid warnings (#5264) · 36ce2e24

LLVM 11 is introducing a separate class to represent alignment.
The functions in IRBuilder that create aligned loads and stores,
and which accept the alignment as an unsigned value have been
deprecated (and now cause warnings to be emitted).

committed Apr 07, 2020

36ce2e24 Browse Files

[uTVM][Runtime] Introduce Virtual Memory Allocator to CRT (#5124) · e11a6092

* initial crt_memory and memory leak fix in graph_runtime

Change-Id: I0f79f909a04d1c677aabb80f202f0612c5ce7f2a

* fix memory leak

Change-Id: I37104c09e28112b1974fa2b064c809d0a8d686c3

* clean up

Change-Id: I039b12015a1d56c8f4120867cd5a5292da34f3e3

* implement vrealloc

Change-Id: I35800470bcbfcf96652494f359711cb4c2d34398

* allocate from stack memory for most of the variables

Change-Id: I72071289843fff4031c0df8796868a0b9fbc57ee

* allocate from stack memory for all of the variables

Change-Id: I32dba85ac1660c77f51c2d0d8ab6436ed0c01c74

* lint

Change-Id: If12cd240685d7791fc60bc0cfb66389cdc186b73

* lint

Change-Id: I7c9d90c11b60b8edda2427ebd189ebe535af2100

* facilitate the growth of TVM_CRT_MAX_NDIM

Change-Id: I939fa43027a5c7529c5c7c6bd8d6e6beb91b7581

* extend test coverage of vmalloc

Change-Id: Ie4ff6b64fdfe6810836cf8fd44dace82a20c4581

* lint

Change-Id: Ibf3c06619ef296df5c49f3945cb6428777781d69

* move logging.h to src

* fix an error in macOS

* remove logging.h

* use cflags for gcc

* fix compilation error

committed Apr 07, 2020

e11a6092 Browse Files

[Relay][OP] Add fast_erf implementation (#5241) · f5b02fdb
```
* add fast erf

* doc

* lint

* fix

* fix indent
```
Haichen Shen committed Apr 07, 2020
f5b02fdb Browse Files
[TIR] Fix perf regression of tir refactor (#5258) · 869b718a
Tianqi Chen committed Apr 07, 2020

869b718a Browse Files
Fixed typo and type mismatch (#5259) · 7902f762
```
Co-authored-by: Adrian Muresan <muresan.adrian.bn@gmail.com>
```
Adrian Muresan committed Apr 07, 2020
7902f762 Browse Files
[Pytorch]layernorm bug fix and testcase updated (#5257) · 8df97ff6
Samuel committed Apr 07, 2020

8df97ff6 Browse Files