Commits · f143881314fdc0518a370e6353c6fb6305cedf23 · wenyuanbo / tic

15 Apr, 2020 9 commits

[PYTHON] Enhance with_attr API, cleanup MakeAPILegacy in testcases (#5335) · f1438813
Tianqi Chen committed Apr 15, 2020

f1438813 Browse Files

[TOPI] Improve get_valid_count and nms performance for CUDA (#5339) · d81b006b

* get_valid_count updated to have correct results

* speedup nms

* update nms

* revert back nms

* recover one test for get_valid_count

committed Apr 15, 2020

d81b006b Browse Files

[TOPI] Using x86 schedules for ARM conv2d. (#5334) · 1265983c
Animesh Jain committed Apr 15, 2020

1265983c Browse Files
[PYTORCH]Take, Topk op support (#5332) · b1364ebb
```
* [PYTORCH]take, topk op support

* Ci Failure fix
```
Samuel committed Apr 15, 2020
b1364ebb Browse Files

Windows Support for cpp_rpc (#4857) · afcf9397

* Windows Support for cpp_rpc

* Add missing patches that fix crashes under Windows

* On Windows, use python to untar vs wsl

* remove some CMakeLists.txt stuff

* more minor CMakeLists.txt changes

* Remove items from CMakeLists.txt

* Minor CMakeLists.txt changes

* More minor CMakeLists.txt changes

* Even more minor CMakeLists.txt changes

* Modify readme

committed Apr 15, 2020

afcf9397 Browse Files

[Runtime][Relay][Cleanup] Clean up for memory pass to enable heterogenous… · 9a8ed5b7

[Runtime][Relay][Cleanup] Clean up for memory pass to enable heterogenous execution support. (#5324)

* Cleanup type pack and unpack for tuples.

* Clean up the memory_pass using common helpers

* Clean up memory.cc

* Refactor pass

* Add doc strings

* Fix CPPlint

* Fix PyLint

* Fix

* Apply suggestions from code review

Co-Authored-By: Zhi <5145158+zhiics@users.noreply.github.com>

* Fix typo

Co-authored-by: Zhi <5145158+zhiics@users.noreply.github.com>

committed Apr 14, 2020

9a8ed5b7 Browse Files

[CI] Fix build.sh to propagate --network=host to the docker build command (#5336) · 92c78266

* when passing --net=host to build.sh it needs to be also
   sent as --network=host to "docker build", so that both
   build and run will use the same network configuration

committed Apr 14, 2020

92c78266 Browse Files

[LLVM] Use llvm::FunctionCallee in IRBuilder::CreateCall with LLVM 11+ (#5338) · e7fcd9e3
```
The older variants of CreateCall have been deprecated and were recently
removed from LLVM. This caused compilation failures.
```
Krzysztof Parzyszek committed Apr 14, 2020
e7fcd9e3 Browse Files
[RELAY] Remove re-exports of tvm.transform (#5337) · 275e317c
Tianqi Chen committed Apr 14, 2020

275e317c Browse Files

14 Apr, 2020 6 commits

[TIR] Refactor MakePackedAPI to target dependent stage. (#5326) · f08d5d78

Previously MakePackedAPI was in the target independent stage,
but never the less requires the device_type information that will be
binded at a later target dependent stage.

The previous implementation was due to the limitation of LoweredFunc
which can not carry buffer_map info(so they have to be lowered right away).
This is no longer the case after the unified IR refactor.

This PR migrates MakePackedAPI to a target dependent stage
and removes the un-necessary BindDevice pass.

committed Apr 14, 2020

f08d5d78 Browse Files

[RELAY][PYTORCH]isNan, isinf, isfinite, ceil, clamp, round ops (#5316) · 4720cf85
```
* [RELAY][PYTORCH]isNan, isinf, isfinite, ceil, clamp, round ops

* Review comments
```
Samuel committed Apr 14, 2020
4720cf85 Browse Files
[TE][BuildModule] Fix import in dump pass ir (#5327) · 1df6bb6d
Wuwei Lin committed Apr 14, 2020

1df6bb6d Browse Files
[Frontend|MXNet] SwapAxis operator support (#5246) · b7545eb5
```
* MXNet swap axis

* MXNet swap axis

* swap axis review comment

* swap axis review comment
```
Mahesh Ambule committed Apr 13, 2020
b7545eb5 Browse Files

[CODEGEN][CUDA] Fix vector load (#5226) · d2e58ad2

* Fix high-low bit bug in __pack_half2

* Fix vector load

* Add unit8 support for PrintVecElemLoadExpr and BroadcastNode

committed Apr 13, 2020

d2e58ad2 Browse Files

add memoized expr translator for use by backend codegen (#5325) · 2c1ca60e
masahi committed Apr 13, 2020

2c1ca60e Browse Files

13 Apr, 2020 7 commits

[COMMUNITY] @mbaret -> Reviewer (#5322) · 0ab18036
Tianqi Chen committed Apr 13, 2020

0ab18036 Browse Files

[BYOC] Enhance partitioning and external codegen (#5310) · 5958d60d

* Remove duplicated output args

* address comment

* fix codegen c

* improve comment

* VisitExprDefault_

* deduce type

committed Apr 13, 2020

5958d60d Browse Files

[RUNTIME][IR] Allow non-nullable ObjectRef, introduce Optional<T>. (#5314) · fc75de9d

* [RUNTIME] Allow non-nullable ObjectRef, introduce Optional<T>.

We use ObjectRef and their sub-classes extensively throughout our codebase.
Each of ObjectRef's sub-classes are nullable, which means they can hold nullptr
as their values.

While in some places we need nullptr as an alternative value. The implicit support
for nullptr in all ObjectRef creates additional burdens for the developer
to explicitly check defined in many places of the codebase.

Moreover, it is unclear from the API's intentional point of view whether
we want a nullable object or not-null version(many cases we want the later).

Borrowing existing wisdoms from languages like Rust. We propose to
introduce non-nullable ObjectRef, and Optional<T> container that
represents a nullable variant.

To keep backward compatiblity, we will start by allowing most ObjectRef to be nullable.
However, we should start to use Optional<T> as the type in places where
we know nullable is a requirement. Gradually, we will move most of the ObjectRef
to be non-nullable and use Optional<T> in the nullable cases.

Such explicitness in typing can help reduce the potential problems
in our codebase overall.

Changes in this PR:
- Introduce _type_is_nullable attribute to ObjectRef
- Introduce Optional<T>
- Change String to be non-nullable.
- Change the API of function->GetAttr to return Optional<T>

* Address review comments

* Upgrade all compiler flags to c++14

* Update as per review comment

committed Apr 13, 2020

fc75de9d Browse Files

[Topi] Tensorcore support for Conv3D (#5284) · 3df8d560

* one weird trick.

* Added schedule knob for different workloads.

* Initial conv3d tensorcore working.

* Added conv3d tensorcore strategy.

* Added layout conversion to tensorcore friendly format for conv2d and conv3d.

* Add target name check.

* Fixed bad names and depthwise check.

* Removed duplicated attribute assignment.

committed Apr 13, 2020

3df8d560 Browse Files

[REALY][OP] fix typo (#5315) · 0d48361a
```
Signed-off-by: windclarion <windclarion@gmail.com>
```
windclarion committed Apr 13, 2020
0d48361a Browse Files
[PYTORCH]Reduce_ops support added (#5308) · 6805d543
```
* [PYTORCH]Reduce_ops support added

* Review comments updated

* typo bug in qnn test
```
Samuel committed Apr 13, 2020
6805d543 Browse Files

[Torch] Support Python list, more realistic recurrent networks (#5306) · 0145cd50

* use funcs from prelude, pass around convert_map

* get relay input type from user ishape

* handle tuple unpack

* experimenting with static tensor array

* use prelude concat instead of cons + rev

* minor clean up

* fix layer norm conversion bug, unwrap tensor array

* add infer shape on tensor array

* pass around prelude for now

* compile worked but runtime error

* fix tensor array wrapping

* begin list dynamic test

* is_list_dynamic first version

* finish dynamic list test

* a few fix

* use shape_of function if Any is found

* improve size conversion

* working on adding free vars to loop block

* fixed inlined inner loop issue

* clean up free var handling

* add support for tensor array concat

* adding ta concat on last axis

* fix concat, but got runtime error

* disable concat on axis -1 for now

* add lstm tests

* revert unrelated change

* fix stacked bidir test

* minor fix to test

* relax tol a bit, revert dnnl change to avoid conflict

* simplify infer type, use input tensor shape rather than concat shape

* more shape fix

committed Apr 12, 2020

0145cd50 Browse Files

12 Apr, 2020 5 commits

[Intrinsic] Add log1p, ldexp, atan2, hypot, nextafter, copysign (#5312) · cd0d52da
```
* [Intrinsic] Add log1p, ldexp, atan2, hypot, nextafter, copysign

* Lint
```
Junru Shao committed Apr 12, 2020
cd0d52da Browse Files
[Rust][CI] Restore Rust CI (#5137) · 9c591510
Jared Roesch committed Apr 12, 2020

9c591510 Browse Files
Remove PrimExpr from String (#5311) · 8c31d0dd
Zhi committed Apr 12, 2020

8c31d0dd Browse Files
[Requantize] Cleanup and Optimize Lowering (#5286) · 92d0ec14
```
* Adding Cast back to Int32 in FixedPointMultiply.

* Removing extra clip.

* Fix space.

* Retrigger.

* Retrigger.
```
Animesh Jain committed Apr 11, 2020
92d0ec14 Browse Files

[IR][TRANSFORM] Enable CopyOnWrite for passes. (#5309) · e4b80bda

This PR enables the copy on write optimizations passes:
- Enable COW for IRModule both TIR and relay passes.
- Enabled COW for PrimFunc in TIR passes.

Need more thoughts into whether/how to enable COW
for relay::Function, due to some function passes depend
on the presence of IRModule for context information,
and the std::move of the related function to nullptr
might affect the related behavior.

committed Apr 11, 2020

e4b80bda Browse Files

11 Apr, 2020 5 commits

[PYTORCH]Abs, Arange, Softplus ops (#5295) · 5b37d4c1
```
* [PYTHON]Abs, Arange, Softplus ops

* Review comments updated
```
Samuel committed Apr 11, 2020
5b37d4c1 Browse Files

[LLVM] Fix generation of LLVM intrinsics (#5282) · 403929f9

* [LLVM] Fix generation of LLVM intrinsics

The type list in the call to llvm::Intrinsic::getDeclaration is not
the intrinsic's signature, it's the list of overloaded types. Without
this fix, the updated unit test would cause the following error:

TVMError: LLVM module verification failed with the following errors:
Intrinsic name not mangled correctly for type arguments! Should be:
llvm.ctlz.i32
i32 (i32, i1)* @llvm.ctlz.i32.i1

Special handling for llvm.prefetch, sig matching for overloaded ints only

The prefetch intrinsic returns void in LLVM, while it returns i32 in TVM.
This case needs to be handled specially, because rule-based intrinsic
translation would cause invalid LLVM type to be created.

Do the signature matching only for overloaded intrinsics. It's not needed
for non-overloaded ones, so this can save a bit of compile-time.

* Include intrinsic name in the error message

* Fix number of arguments for llvm.fmuladd and llvm.pow

committed Apr 10, 2020

403929f9 Browse Files

[BYOC] Add example of Composite + Annotate for DNNL fused op (#5272) · 3616ebee
```
* merge change from dev branch

* fix string issue

* bring comanic's change back
```
masahi committed Apr 11, 2020
3616ebee Browse Files
[Frontend][TensorFlow]Improve TensorFlow Static Shape Tensor Array (#5243) · 4b27cd14
```
* Support TF Frontend Static TensorArray

* Fix pylint

* Fix lint

* Move get_tensor_array_shape into prelude

* Fix lint

* Fix common
```
Yao Wang committed Apr 11, 2020
4b27cd14 Browse Files

[RUNTIME] Introduce RValue reference(move) support to TypedPackedFunc (#5271) · b72dd9d9

* [RUNTIME] Introduce RValue reference(move) support to TypedPackedFunc

This PR introduces RValue reference support the PackedFunc calling convention to address the above issue.
Specifically, when an argument is a r-value reference, we will use a assign a different type code(`kObjectRValueRefArg`),
and pass `Object**`  (the address to the Object pointer) instead through the values array.
The callee can choose to move out this Object pointer and set the original Object pointer from the caller side to be nullptr.

We also add an experimental move support to the python side(marked as _move so to indicate the dev nature).
This enhancement will enable copy on write optimizations through out the TVM stack.

* Address review comments

* fix compilation

committed Apr 10, 2020

b72dd9d9 Browse Files

10 Apr, 2020 8 commits

[RELAY][FRONTEND][CAFFE2] add Mul and ConvTranspose operator (#5302) · 575d5369
Huacong Yang committed Apr 10, 2020

575d5369 Browse Files

[BYOC] Refine AnnotateTarget and MergeCompilerRegion Passes (#5277) · f506c8b1

* add target to region

* refactor annotate_target

* Make all unit test working

* quick fix

* enable BN, unit test failed

* Fix vm test, unit test. Refactor annotate_target a bit.

* quick fix fusion

* revert fusion change

* style fix

* Refactor merge region pass

* format

* minor fix

* Skip e2e test

* lint

* support AnnotateTarget multiple runs

* Add HasAttr and revert DNNL codegen

* address comment

Co-authored-by: Zhi Chen <chzhi@amazon.com>

committed Apr 10, 2020

f506c8b1 Browse Files

[CI] Fix the hexagon string (#5304) · 5795539c
Tianqi Chen committed Apr 10, 2020

5795539c Browse Files

[Arith] linear system and equation solver (#5171) · e21f2682

* [arith] linear system and equation solver

Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>

* avoid constructing analyzer every time

* generate random test cases and address comments

Co-authored-by: Sergei Grechanik <sergei.grechanik@gmail.com>

* rename linear_system to int_constraints

* add comments and use random seed

* message for reporting failure with seed

* add SEqualReduce to IntConstraints; allow variables & ranges to be None

Co-authored-by: Sergei Grechanik <sergei.grechanik+h@gmail.com>
Co-authored-by: Sergei Grechanik <sergei.grechanik@gmail.com>

committed Apr 10, 2020

e21f2682 Browse Files

[PYTORCH]Repeat, Reciprocal & Reshape Op support (#5280) · b236565e
Samuel committed Apr 11, 2020

b236565e Browse Files
[FRONTEND][TENSORFLOW] Fix gather_nd indices (#5279) · 0d1babce
```
* [FRONTEND][TENSORFLOW] Fix gather_nd indices

* retrigger CI
```
MORITA Kazutaka committed Apr 10, 2020
0d1babce Browse Files
Update device_annotation.cc (#5291) · 00014e20
weiliangweiliang committed Apr 10, 2020

00014e20 Browse Files

[REFACTOR][IR] Move to runtime::String (#5276) · 5da361d3

* Use runtime::String

* move string to tvm namespace

* add const char* constructor

* implicit cast from std::string

committed Apr 10, 2020

5da361d3 Browse Files