Commits · 5a17707089fd0fb23d482d8e3efcd965b7ccdf5d · wenyuanbo / tic

22 Oct, 2019 1 commit
- [relay][vm] Reuse allocated device memory (#4170) · 5a177070
  Zhi committed 5 years ago
  
21 Oct, 2019 1 commit

[REFACTOR][NODE][RUNTIME] Move Node to the new Object protocol. (#4161) · 7895adb2

* [REFACTOR][NODE][RUNTIME] Move Node to the new Object protocol.

This PR removes the original node system and makes Node a subclass of Object.
This is a major refactor towards a better unified runtime object system.

List of changes in the refactor:

- We now hide data_ field, use Downcast explicitly to get a sub-class object.
- Removed the node system FFI in python.
- Removed the node C API, instead use PackedFunc for list and get attrs.
- Change relay::Op::set_attr_type_key(attr_key_name) to relay::Op::set_attr_type<AttrType>().
  - This change was necessary because of the new Object registration mechanism.
  - Subsequent changes to the op registrations
  - The change revealed a few previous problems that are now fixed.
- Patched up a few missing node type registrations.
  - We now raise an error if an object type is used without being registered.
- The original node.h and container.h are kept in the same location.
- Calling convention: kObjectHandle now equals the old kNodeHandle, kNodeHandle is removed.
- IRFunctor now dispatches on ObjectRef.
- Update to the new type checking API: is_type, derived_from are replaced by IsInstance.
- Removed .hash member function, instead use C++ convention hasher functors.

* Address review comments

committed 5 years ago


20 Oct, 2019 1 commit

[Refactor] Rename Datatype to ADT (#4156) · 32aad56c

We think it will reduce confusion about the meaning.

https://discuss.tvm.ai/t/discuss-consider-rename-vm-datatype/4339

committed 5 years ago


18 Oct, 2019 1 commit

Add lift_if_then_else pass (#3865) · 687d4a83

* Add LiftIfThenElse pass

* Add more comments

* Rename and refactor

* Add description for internal data structure

* Rename a test

* Minor change

* Address comments

* Improve update_for

committed 5 years ago


17 Oct, 2019 1 commit

[relay][vm] Separate VM runtime with executable (#4100) · 4052de6d

* [relay][vm] Separate VM runtime with executable

* Address comments

* move ctx back to vm

* make only vm related fields and methods protected

* integrate serialization/deserialization to executable

* create stream

committed 5 years ago


16 Oct, 2019 2 commits

[RUNTIME] Refactor object python FFI to new protocol. (#4128) · 02c1e117

* [RUNTIME] Refactor object python FFI to new protocol.

This is a pre-req to bring the Node system under object protocol.
Most of the code reflects the current code in the Node system.

- Use `__new__` instead of `__init__` so subclasses can define their own constructors
- Allow registration via name, besides type index
- Introduce necessary runtime C API functions
- Refactored Tensor and Datatype to directly use constructor.
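
A minimal sketch of the first two bullets, assuming "new" and "init" refer to Python's `__new__` and `__init__`. The registry, `register_object`, `wrap_handle`, and the `demo.Pair` type name are hypothetical and are not TVM's actual FFI. The point is only that wrapping an existing handle through `__new__` bypasses `__init__`, which leaves subclasses free to define their own constructor signatures.

```python
# Hypothetical registry mapping type names to Python classes.
_REGISTRY = {}


def register_object(type_name):
    """Class decorator that registers a wrapper class under a type name."""
    def decorator(cls):
        _REGISTRY[type_name] = cls
        return cls
    return decorator


class ObjectBase:
    """Base wrapper; `handle` stands in for an opaque runtime handle."""
    handle = None


@register_object("demo.Pair")
class Pair(ObjectBase):
    def __init__(self, lhs, rhs):
        # User-facing constructor with its own signature.
        self.handle = ("demo.Pair", lhs, rhs)


def wrap_handle(type_name, handle):
    """Wrap a handle returned by the runtime without calling __init__."""
    cls = _REGISTRY.get(type_name, ObjectBase)
    obj = cls.__new__(cls)  # allocates the wrapper, skips __init__
    obj.handle = handle
    return obj


made_by_user = Pair(1, 2)
made_by_runtime = wrap_handle("demo.Pair", ("demo.Pair", 3, 4))
```

Both objects are instances of `Pair`, but only the first went through the user-defined constructor.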

* address review comments

committed 5 years ago


[QNN] Change default rounding to UPWARD. (#4131) · 1c0e7435
Animesh Jain committed 5 years ago


15 Oct, 2019 1 commit

[RFC][RUNTIME] Introduce new object protocol. (#4115) · a0bd3786

* [RUNTIME] Introduce new object protocol.

This PR introduces a new object protocol to unify the node and object.
We also updated the existing runtime::vm code to make use of the new system.

Update to the node will be done in a follow up PR.

Other changes:

- Remove object related code in json serializer as that code logic was not complete
  and we have a separate serializer for VM, can revisit later.

* address review comment

* Fix the child slot logic

committed 5 years ago


11 Oct, 2019 1 commit

[tvm][any] broadcast with values other than one (#3967) · 9d5cba20

* [tvm][any] broadcast with values other than 1

* Add test for incompatible runtime values

* Remove hybrid script compact buffer binding

* retrigger ci

committed 5 years ago


10 Oct, 2019 1 commit

[TOPI] FIFO buffer op, to accelerate sequence modeling with dilated convolutions (#4039) · aa424139

* Add FIFO buffer op to enable explicit computation re-use in convolution

* Add a test

* Add end-to-end test with 1D convolution

* Add a stub in MXNet frontend

* Address reviewer comments

* Add back stub for MXNet frontend

committed 5 years ago
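
A rough illustration of why a FIFO buffer enables computation re-use in a streaming (dilated) 1-D convolution. This is a plain-Python sketch under the assumption that the buffer behaves like a fixed-size sliding window; `make_streaming_conv1d` is a hypothetical helper and does not reflect the TOPI operator's signature.

```python
from collections import deque


def make_streaming_conv1d(weights, dilation=1):
    """Return a step function that consumes one sample and emits one output."""
    span = (len(weights) - 1) * dilation + 1  # receptive field in samples
    window = deque([0.0] * span, maxlen=span)  # zero-initialised FIFO

    def step(sample):
        window.append(sample)  # the oldest sample falls off the front
        # Taps are spaced `dilation` apart; the last weight sees the newest sample.
        return sum(w * window[i * dilation] for i, w in enumerate(weights))

    return step


step = make_streaming_conv1d([1.0, 2.0], dilation=2)
outputs = [step(x) for x in (1.0, 2.0, 3.0)]
```

Each call touches only the samples inside the receptive field, so earlier inputs never need to be fed through the convolution again.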


09 Oct, 2019 1 commit

[TVM] Rewrite simplification rule to eliminate unnecessary conditionals. (#4076) · f2abd9f6

The current bounds checking infrastructure inserts checks like:

```
for (i, 0, bounds[n]) {
  if (likely(i < bounds[n])) {
     ...
  }
}
```

into the TVM IR, and these checks are currently not removed by the simplification infrastructure.
This is a little unclean, as these are trivially true since for a loop var `i`
with a given min and extent, we are guaranteed that `i >= min` and `i < min +
extent`. Thus, we can insert these checks into the IR and use them to eliminate
trivial bounds checks early on.
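
The reasoning above can be sketched with constant bounds. This is an illustration only: `LoopVar`, `provably_less_than`, and `provably_at_least` are hypothetical names, and the sketch handles integer constants rather than the symbolic `bounds[n]` the actual simplifier has to reason about.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoopVar:
    """A loop variable ranging over [min, min + extent)."""
    name: str
    min: int
    extent: int


def provably_less_than(var, bound):
    """True if `var < bound` holds on every iteration of the loop."""
    return var.min + var.extent <= bound


def provably_at_least(var, bound):
    """True if `var >= bound` holds on every iteration of the loop."""
    return var.min >= bound


i = LoopVar("i", 0, 16)
# `i < 16` is implied by the loop range, so a guard on it can be dropped.
can_drop_guard = provably_less_than(i, 16)
```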

committed 5 years ago


06 Oct, 2019 1 commit
- [Relay][AlterOp] Improving support for broadcast layout alteration. (#4040) · d703fb4e
  Animesh Jain committed 5 years ago
  
03 Oct, 2019 1 commit
- [Relay][Op] Add instance norm op (#4004) · 7d911f46
```
* [Relay][Op] Add instance norm op

* mend

[Relay][Op] Add instance norm op
```
  bindog committed 5 years ago
02 Oct, 2019 1 commit
- [QNN][Relay] Calling Dialect passes from inside Relay Build API. (#3971) · 36201fe9
  Animesh Jain committed 5 years ago
  
01 Oct, 2019 1 commit

[TOPI]Add op argwhere (#3994) · fa4d3ec6

* Add op argwhere

* Move shape func to _algorithm.py

* Add lint rule

* Raise exception if rank is not supported

* move argwhere to transform

* Add argwhere example

* Fix lint

* Add 1-d support

* cleanup

* Add more dtype support

* CR comment

* Improve error message

* Docs

* raise exception

committed 5 years ago


29 Sep, 2019 1 commit

[Relay] Move prelude to text format (#3939) · 2dac17d8

* Fix parser

* Doc fix

* Add module utility functions necessary for prelude

* Implement prelude in text format

* Remove programmatically constructed prelude defs

* Fix 0-arity type conses in pretty printer and test

* Make prelude loading backwards-compatible

* Fix patterns

* Improve some prelude defs

* Fix `ImportFromStd`

It needs to also follow the "add unchecked, add checked" pattern

* Lint roller

* Woops

* Address feedback

* Fix `test_list_constructor` VM test

* Fix `test_adt.py` failures

committed 5 years ago


25 Sep, 2019 3 commits

[ARITH] Refactor to use explicit div/mod functions instead of operators. (#4000) · f0079a57
```
* [ARITH] Use explicit div/mod functions instead of operators.

* fix pooling case
```
Tianqi Chen committed 5 years ago

Expose llvm.nearbyint intrinsic. This is a faster alternate to rounding. (#4001) · 17c2c0a1

* Expose llvm.nearbyint intrinsic. This is a faster alternate to rounding.

* Added python binding. Added test.

committed 5 years ago
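
For context on how `nearbyint` differs from rounding: in C, `nearbyint` rounds using the current floating-point rounding mode (round-half-to-even by default), while `round` sends halfway cases away from zero. The sketch below imitates both tie-breaking rules with Python's `decimal` module; the helper names are illustrative, and nothing here describes how TVM lowers either intrinsic.

```python
from decimal import Decimal, ROUND_HALF_EVEN, ROUND_HALF_UP


def round_half_even(x):
    """Ties go to the nearest even integer (default nearbyint behaviour)."""
    return int(Decimal(x).quantize(Decimal(1), rounding=ROUND_HALF_EVEN))


def round_half_away(x):
    """Ties go away from zero (C round behaviour)."""
    return int(Decimal(x).quantize(Decimal(1), rounding=ROUND_HALF_UP))


for x in (0.5, 1.5, 2.5, -2.5, 2.4):
    print(x, round_half_even(x), round_half_away(x))
```

The two rules agree everywhere except on exact ties such as 0.5 or 2.5.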


Changes to make tensorize work. These changes also fix the previously broken test. (#3981) · b410df8c

* Changes to make tensorize work. These changes also fix the previously
broken test.

Summary:
Tensorize was breaking for a few reasons.
1)
Assert at: src/op/tensorize.cc:234 CHECK(is_one(e.region[j]->extent))
In some cases this cannot be proven, e.g.:
expected shape=[16, 4], given region=[range(min=((ax1.outer*16)/16), ext=(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer)), range(min=((k.outer*4)/4), ext=(((((k.outer*4) + 3)/4) + 1) - k.outer)), range(min=0, ext=16), range(min=0, ext=4)]
The unprovable one is: ext=(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer)).
This can be simplified, but it is not, because simplifying the division requires
proving ax1.outer > 0, which is not possible for a free variable. The fix for this is
to find all the vars in the expr and replace them with some const value.

2) Equivalence between the tensorized expr and the one being asked to tensorize could not be established. For example,
the error would be:
TVMError: Check failed: Equal(lhs, rhs):
Failed to match the compute with TensorIntrin tensor_intrin's declaration
provided= reduce(combiner=comm_reducer(result=[(x + y)], lhs=[x], rhs=[y], identity_element=[(int16)0]), source=[(int16(data(k))*int16(kernel(((((((((k.outer.outer*64) + (k.outer.inner*2)) + k)/2)*128) + i) - (k.outer.inner*128)) - (k.outer.outer*4096)), ((((k.outer.outer*64) + (k.outer.inner*2)) + k) % 2))))], axis=[iter_var(k, range(min=0, ext=2))], where=(bool)1, value_index=0),
intrin=  reduce(combiner=comm_reducer(result=[(x + y)], lhs=[x], rhs=[y], identity_element=[(int16)0]), source=[(int16(data(k))*int16(kernel(i, k)))], axis=[iter_var(k, range(min=0, ext=2))], where=(bool)1, value_index=0)
Difference is mainly in the source part:
source=[(int16(data(k))*int16(kernel(((((((((k.outer.outer*64) + (k.outer.inner*2)) + k)/2)*128) + i) - (k.outer.inner*128)) - (k.outer.outer*4096)), ((((k.outer.outer*64) + (k.outer.inner*2)) + k) % 2))))]
source=[(int16(data(k))*int16(kernel(i, k)))], axis=[iter_var(k, range(min=0, ext=2))]
This was not being simplified because compute_intrin_iter_space (the map from
iter var to range) did not contain leaf iter vars.

3) Here it fails with:
Check failed: is_one(Simplify(value->shape[i])): Argument b_buffer shape mismatch[16, 4] vs [(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer), (((((k.outer*4) + 3)/4) + 1) - k.outer), 16, 4]
This is in buffer binding, where the expected shape and the bound buffer
shape are considered different, although they would match if the expr
could be simplified.
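
A small check of the extent expression quoted in points 1) and 3), assuming `/` there means C-style truncating division. `truncdiv` and `extent` are helper names introduced for this sketch. It shows why the sign of `ax1.outer` matters: the extent is 1 for every non-negative value but not for negative ones.

```python
def truncdiv(a, b):
    """Integer division that rounds toward zero, as C/C++ `/` does."""
    q = abs(a) // abs(b)
    return q if (a < 0) == (b < 0) else -q


def extent(outer):
    """ext = ((((outer*16) + 15)/16) + 1) - outer, copied from the message."""
    return truncdiv(outer * 16 + 15, 16) + 1 - outer


# Holds for every non-negative outer index that was tried.
always_one = all(extent(o) == 1 for o in range(0, 1000))
# Fails for a negative index, so the simplifier needs sign information.
negative_case = extent(-1)
```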

Test Plan:
On skylake avx512 machine:
python tests/python/contrib/test_gemm_acc16.py

* Implemented a bounded analyzer that traverses the tree and, for reduce/for
statements, binds the loop bounds in the analyzer. This is later used to
simplify expressions. Inspired by ir_mutator_with_analyzer.

* Addressed comments.

* Added ASF header + define macro for the header file: TVM_ARITHMETIC_IR_VISITOR_WITH_ANALYZER_H_
Some lint fixes as well.

* Relax the assumption that dom_map must always contain all leaf itervars.

* Disable copy constructor and move to raw ptr.

committed 5 years ago


24 Sep, 2019 2 commits

[ARITH] Explicitly state truncdiv/mod in pattern matching. (#3986) · d1830964
```
* [ARITH] Explicitly state truncdiv/mod in pattern matching.

* Fix the dependent cpp test
```
Tianqi Chen committed 5 years ago

[Relay] Add new IR pass CombineParallelDense (#3862) · ed9fdfb0

* Refactor to create abstract ParallelOpCombiner

* First draft of CombineParallelDense

* Begin to work on tests

* Test

* Refactor to move out more common code

* Clean up

* Fix

* Remove statics

* fix wording

* Start to add combine_parallel_op_batch

* Resolve PR comments

* Resolve PR comments

* dummy change to retrigger CI

* Change special case from bias_add to add

* Revert special case change

* Ignore units check

* dummy change to retrigger CI

* dummy change to re-trigger CI

* Improve docs

* Update docs

* Update docs

committed 5 years ago


22 Sep, 2019 2 commits

Qnn fully connected (#3910) · 43f54a58

* Qnn Dense layer.

* Reformatting code.

* Reformatting code and making the test case more readable.

* Fixing lint issues.

* Fixing test method names to pass the nose related configurations.

* Aligning the code for code style.

committed 5 years ago


Add operator `isnan` (#3979) · 16d4da4d
```
* add expr `isnan`

* move to intrinsic

* doc & add to topi

* fix error from ci
```
Huang, Guangtai committed 5 years ago

20 Sep, 2019 2 commits

[ARITH] Add Lowering rule for FloorDiv/Mod (#3976) · d7a09150
```
* [ARITH] Add Lowering rule for FloorDiv/Mod

* add comment about constant folding
```
Tianqi Chen committed 5 years ago
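
For readers unfamiliar with the distinction behind these ARITH commits: truncating division rounds toward zero, floor division rounds toward negative infinity, and the second can be expressed in terms of the first. The sketch below uses a standard identity for that; it is an assumption for illustration and is not claimed to be the lowering rule the commit adds.

```python
def truncdiv(a, b):
    """Integer division that rounds toward zero (C/C++ semantics)."""
    q = abs(a) // abs(b)
    return q if (a < 0) == (b < 0) else -q


def truncmod(a, b):
    """Remainder matching truncdiv; takes the sign of the dividend."""
    return a - truncdiv(a, b) * b


def floordiv_lowered(a, b):
    """Floor division built from truncdiv/truncmod."""
    q, r = truncdiv(a, b), truncmod(a, b)
    # Step down by one when the remainder is nonzero and its sign
    # differs from the divisor's sign.
    if r != 0 and (r < 0) != (b < 0):
        q -= 1
    return q


def floormod_lowered(a, b):
    """Floor modulo built from truncmod; takes the sign of the divisor."""
    r = truncmod(a, b)
    if r != 0 and (r < 0) != (b < 0):
        r += b
    return r
```

Python's own `//` and `%` already use floor semantics, which makes them a convenient reference for checking the lowered versions.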

Add support for MXNet pad operator. (#3739) · 719d6d47

MXNet pad is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.pad

Add support for parameter 'None' in MXNet slice operator.

MXNet 'slice' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.slice

Add support for MXNet cos, sin, arctan

MXNet 'cos' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.cos

MXNet 'sin' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.sin

MXNet arctan is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.arctan

Add support for MXNet 1D Convolution and 1D Deconvolution

MXNet convolution is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.Convolution

MXNet Deconvolution is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.Deconvolution

committed 5 years ago


12 Sep, 2019 2 commits

[RFC] [Contrib] Minimal runtime (~12kb .text on ARMv7/x86) for subset of TVM models (#3567) · 1de52bb0

This is an alternative implementation of a subset of the TVM runtime API (and
graph runtime) that focuses entirely on reducing code size, at the expense of
functionality (no tvm.extern(..) calls via PackedFunc, CPU only, etc). It might
be worth incrementally expanding the surface area if there's interest.

The motivation for this work was seeing what the minimal useful subset of the
TVM runtime is. This is relevant for e.g. super code-size constrained
applications in e.g. embedded/mobile. The current runtime is more like O(100KiB)
or so, so this might be compelling for some users.

The smaller surface area for auditing might make this relevant for
https://github.com/dmlc/tvm/issues/3159, or the usecases I was thinking about in
https://github.com/dmlc/tvm/issues/2523#issuecomment-459165815 re: the Rust
runtime.

The symbols in the tvm::minimalruntime space (i.e. excluding std:: and
picojson::) are about 5KiB, so I think there's a bunch of room here (i.e. we
could replace picojson:: with [`jsmn`](https://zserge.com/jsmn.html) or
something, and we could replace more of the `std::unordered_map` usage, etc. with
custom primitives as well, similar to the `DynArray`).

committed 5 years ago


[Relay][Module] Refactor the way we interface between different modules of Relay. (#3906) · 4e2d707f

* Module refactor

* Add load module

* Add support for idempotent import

* Tweak load paths

* Move path around

* Expose C++ import functions in Python

* Fix import

* Add doc string

* Fix

* Fix lint

* Fix lint

* Fix test failure

* Add type solver

* Fix lint

committed 5 years ago


11 Sep, 2019 1 commit
- [Relay] fix exponential blowup in interpreter (#3559) · 54dbcc28
  雾雨魔理沙 committed 5 years ago
  
09 Sep, 2019 1 commit

[Relay/TOPI][Op] Add erf intrinsic and op (#3702) · 2f5b155a

* add more ops

* stop vectorization for erf

* x

* cleanup

* fix

* add whitelist for vectorizable intrin

* add tf converter

* fix dense

* fix

* add missing intrin

* fix mxnet frontend

* fix nvptx

committed 5 years ago


06 Sep, 2019 1 commit

[Relay] Add ADTs to text format (#3863) · ca0292d8

* Getting closer to having ADT defs

* ADT defs working probably

* Match parsing basically done

* came to earth in a silver chrome UFO

* match finished?

* All tests but newest are passing

* ADT constructors work

now cleanup?

* Cleanup round 1

* Cleanup round 2

* Cleanup round 3

* Cleanup round 4

* Cleanup round 6

* Cleanup round 7

* Lil grammar fix

* Remove ANTLR Java files

* Lint roller

* Lint roller

* Address feedback

* Test completeness in match test

* Remove unused imports

* Lint roller

* Switch to Rust-style ADT syntax

* Lil fix

* Add dummy `extern type` handler

* Add type arg to test

* Update prelude semantic version

* Repair test

* Fix graph var handling in match

* Revert 's/graph_equal/is_unifiable' change

committed 5 years ago


05 Sep, 2019 2 commits
- [Relay] add Tuple pattern (#3596) · 08d92203
```
* implement tuple pattern

* add tuple pattern

* lint;

* lint

* lint

* fix error

* fix

* add test
```
  雾雨魔理沙 committed 5 years ago
- [QNN] Add - Refactoring to C++ (#3736) · a6bb84a8
  Animesh Jain committed 5 years ago
  
04 Sep, 2019 1 commit
- [QNN] Convolution 2D Implementation. (#3580) · 0d4870cc
```
Rebasing. Empty commit.

Clang-format styling.
```
  Animesh Jain committed 5 years ago
03 Sep, 2019 2 commits

Revert "[Runtime] Allow parameter sharing between modules (#3489)" (#3884) · 6b0359b4
```
This reverts commit 224cc243.
```
Tianqi Chen committed 5 years ago

[Runtime] Allow parameter sharing between modules (#3489) · 224cc243

As GraphRuntime does not provide control-flow logic, we have to split
our model into two parts, and we need to share parameters between them
to save memory.

Solution:
1) add "lazy_init_input" in graph's attributes
   "attrs": {
     ... ...
     "lazy_init_input": [
       "list_str",
       [
         "p0"
       ]
     ]
    }
2) allow un-allocated NDArray entry in SetupStorage
3) utilize "set_input_zero_copy" function to set parameters
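
A sketch of step 1, showing how the `lazy_init_input` attribute could be attached to a graph JSON before it is handed to the runtime. `mark_lazy_inputs` is a hypothetical helper, and the surrounding graph dict is a stand-in for a real graph. Note that, per the log, this change was reverted by 6b0359b4.

```python
import json


def mark_lazy_inputs(graph, names):
    """Record input names whose storage should not be allocated up front."""
    attrs = graph.setdefault("attrs", {})
    # Same ["list_str", [...]] encoding as in the fragment above.
    attrs["lazy_init_input"] = ["list_str", list(names)]
    return graph


graph = mark_lazy_inputs({"attrs": {}}, ["p0"])
print(json.dumps(graph, indent=2))
```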

committed 5 years ago


01 Sep, 2019 2 commits

[Relay][Any] Add shape func for dynamic shape (#3606) · eef35a57

* init shape func in interpreter and vm compiler

* Update interpreter

* fix

* lint

* lint

* fix

* remove hack

* update

* fix

* fix

* update

* address comments & update for shape_of

* fix lint

* update

* fix hybrid

* lint

* fix bug & add take shape func

* lint

* lint

* update

* fix flaky test

* add todo

committed 5 years ago


[Relay] Bitserial ops (#3844) · d08c74ca

* Added arm_cpu NHWC schedules.

* Fixed kernel shape legalization.

* Added bitserial ops to relay.

* Snapshot and more missing files.

* Added dense testing.

* Added tests

* Added ASF header to new files.

* cc lint

* Pylint change.

* pylint fixes.

* Change arm legalize test.

* Added assert check to arm legalize.

* Added better documentation, fixed some bad style

* Reverted arm conv2d nhwc changes.

committed 5 years ago


31 Aug, 2019 1 commit
- [QNN] Concat - Refactoring to C++ (#3819) · ec7790e3
  Animesh Jain committed 5 years ago
  
30 Aug, 2019 1 commit
- [Relay][QNN] QNNtoRelay & QNNLegalize Pass utility using Relay Legalize API. (#3838) · 671421a8
  Animesh Jain committed 5 years ago
  
23 Aug, 2019 1 commit
- [CODE] Halide attributions (#3824) · 17f8f96b
  Tianqi Chen committed 5 years ago
  