Commits · 2ded2d8caf3279c2fb9dbe16c275f17f8a61d6f2 · wenyuanbo / tic

27 Sep, 2019 2 commits
- [ARITH] Use explicit div mode in python. (#4014) · 2ded2d8c
  Tianqi Chen committed Sep 27, 2019
  
  2ded2d8c Browse Files
- Exposed lowered func to c++ API. (#4012) · 16bed7e6
```
So that you can use: `build_mod_.GetFunction("get_lowered_funcs", false);`
to get lowered_funcs.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:
```
  Kimish Patel committed Sep 26, 2019
  16bed7e6 Browse Files
26 Sep, 2019 3 commits

hide psutil (#4013) · 01e53935
Haozheng Fan committed Sep 26, 2019

01e53935 Browse Files
[QNN][Conv2D] Optimize lowering. (#4006) · d7998d39
Animesh Jain committed Sep 26, 2019

d7998d39 Browse Files

[TOPI][x86] Introduce schedule_injective_from_existing and unify external… · b330d301

[TOPI][x86] Introduce schedule_injective_from_existing and unify external schedules for all targets (#3983)

* Fix extern schedule for x86

* Register x86::schedule_extern

* Fix

* Fix

* Replace extern.py with extern.h

* Introduce new generic function schedule_injective_from_existing

* Fix

* Fix

* Add back to C++

* Fix style

* Injective schedule calls local schedule_injective_from_existing

* Fix

* Remove target arg from schedule_injective_from_existing

* Fix docs

* Try to fix unit test

* Fix test

* Fix other tests

* Fix bug

committed Sep 26, 2019

b330d301 Browse Files

25 Sep, 2019 11 commits

[RELAY]impose a max op limit to the op fusion pass (#4002) · d21f0ad5
```
* impose a max op limit to op fusion

* use cross platform data type
```
Yida Wang committed Sep 25, 2019
d21f0ad5 Browse Files

[TOPI] Move conv2d spatial pack schedule to dedicated file (#3972) · f1d2d46b

More schedules are making the conv2d.py file too large, so
we'd like to move the spatial pack schedule to dedicated file
before introducing NHWC schedule. No logic change in this patch.

committed Sep 25, 2019

f1d2d46b Browse Files

Revert "Added tesnorizeation for avx2 based gemm. (#3982)" (#4007) · 4a3abb94
```
This reverts commit 23727eb4.
```
Tianqi Chen committed Sep 25, 2019
4a3abb94 Browse Files
remove FLOP computation for 3rd party lib call (#4005) · 5f19e5a8
Cody Hao Yu committed Sep 25, 2019

5f19e5a8 Browse Files
[ARITH] Refactor to use explicit div/mod functions instead of operators. (#4000) · f0079a57
```
* [ARITH] Use explicit div/mod functions instead of operators.

* fix pooling case
```
Tianqi Chen committed Sep 25, 2019
f0079a57 Browse Files

Expose llvm.nearbyint intrinsic. This is a faster alternate to rounding. (#4001) · 17c2c0a1

* Expose llvm.nearbyint intrinsic. This is a faster alternate to rounding.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

* Added python binding. Added test.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

committed Sep 25, 2019

17c2c0a1 Browse Files

Change Vivado install instructions to version 2018.3 (#4003) · 9756b067
Philipp Krones committed Sep 25, 2019

9756b067 Browse Files

Added tesnorizeation for avx2 based gemm. (#3982) · 23727eb4

* Added tesnorizeation for avx2 based gemm.

Summary:
Tensorized the same region as avx512. Names produce 16x1 int32 results.
Does by doing two sets of AVX2 instructions to do reduction on 8x4 int8
kernel with 1x4 data.

Test Plan:
on avx2 machine:
python tests/python/contrib/test_gemm_avx2_acc32.py

Reviewers:

Subscribers:

Tasks:

Tags:

* Fix lint errors. Removed commented out code.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

committed Sep 25, 2019

23727eb4 Browse Files

[COMMUNITY] @yongwww-> reviewer (#3997) · 9baff009
Tianqi Chen committed Sep 24, 2019

9baff009 Browse Files
add parser support for GREATER tflite operator (#3963) · 21353e5f
```
add test for GREATER
```
Ina Dobreva committed Sep 24, 2019
21353e5f Browse Files

Changes to make tensorize work. These changes also fix the previously broken test. (#3981) · b410df8c

* Changes to make tensorize work. These changes also fix the previously
broken test.

Summary:
Tensorize was breaking  for a few reasons.
1)
Assert at: src/op/tensorize.cc:234 CHECK(is_one(e.region[j]->extent))
In some cases this cannot be proven, e.g.:
expected shape=[16, 4], given region=[range(min=((ax1.outer*16)/16), ext=(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer)), range(min=((k.outer*4)/4), ext=(((((k.outer*4) + 3)/4) + 1) - k.outer)), range(min=0, ext=16), range(min=0, ext=4)]
The unprovable one is: ext=(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer)).
This can be simplified but it is not because to simplify divide, it must
prove ax1.outer > 0 and since it is var it cannot. The fix for this to
just find all the vars in expr in relace them with some const value.

2) Equivalence between tensorized expr and one being asked to tensorize. For example,
the error would be.
TVMError: Check failed: Equal(lhs, rhs):
Failed to match the compute with TensorIntrin tensor_intrin's declaration
provided= reduce(combiner=comm_reducer(result=[(x + y)], lhs=[x], rhs=[y], identity_element=[(int16)0]), source=[(int16(data(k))*int16(kernel(((((((((k.outer.outer*64) + (k.outer.inner*2)) + k)/2)*128) + i) - (k.outer.inner*128)) - (k.outer.outer*4096)), ((((k.outer.outer*64) + (k.outer.inner*2)) + k) % 2))))], axis=[iter_var(k, range(min=0, ext=2))], where=(bool)1, value_index=0),
intrin=  reduce(combiner=comm_reducer(result=[(x + y)], lhs=[x], rhs=[y], identity_element=[(int16)0]), source=[(int16(data(k))*int16(kernel(i, k)))], axis=[iter_var(k, range(min=0, ext=2))], where=(bool)1, value_index=0)
Difference is mainly in the source part:
source=[(int16(data(k))*int16(kernel(((((((((k.outer.outer*64) + (k.outer.inner*2)) + k)/2)*128) + i) - (k.outer.inner*128)) - (k.outer.outer*4096)), ((((k.outer.outer*64) + (k.outer.inner*2)) + k) % 2))))]
source=[(int16(data(k))*int16(kernel(i, k)))], axis=[iter_var(k, range(min=0, ext=2))]
This was not being simpifiled due to compute_intrin_iter_space (map for
iter var to range) not containing leaf iter vars.

3) Here it fails with:
Check failed: is_one(Simplify(value->shape[i])): Argument b_buffer shape mismatch[16, 4] vs [(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer), (((((k.outer*4) + 3)/4) + 1) - k.outer), 16, 4]
This is in buffer binding where it thinks expected and buffer bound
shape is different. Although if we could simplify expr, this would not
be the case.

Test Plan:
On skylake avx512 machine:
python tests/python/contrib/test_gemm_acc16.py

Reviewers:

Subscribers:

Tasks:

Tags:

* Implemented bounded analyzer which traverses tree and for reduce/for
statements binds the bound of the analyzer. Later this is used to
simplify expressions. Inspired from ir_mutator_with_analyzer

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

* Addressed comments.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

* Added ASF header + define macro for the header file: TVM_ARITHMETIC_IR_VISITOR_WITH_ANALYZER_H_
Some lint fixes as well.

* Relax the assumption that dom_map must always contain all leaf itervars.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

* Disable copy constructor and move to raw ptr.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

committed Sep 24, 2019

b410df8c Browse Files

24 Sep, 2019 6 commits
- [ARITH] Explicitly state truncdiv/mod in pattern matching. (#3986) · d1830964
```
* [ARITH] Explicitly state truncdiv/mod in pattern matching.

* Fix the dependent cpp test
```
  Tianqi Chen committed Sep 24, 2019
  d1830964 Browse Files
- add parser support for TANH tflite operator (#3996) · c48e1cc1
  Ina Dobreva committed Sep 24, 2019
  
  c48e1cc1 Browse Files
- [Relay] Add new IR pass CombineParallelDense (#3862) · ed9fdfb0
```
* Refactor to create abstract ParallelOpCombiner

* First draft of CombineParallelDense

* Begin to work on tests

* Test

* Refactor to move out more common code

* Clean up

* Fix

* Remove statics

* fix wording

* Start to add combine_parallel_op_batch

* Resolve PR comments

* Resolve PR comments

* dummy change to retrigger CI

* Change special case from bias_add to add

* Revert special case change

* Ignore units check

* dummy change to retrigger CI

* dummy change to re-trigger CI

* Improve docs

* Update docs

* Update docs
```
  Jon Soifer committed Sep 24, 2019
  ed9fdfb0 Browse Files
- Add type solver unit tests for unifying quantified funcs (one bug found) (#3947) · df6f54ac
  Steven S. Lyubomirsky committed Sep 24, 2019
  
  df6f54ac Browse Files
- [Relay][Frontend][ONNX] Add Erf to ONNX frontend (#3988) · ba4d081c
```
* Add Erf to ONNX frontend

* dummy change to retrigger CI
```
  Jon Soifer committed Sep 24, 2019
  ba4d081c Browse Files
- [DOC] Add test script starter command to document (#3993) · 3f7cbed8
  StandbyMe committed Sep 23, 2019
  
  3f7cbed8 Browse Files
23 Sep, 2019 1 commit
- [QNN] Fix padding changes due to #3739 (#3989) · 8eb3157a
  Animesh Jain committed Sep 23, 2019
  
  8eb3157a Browse Files
22 Sep, 2019 3 commits
- [Rust] Fixes "common" sub crate using nightly and master (#3965) · cb1faf8a
  Paddy Horan committed Sep 22, 2019
  
  cb1faf8a Browse Files
- Qnn fully connected (#3910) · 43f54a58
```
* Qnn Dense layer.

* Reformatting code.

* Reformatting code and making the test case more readable.

* Fixing lint issues.

* Fixing test method names to pass the nose related configurations.

* Aligning the code for code style.
```
  shoubhik committed Sep 22, 2019
  43f54a58 Browse Files
- Add operator `isnan` (#3979) · 16d4da4d
```
* add expr `isnan`

* move to intrinsic

* doc & add to topi

* fix error from ci
```
  Huang, Guangtai committed Sep 22, 2019
  16d4da4d Browse Files
21 Sep, 2019 3 commits
- Add docs for analysis namespace (#3985) · 88cd1b1c
  Zhi committed Sep 22, 2019
  
  88cd1b1c Browse Files
- Enable miopen Group Convolution (#3987) · 0257a88b
```
* enable group conv through miopen

* linter fix
```
  Peter Yeh committed Sep 21, 2019
  0257a88b Browse Files
- add bc for gfx1010 (#3984) · beb1c252
  Peter Yeh committed Sep 21, 2019
  
  beb1c252 Browse Files
20 Sep, 2019 5 commits

[Relay][Frontend][TFLite] frontend operator support: batch_to_space_nd, space_to_batch_nd (#3850) · 4ba911a7

* Fix unittest

* Fix pylint error: Line 915 too long

* Fix the conflicting files

* frontend operator support: space_to_batch_nd

* add test case for frontend operator support: space_to_batch_nd

* add test case for frontend operator support: space_to_batch_nd

* frontend operator support: space_to_batch_nd

* Fix ValueError: don't know how to convert type <class 'numpy.ndarray'> to node

committed Sep 20, 2019

4ba911a7 Browse Files

[Relay][Frontend][ONNX] operator support: Tile (#3941) · 8a2f10e0
```
* [Relay][Frontend][ONNX] operator support: Tile

* Trigger notification
```
Neo Chien committed Sep 20, 2019
8a2f10e0 Browse Files
[ARITH] Add Lowering rule for FloorDiv/Mod (#3976) · d7a09150
```
* [ARITH] Add Lowering rule for FloorDiv/Mod

* add comment about constant folding
```
Tianqi Chen committed Sep 20, 2019
d7a09150 Browse Files

Add support for MXNet pad operator. (#3739) · 719d6d47

MXNet pad is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.pad

Add support for parameter 'None' in MXNet slice operator.

MXNet 'slice' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.slice

Add support for MXNet cos, sin, arctan

MXNet 'cos' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.cos

MXNet 'sin' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.sin

MXNet arctan is descirbed at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.arctan

Add support for MXNet 1D Convolution and 1D Deconvolution

MXNet convolution is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.Convolution

MXNet Deconvolution is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.Deconvolution

committed Sep 19, 2019

719d6d47 Browse Files

[QNN] Renaming tests to follow the Relay nomenclature. (#3975) · 0840b064
Animesh Jain committed Sep 20, 2019

0840b064 Browse Files

19 Sep, 2019 5 commits
- [TOPI] Add proper scheduling for dense on CUDA (#3923) · bec08fec
```
* add proper scheduling for dense on CUDA

* add fallback config and fix unit test

* fix corner cases

* refactoring

* fix bias and add testcase

* let fusion happen
```
  Cody Hao Yu committed Sep 19, 2019
  bec08fec Browse Files
- Remove GTest cmake flag from install docs (#3953) · 1d00c083
  Meghan Cowan committed Sep 19, 2019
  
  1d00c083 Browse Files
- adjust pylint output (#3973) · bbc5fb0e
```
adjust pylint output to show file location to make it possible to locate errors
```
  Ina Dobreva committed Sep 19, 2019
  bbc5fb0e Browse Files
- [Relay] Legalize and AlterOpLayout for Int8 Intel. (#3961) · b0ddcff6
  Animesh Jain committed Sep 19, 2019
  
  b0ddcff6 Browse Files
- [ARITH] Introduce base-class IRMutatorWithAnalyzer for scope dependent analysis (#3969) · 92439166
  Tianqi Chen committed Sep 18, 2019
  
  92439166 Browse Files
18 Sep, 2019 1 commit

[Relay] Add shape check for ConcatenateRel and StackRel (#3699) · cdbf4d85

* [Relay] add shape check for concat

* [Relay] add shape check for stack

* add test case for shape mismatch

* [typo] add the missing assert

* fix lint errors.

* replace int with size_t.

* statically cast param->axis to size_t.

* switch to run_infer_type.

* fix checking for negative index

* add static_cast for param->axis

* merge to latest tvm

* fix lint error

* Fix an error with negative index.

* Update transform.h

* Update transform.cc

committed Sep 18, 2019

cdbf4d85 Browse Files