Commits · 9756b067956511a32cdee8dbc5030318740b7337 · wenyuanbo / tic

25 Sep, 2019 5 commits

Change Vivado install instructions to version 2018.3 (#4003) · 9756b067
Philipp Krones committed Sep 25, 2019

9756b067 Browse Files

Added tesnorizeation for avx2 based gemm. (#3982) · 23727eb4

* Added tesnorizeation for avx2 based gemm.

Summary:
Tensorized the same region as avx512. Names produce 16x1 int32 results.
Does by doing two sets of AVX2 instructions to do reduction on 8x4 int8
kernel with 1x4 data.

Test Plan:
on avx2 machine:
python tests/python/contrib/test_gemm_avx2_acc32.py

Reviewers:

Subscribers:

Tasks:

Tags:

* Fix lint errors. Removed commented out code.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

committed Sep 25, 2019

23727eb4 Browse Files

[COMMUNITY] @yongwww-> reviewer (#3997) · 9baff009
Tianqi Chen committed Sep 24, 2019

9baff009 Browse Files
add parser support for GREATER tflite operator (#3963) · 21353e5f
```
add test for GREATER
```
Ina Dobreva committed Sep 24, 2019
21353e5f Browse Files

Changes to make tensorize work. These changes also fix the previously broken test. (#3981) · b410df8c

* Changes to make tensorize work. These changes also fix the previously
broken test.

Summary:
Tensorize was breaking  for a few reasons.
1)
Assert at: src/op/tensorize.cc:234 CHECK(is_one(e.region[j]->extent))
In some cases this cannot be proven, e.g.:
expected shape=[16, 4], given region=[range(min=((ax1.outer*16)/16), ext=(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer)), range(min=((k.outer*4)/4), ext=(((((k.outer*4) + 3)/4) + 1) - k.outer)), range(min=0, ext=16), range(min=0, ext=4)]
The unprovable one is: ext=(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer)).
This can be simplified but it is not because to simplify divide, it must
prove ax1.outer > 0 and since it is var it cannot. The fix for this to
just find all the vars in expr in relace them with some const value.

2) Equivalence between tensorized expr and one being asked to tensorize. For example,
the error would be.
TVMError: Check failed: Equal(lhs, rhs):
Failed to match the compute with TensorIntrin tensor_intrin's declaration
provided= reduce(combiner=comm_reducer(result=[(x + y)], lhs=[x], rhs=[y], identity_element=[(int16)0]), source=[(int16(data(k))*int16(kernel(((((((((k.outer.outer*64) + (k.outer.inner*2)) + k)/2)*128) + i) - (k.outer.inner*128)) - (k.outer.outer*4096)), ((((k.outer.outer*64) + (k.outer.inner*2)) + k) % 2))))], axis=[iter_var(k, range(min=0, ext=2))], where=(bool)1, value_index=0),
intrin=  reduce(combiner=comm_reducer(result=[(x + y)], lhs=[x], rhs=[y], identity_element=[(int16)0]), source=[(int16(data(k))*int16(kernel(i, k)))], axis=[iter_var(k, range(min=0, ext=2))], where=(bool)1, value_index=0)
Difference is mainly in the source part:
source=[(int16(data(k))*int16(kernel(((((((((k.outer.outer*64) + (k.outer.inner*2)) + k)/2)*128) + i) - (k.outer.inner*128)) - (k.outer.outer*4096)), ((((k.outer.outer*64) + (k.outer.inner*2)) + k) % 2))))]
source=[(int16(data(k))*int16(kernel(i, k)))], axis=[iter_var(k, range(min=0, ext=2))]
This was not being simpifiled due to compute_intrin_iter_space (map for
iter var to range) not containing leaf iter vars.

3) Here it fails with:
Check failed: is_one(Simplify(value->shape[i])): Argument b_buffer shape mismatch[16, 4] vs [(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer), (((((k.outer*4) + 3)/4) + 1) - k.outer), 16, 4]
This is in buffer binding where it thinks expected and buffer bound
shape is different. Although if we could simplify expr, this would not
be the case.

Test Plan:
On skylake avx512 machine:
python tests/python/contrib/test_gemm_acc16.py

Reviewers:

Subscribers:

Tasks:

Tags:

* Implemented bounded analyzer which traverses tree and for reduce/for
statements binds the bound of the analyzer. Later this is used to
simplify expressions. Inspired from ir_mutator_with_analyzer

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

* Addressed comments.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

* Added ASF header + define macro for the header file: TVM_ARITHMETIC_IR_VISITOR_WITH_ANALYZER_H_
Some lint fixes as well.

* Relax the assumption that dom_map must always contain all leaf itervars.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

* Disable copy constructor and move to raw ptr.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

committed Sep 24, 2019

b410df8c Browse Files

24 Sep, 2019 6 commits
- [ARITH] Explicitly state truncdiv/mod in pattern matching. (#3986) · d1830964
```
* [ARITH] Explicitly state truncdiv/mod in pattern matching.

* Fix the dependent cpp test
```
  Tianqi Chen committed Sep 24, 2019
  d1830964 Browse Files
- add parser support for TANH tflite operator (#3996) · c48e1cc1
  Ina Dobreva committed Sep 24, 2019
  
  c48e1cc1 Browse Files
- [Relay] Add new IR pass CombineParallelDense (#3862) · ed9fdfb0
```
* Refactor to create abstract ParallelOpCombiner

* First draft of CombineParallelDense

* Begin to work on tests

* Test

* Refactor to move out more common code

* Clean up

* Fix

* Remove statics

* fix wording

* Start to add combine_parallel_op_batch

* Resolve PR comments

* Resolve PR comments

* dummy change to retrigger CI

* Change special case from bias_add to add

* Revert special case change

* Ignore units check

* dummy change to retrigger CI

* dummy change to re-trigger CI

* Improve docs

* Update docs

* Update docs
```
  Jon Soifer committed Sep 24, 2019
  ed9fdfb0 Browse Files
- Add type solver unit tests for unifying quantified funcs (one bug found) (#3947) · df6f54ac
  Steven S. Lyubomirsky committed Sep 24, 2019
  
  df6f54ac Browse Files
- [Relay][Frontend][ONNX] Add Erf to ONNX frontend (#3988) · ba4d081c
```
* Add Erf to ONNX frontend

* dummy change to retrigger CI
```
  Jon Soifer committed Sep 24, 2019
  ba4d081c Browse Files
- [DOC] Add test script starter command to document (#3993) · 3f7cbed8
  StandbyMe committed Sep 23, 2019
  
  3f7cbed8 Browse Files
23 Sep, 2019 1 commit
- [QNN] Fix padding changes due to #3739 (#3989) · 8eb3157a
  Animesh Jain committed Sep 23, 2019
  
  8eb3157a Browse Files
22 Sep, 2019 3 commits
- [Rust] Fixes "common" sub crate using nightly and master (#3965) · cb1faf8a
  Paddy Horan committed Sep 22, 2019
  
  cb1faf8a Browse Files
- Qnn fully connected (#3910) · 43f54a58
```
* Qnn Dense layer.

* Reformatting code.

* Reformatting code and making the test case more readable.

* Fixing lint issues.

* Fixing test method names to pass the nose related configurations.

* Aligning the code for code style.
```
  shoubhik committed Sep 22, 2019
  43f54a58 Browse Files
- Add operator `isnan` (#3979) · 16d4da4d
```
* add expr `isnan`

* move to intrinsic

* doc & add to topi

* fix error from ci
```
  Huang, Guangtai committed Sep 22, 2019
  16d4da4d Browse Files
21 Sep, 2019 3 commits
- Add docs for analysis namespace (#3985) · 88cd1b1c
  Zhi committed Sep 22, 2019
  
  88cd1b1c Browse Files
- Enable miopen Group Convolution (#3987) · 0257a88b
```
* enable group conv through miopen

* linter fix
```
  Peter Yeh committed Sep 21, 2019
  0257a88b Browse Files
- add bc for gfx1010 (#3984) · beb1c252
  Peter Yeh committed Sep 21, 2019
  
  beb1c252 Browse Files
20 Sep, 2019 5 commits

[Relay][Frontend][TFLite] frontend operator support: batch_to_space_nd, space_to_batch_nd (#3850) · 4ba911a7

* Fix unittest

* Fix pylint error: Line 915 too long

* Fix the conflicting files

* frontend operator support: space_to_batch_nd

* add test case for frontend operator support: space_to_batch_nd

* add test case for frontend operator support: space_to_batch_nd

* frontend operator support: space_to_batch_nd

* Fix ValueError: don't know how to convert type <class 'numpy.ndarray'> to node

committed Sep 20, 2019

4ba911a7 Browse Files

[Relay][Frontend][ONNX] operator support: Tile (#3941) · 8a2f10e0
```
* [Relay][Frontend][ONNX] operator support: Tile

* Trigger notification
```
Neo Chien committed Sep 20, 2019
8a2f10e0 Browse Files
[ARITH] Add Lowering rule for FloorDiv/Mod (#3976) · d7a09150
```
* [ARITH] Add Lowering rule for FloorDiv/Mod

* add comment about constant folding
```
Tianqi Chen committed Sep 20, 2019
d7a09150 Browse Files

Add support for MXNet pad operator. (#3739) · 719d6d47

MXNet pad is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.pad

Add support for parameter 'None' in MXNet slice operator.

MXNet 'slice' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.slice

Add support for MXNet cos, sin, arctan

MXNet 'cos' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.cos

MXNet 'sin' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.sin

MXNet arctan is descirbed at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.arctan

Add support for MXNet 1D Convolution and 1D Deconvolution

MXNet convolution is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.Convolution

MXNet Deconvolution is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.Deconvolution

committed Sep 19, 2019

719d6d47 Browse Files

[QNN] Renaming tests to follow the Relay nomenclature. (#3975) · 0840b064
Animesh Jain committed Sep 20, 2019

0840b064 Browse Files

19 Sep, 2019 5 commits
- [TOPI] Add proper scheduling for dense on CUDA (#3923) · bec08fec
```
* add proper scheduling for dense on CUDA

* add fallback config and fix unit test

* fix corner cases

* refactoring

* fix bias and add testcase

* let fusion happen
```
  Cody Hao Yu committed Sep 19, 2019
  bec08fec Browse Files
- Remove GTest cmake flag from install docs (#3953) · 1d00c083
  Meghan Cowan committed Sep 19, 2019
  
  1d00c083 Browse Files
- adjust pylint output (#3973) · bbc5fb0e
```
adjust pylint output to show file location to make it possible to locate errors
```
  Ina Dobreva committed Sep 19, 2019
  bbc5fb0e Browse Files
- [Relay] Legalize and AlterOpLayout for Int8 Intel. (#3961) · b0ddcff6
  Animesh Jain committed Sep 19, 2019
  
  b0ddcff6 Browse Files
- [ARITH] Introduce base-class IRMutatorWithAnalyzer for scope dependent analysis (#3969) · 92439166
  Tianqi Chen committed Sep 18, 2019
  
  92439166 Browse Files
18 Sep, 2019 3 commits

[Relay] Add shape check for ConcatenateRel and StackRel (#3699) · cdbf4d85

* [Relay] add shape check for concat

* [Relay] add shape check for stack

* add test case for shape mismatch

* [typo] add the missing assert

* fix lint errors.

* replace int with size_t.

* statically cast param->axis to size_t.

* switch to run_infer_type.

* fix checking for negative index

* add static_cast for param->axis

* merge to latest tvm

* fix lint error

* Fix an error with negative index.

* Update transform.h

* Update transform.cc

committed Sep 18, 2019

cdbf4d85 Browse Files

[TVM][AutoTVM] cast filepath arguments to string (#3968) · f3abb3d8
Neo Chien committed Sep 18, 2019

f3abb3d8 Browse Files

[Relay] Keras frontend upsample and 1 channel conv2d fixes (#3937) · de123760

* Fix upsample layout in keras frontend.

* Fixed group conv being used instead of conv when channels=1

* Add new conv2d test to catch bugs when channels=1.

committed Sep 18, 2019

de123760 Browse Files

17 Sep, 2019 3 commits
- Adding support to check if an attribute is present or not without having to get the value (#3957) · fc071daf
```
* Adding support to check if an attribute is present or not without having to get the value.

* - Renaming the method to more appropriate name.
```
  shoubhik committed Sep 17, 2019
  fc071daf Browse Files
- [Vulkan] Minor optimization for deferred token lookups. (#3960) · 1fe17d14
```
Use a hash map keyed on the descriptor set to avoid bad asymptotic behaviour.
```
  Andrew Tulloch committed Sep 17, 2019
  1fe17d14 Browse Files
- More friendly error msg; Fix Android Demo LLVM ver (#3962) · a3073457
  Junru Shao committed Sep 17, 2019
  
  a3073457 Browse Files
16 Sep, 2019 6 commits

[TOPI] Setting up AutoTVM template for Intel Int8 conv2D (#3955) · 3edf5260
Animesh Jain committed Sep 17, 2019

3edf5260 Browse Files

[TOPI] Improve conv2d_transpose schedule on X86 and CUDA (#3948) · c846d17c

* improve conv2d_transpose x86 performance by reusing conv2d schedule

* parallelize across batches to make large-batch conv2d and conv2d_transpose faster

* improve doc for autotvm.task.space.FallbackConfigEntity.fallback_with_reference_log

* add fallback schedule for schedule_conv2d_transpose_nchw_cuda

* fix pylint

* fix pylint

* unify conv2d_transpose declaration in topi.nn and topi.x86

committed Sep 16, 2019

c846d17c Browse Files

[Graph Tuner] Fix benchmark layout in graph tuner (#3926) · b577171d
```
* Fix graph tuner benchmarking layout transform

* Add test
```
Yao Wang committed Sep 17, 2019
b577171d Browse Files
[tvm][codegen] Make buffer auto broadcast independent to the order of input args (#3956) · 8577c81b
```
* [tvm][codegen] Make buffer auto broadcast independent to the order of the input arg

* fix indent
```
Zhi committed Sep 16, 2019
8577c81b Browse Files

[TOPI] operator support: logical_and, logical_or, logical_not (#3929) · ab1853c2

* [TOPI] operator support: logical_and, logical_or, logical_not

* [TOPI] operator support: logical_and, logical_or, logical_not

* [TOPI] fix test cases for operator support: logical_and, logical_or, logical_not

* [TOPI] fix test cases for operator support: logical_not

committed Sep 16, 2019

ab1853c2 Browse Files

[QNN] Legalization for Intel x86 QNN Conv2D (#3896) · 26eaea4a
```
* QNNLegalize for conv2d

* [QNN] Legalization for Intel x86 QNN Conv2D
```
Animesh Jain committed Sep 16, 2019
26eaea4a Browse Files