Commits · 7d911f46c3a3a02cd541435aa2495ceca57a88ba · wenyuanbo / tic

03 Oct, 2019 1 commit
- [Relay][Op] Add instance norm op (#4004) · 7d911f46
```
* [Relay][Op] Add instance norm op

* mend

[Relay][Op] Add instance norm op
```
  bindog committed Oct 02, 2019
02 Oct, 2019 3 commits
- [QNN][Relay] Calling Dialect passes from inside Relay Build API. (#3971) · 36201fe9
  Animesh Jain committed Oct 02, 2019
  
- [RELAY/PASS] Fix the extent for the post_stmt in the loop partition (#3734) · a7873b0a
  Umang Yadav committed Oct 02, 2019
  
- [TF][Op] Op where (#4045) · 59cf5735
```
* [TF][Op] Add TF op Where

* improve tests

* add tests for vm
```
  Wei Chen committed Oct 02, 2019
01 Oct, 2019 4 commits

Fix split's last factor issue (#4044) · 2d537621
Cody Hao Yu committed Oct 01, 2019

[COMMUNITY] ajtulloch -> committer (#4043) · 2f1edb99
Tianqi Chen committed Oct 01, 2019


[TOPI]Add op argwhere (#3994) · fa4d3ec6

* Add op argwhere

* Move shape func to _algorithm.py

* Add lint rule

* Raise exception if rank is not supported

* move argwhere to transform

* Add argwhere example

* Fix lint

* Add 1-d support

* cleanup

* Add more dtype support

* CR comment

* Improve error message

* Docs

* raise exception

committed Oct 01, 2019


[topi] add ARM v8.2 udot (uint8) support (#3978) · 5cc17649

* [topi] add ARM v8.2 udot (uint8) support

* fix test case

* fix common conv2d schedule

* add back fp32_time in test

* fix lint

* fix doc, add support for int32_lanes=4, signed int

* fix lint

* add ic_bn % 4 checker in schedule

committed Oct 01, 2019


30 Sep, 2019 5 commits
- [COMMUNITY] anijain2305 -> reviewer (#4036) · 85a1d3ff
  Tianqi Chen committed Oct 01, 2019
  
- [QNN] Renaming dense operator. (#4033) · 0cd80478
  Animesh Jain committed Sep 30, 2019
  
- [Relay][Compile_engine] Int64 shape handling for outputs. (#4031) · d0fe532e
  Animesh Jain committed Sep 30, 2019
  
- Add dmlc-core to the list of installed header directories. (#4035) · 1bff2c89
```
There are dependencies on dmlc-core in TVM public API headers
(e.g. some headers include dmlc/logging.h) so it needs to be installed
as part of TVM for TVM headers to be actually usable.
```
  ndl committed Sep 30, 2019
- [ARITH] migrate indexdiv/mod to floordiv/mod (#4008) · f5f2feea
  Tianqi Chen committed Sep 29, 2019
  
29 Sep, 2019 3 commits

[Relay] Move prelude to text format (#3939) · 2dac17d8

* Fix parser

* Doc fix

* Add module utility functions necessary for prelude

* Implement prelude in text format

* Remove programmatically constructed prelude defs

* Fix 0-arity type conses in pretty printer and test

* Make prelude loading backwards-compatible

* Fix patterns

* Improve some prelude defs

* Fix `ImportFromStd`

It needs to also follow the "add unchecked, add checked" pattern

* Lint roller

* Woops

* Address feedback

* Fix `test_list_constructor` VM test

* Fix `test_adt.py` failures

committed Sep 29, 2019


make tvm compilable by gcc 4.9.2 (#4032) · 9b46ace1
```
please see https://stackoverflow.com/a/26949099
```
egolearner committed Sep 29, 2019

[AUTOTVM][DOCS] Add a link to the defining network description of auto-tuning tutorial (#4023) · 8f18cc44

* [AUTOTVM][DOCS] Add a link to autoTVM tutorial to direct the details of building NN with relay

committed Sep 28, 2019


28 Sep, 2019 4 commits
- [ARITH] cleanup the indexmod/div on python side (#4028) · f98035b0
  Tianqi Chen committed Sep 28, 2019
  
- [Fix] Add more pad_mode support for onnx converter (#4029) · bbf82e0e
```
* [Fix] Add more pad_mode support for onnx converter

* robustness fix
```
  bindog committed Sep 28, 2019
- Add parser support for ReLU tflite operator (#4022) · 4f712c79
  Ina Dobreva committed Sep 27, 2019
  
- Additional MXNet Convolution and Deconvolution tests (#4026) · 9151d435
```
Add different batch sizes and channel numbers to
MXNet Convolution and Deconvolution tests.
```
  Alex Gladkov committed Sep 27, 2019
27 Sep, 2019 6 commits
- docs: minor spelling tweaks (#4027) · 18188f4b
  brett koonce committed Sep 27, 2019
  
- [Rust] Fix issue with CPP enums. (#4019) · 368a4ae1
  Paddy Horan committed Sep 27, 2019
  
- [DOCKER] make demo images consistent with ci images when possible. (#4024) · c93b69ff
  Tianqi Chen committed Sep 27, 2019
  
- [Fix]use a more intuitive way to limit the #ops in a group (#4018) · 4b13bf66
```
* use a more intuitive way to limit the #ops in a group

* format
```
  Yida Wang committed Sep 27, 2019
- [ARITH] Use explicit div mode in python. (#4014) · 2ded2d8c
  Tianqi Chen committed Sep 27, 2019
  
- Exposed lowered func to c++ API. (#4012) · 16bed7e6
```
So that you can use: `build_mod_.GetFunction("get_lowered_funcs", false);`
to get lowered_funcs.

```
  Kimish Patel committed Sep 26, 2019
26 Sep, 2019 3 commits

hide psutil (#4013) · 01e53935
Haozheng Fan committed Sep 26, 2019

[QNN][Conv2D] Optimize lowering. (#4006) · d7998d39
Animesh Jain committed Sep 26, 2019


[TOPI][x86] Introduce schedule_injective_from_existing and unify external schedules for all targets (#3983) · b330d301

* Fix extern schedule for x86

* Register x86::schedule_extern

* Fix

* Fix

* Replace extern.py with extern.h

* Introduce new generic function schedule_injective_from_existing

* Fix

* Fix

* Add back to C++

* Fix style

* Injective schedule calls local schedule_injective_from_existing

* Fix

* Remove target arg from schedule_injective_from_existing

* Fix docs

* Try to fix unit test

* Fix test

* Fix other tests

* Fix bug

committed Sep 26, 2019


25 Sep, 2019 11 commits

[RELAY]impose a max op limit to the op fusion pass (#4002) · d21f0ad5
```
* impose a max op limit to op fusion

* use cross platform data type
```
Yida Wang committed Sep 25, 2019

[TOPI] Move conv2d spatial pack schedule to dedicated file (#3972) · f1d2d46b

More schedules are making the conv2d.py file too large, so
we'd like to move the spatial pack schedule to a dedicated file
before introducing the NHWC schedule. No logic change in this patch.

committed Sep 25, 2019


Revert "Added tesnorizeation for avx2 based gemm. (#3982)" (#4007) · 4a3abb94
```
This reverts commit 23727eb4.
```
Tianqi Chen committed Sep 25, 2019

remove FLOP computation for 3rd party lib call (#4005) · 5f19e5a8
Cody Hao Yu committed Sep 25, 2019

[ARITH] Refactor to use explicit div/mod functions instead of operators. (#4000) · f0079a57
```
* [ARITH] Use explicit div/mod functions instead of operators.

* fix pooling case
```
Tianqi Chen committed Sep 25, 2019

Expose llvm.nearbyint intrinsic. This is a faster alternate to rounding. (#4001) · 17c2c0a1

* Expose llvm.nearbyint intrinsic. This is a faster alternate to rounding.

* Added python binding. Added test.

committed Sep 25, 2019
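
The commit above describes `llvm.nearbyint` as an alternative to rounding. As a rough illustration of how the two can differ on ties, here is a minimal Python sketch. It assumes the default floating-point rounding mode (round to nearest, ties to even) for `nearbyint` and ties-away-from-zero for plain `round`, as in C; the helper names are hypothetical and are not part of TVM or LLVM.

```python
import math


def round_half_away(x: float) -> float:
    """Round to nearest, ties away from zero (the behaviour of C's round())."""
    return math.copysign(math.floor(abs(x) + 0.5), x)


def round_half_even(x: float) -> float:
    """Round to nearest, ties to even.

    Assumption: this mirrors nearbyint under the default rounding mode.
    Python's built-in round() on floats already uses ties-to-even.
    """
    return float(round(x))


# Compare the two modes on a few tie and non-tie values.
for value in [1.5, 2.5, -2.5, 2.4, 2.6]:
    print(value, round_half_away(value), round_half_even(value))
```

The two functions agree everywhere except on exact ties such as 2.5, so code that switches between them should only be sensitive to the change if it depends on tie behaviour.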


Change Vivado install instructions to version 2018.3 (#4003) · 9756b067
Philipp Krones committed Sep 25, 2019


Added tesnorizeation for avx2 based gemm. (#3982) · 23727eb4

* Added tesnorizeation for avx2 based gemm.

Summary:
Tensorized the same region as AVX512, namely producing 16x1 int32 results.
This is done by issuing two sets of AVX2 instructions that perform the
reduction on an 8x4 int8 kernel with 1x4 data.

Test Plan:
on avx2 machine:
python tests/python/contrib/test_gemm_avx2_acc32.py

* Fix lint errors. Removed commented out code.

committed Sep 25, 2019


[COMMUNITY] @yongwww-> reviewer (#3997) · 9baff009
Tianqi Chen committed Sep 24, 2019

add parser support for GREATER tflite operator (#3963) · 21353e5f
```
add test for GREATER
```
Ina Dobreva committed Sep 24, 2019

Changes to make tensorize work. These changes also fix the previously broken test. (#3981) · b410df8c

* Changes to make tensorize work. These changes also fix the previously
broken test.

Summary:
Tensorize was breaking for a few reasons.
1)
Assert at: src/op/tensorize.cc:234 CHECK(is_one(e.region[j]->extent))
In some cases this cannot be proven, e.g.:
expected shape=[16, 4], given region=[range(min=((ax1.outer*16)/16), ext=(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer)), range(min=((k.outer*4)/4), ext=(((((k.outer*4) + 3)/4) + 1) - k.outer)), range(min=0, ext=16), range(min=0, ext=4)]
The unprovable one is: ext=(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer)).
This can be simplified, but it is not, because simplifying the division requires
proving ax1.outer > 0, and since ax1.outer is a var, that cannot be proven. The fix
for this is to find all the vars in the expr and replace them with some const value.

2) Equivalence between the tensorized expr and the one being asked to tensorize could not be established. For example,
the error would be:
TVMError: Check failed: Equal(lhs, rhs):
Failed to match the compute with TensorIntrin tensor_intrin's declaration
provided= reduce(combiner=comm_reducer(result=[(x + y)], lhs=[x], rhs=[y], identity_element=[(int16)0]), source=[(int16(data(k))*int16(kernel(((((((((k.outer.outer*64) + (k.outer.inner*2)) + k)/2)*128) + i) - (k.outer.inner*128)) - (k.outer.outer*4096)), ((((k.outer.outer*64) + (k.outer.inner*2)) + k) % 2))))], axis=[iter_var(k, range(min=0, ext=2))], where=(bool)1, value_index=0),
intrin=  reduce(combiner=comm_reducer(result=[(x + y)], lhs=[x], rhs=[y], identity_element=[(int16)0]), source=[(int16(data(k))*int16(kernel(i, k)))], axis=[iter_var(k, range(min=0, ext=2))], where=(bool)1, value_index=0)
The difference is mainly in the source part:
source=[(int16(data(k))*int16(kernel(((((((((k.outer.outer*64) + (k.outer.inner*2)) + k)/2)*128) + i) - (k.outer.inner*128)) - (k.outer.outer*4096)), ((((k.outer.outer*64) + (k.outer.inner*2)) + k) % 2))))]
source=[(int16(data(k))*int16(kernel(i, k)))], axis=[iter_var(k, range(min=0, ext=2))]
This was not being simplified because compute_intrin_iter_space (the map
from iter var to range) did not contain leaf iter vars.

3) Here it fails with:
Check failed: is_one(Simplify(value->shape[i])): Argument b_buffer shape mismatch[16, 4] vs [(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer), (((((k.outer*4) + 3)/4) + 1) - k.outer), 16, 4]
This is in buffer binding, where the expected shape and the bound buffer
shape are considered different, although this would not be the case if
the expr could be simplified.
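
The extent from point 1 can be checked numerically. The following is a minimal sketch, not TVM code: it assumes that `/` in the printed expression denotes truncating (round-toward-zero) integer division, and the helpers `trunc_div` and `extent` are hypothetical names introduced only for illustration.

```python
import operator


def trunc_div(a: int, b: int) -> int:
    """Integer division that rounds toward zero, as C/C++ `/` does."""
    q = abs(a) // abs(b)
    return q if (a >= 0) == (b >= 0) else -q


def extent(outer: int, factor: int, div) -> int:
    """Evaluate (((outer*factor + factor - 1) / factor) + 1) - outer using `div`."""
    return (div(outer * factor + factor - 1, factor) + 1) - outer


# For non-negative `outer` the extent is 1 under truncating division.
assert all(extent(o, 16, trunc_div) == 1 for o in range(0, 100))

# For a negative `outer` it is not, which is why a sign bound on the var matters.
print(extent(-1, 16, trunc_div))

# Under floor division the extent is 1 for every integer `outer` tried here.
assert all(extent(o, 16, operator.floordiv) == 1 for o in range(-100, 100))
```

Under this assumption the expression only collapses to 1 once the variable is known to be non-negative, which matches the commit's observation that the simplifier needs a bound on `ax1.outer` before it can prove the extent is one.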

Test Plan:
On skylake avx512 machine:
python tests/python/contrib/test_gemm_acc16.py

* Implemented a bounded analyzer which traverses the tree and, for reduce/for
statements, binds the bounds in the analyzer. This is later used to
simplify expressions. Inspired by ir_mutator_with_analyzer.

* Addressed comments.

* Added ASF header + define macro for the header file: TVM_ARITHMETIC_IR_VISITOR_WITH_ANALYZER_H_
Some lint fixes as well.

* Relax the assumption that dom_map must always contain all leaf itervars.

* Disable copy constructor and move to raw ptr.

committed Sep 24, 2019
