- 13 Aug, 2019 2 commits
Zhi committed
* Added relay and topi mirror_pad operator (op semantics sketched after this entry).
* Added mirror_padding to tensorflow frontend.
* Added mirrorpad testing in tensorflow frontend.
* Added space_to_depth in tf frontend.
* Added tests for spacetodepth.
* spacetodepth bug fix.
* Lint fix
* Added mirror pad python attrs.
* Pad code formatting.
* Syntax improvement
* Hopefully last lint fix
Josh Fromm committed
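For readers unfamiliar with these two ops, here is a minimal NumPy sketch of their semantics (it mirrors the TensorFlow definitions; illustrative only, not this commit's code):

```python
import numpy as np

x = np.arange(1, 5)  # [1, 2, 3, 4]

# MirrorPad with one pad element per side, in its two TF modes:
print(np.pad(x, 1, mode="reflect"))    # [2 1 2 3 4 3] (edge not repeated)
print(np.pad(x, 1, mode="symmetric"))  # [1 1 2 3 4 4] (edge repeated)

# space_to_depth with block_size=2 on an NHWC tensor: each 2x2 spatial
# block is folded into the channel dimension.
nhwc = np.arange(16, dtype="float32").reshape(1, 4, 4, 1)
n, h, w, c = nhwc.shape
b = 2
out = (nhwc.reshape(n, h // b, b, w // b, b, c)
           .transpose(0, 1, 3, 2, 4, 5)
           .reshape(n, h // b, w // b, b * b * c))
assert out.shape == (1, 2, 2, 4)
```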
- 12 Aug, 2019 1 commit
Neo Chien committed
- 11 Aug, 2019 1 commit
* aot * save * save * fix test * remove vta changes * lint
雾雨魔理沙 committed
- 09 Aug, 2019 1 commit
* reproduce error * fix * lint * lint
雾雨魔理沙 committed
- 08 Aug, 2019 1 commit
* [Relay] [Quantization] WIP - Common files for the quantization work.
* [Relay] [Quantization] WIP - Prototyping requantize op.
* Requantize operator implementation (the underlying math is sketched after this entry). Requantize converts one quantized tensor representation to another quantized representation. The PR has the following implementation features:
  - Requantize operator defined in qnn namespace - relay.qnn.requantize
  - Lowering of the requantize to existing Relay operators
  - Integer fixed-point implementation of requantize
  - Two rounding modes - FE_UPWARDS (round towards infinity) and FE_AWAY_FROM_ZERO (std::round behavior)
  - Floating-point implementation as well, which can act as a reference or be used on devices where FP32 computation is acceptable
  - Unit test cases
  Relevant issue - https://github.com/dmlc/tvm/issues/2351
  Credit to TFLite and GemmLowp for providing reference implementations.
* Typo and lint fixes.
* Doc fix.
* Uncommenting the lint script (fixing a mistake).
* Modifying the unit tests.
* Moving C++ files into src/relay/qnn.
* Moving Python files to python/tvm/relay/qnn. Some minor fixes.
* Moving attrs.h inside the include directory.
* Pushing files that I forgot earlier. Changing util location.
* Incorporating comments. API change. Lint fixes.
* Modifying the GetFixedPointMultiplierShift API as per comments.
* Forgot the dialect change.
* Changing rewrite to qnn_lower.
* Renaming Quantize to Qnn for clarity.
* Remove use_int_domain.
* Incorporating review comments.
* Adding API doc for QNN dialect.
* Move the qnn_lower pass to transform namespace.
* Moving from expr to module. Adding namespace in C++.
* Minor sentence rewrites. Added qnn namespace.
* Added the API doc.
* Changing default out_dtype to int8. Adding a test with in/out_dtype as uint8.
* Style fixes. Better error messages.
* Adding documentation.
* More documentation fixes.
* Adding out dtype check for requantize.
* Adding corner case for FP32 to fixed point conversion.
* Adding extra line.
* Documentation fix.
* Adding static inline.
* Incorporating jackwish's comment. Removed idtype from requantize lowering.
* Removing Quantize/Dequantize code. Restricting Requantize to (u)int8/int32.
* Style fixes.
* Fix the docs.
* Move to Legalize API.
Animesh Jain committed
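To make the fixed-point scheme above concrete, here is a simplified NumPy model of requantize (zero points omitted, int8 output assumed, FE_AWAY_FROM_ZERO rounding; a sketch, not the PR's implementation):

```python
import math
import numpy as np

def get_fixed_point_multiplier_shift(ratio):
    # Decompose ratio = significand * 2**shift with significand in [0.5, 1),
    # then store the significand as a 31-bit fixed-point integer.
    significand, shift = math.frexp(ratio)
    return int(round(significand * (1 << 31))), shift

def requantize(q_in, input_scale, output_scale):
    # real value = q_in * input_scale; re-express it on the output scale:
    # q_out ~= q_in * (input_scale / output_scale), in pure integer math.
    multiplier, shift = get_fixed_point_multiplier_shift(input_scale / output_scale)
    acc = q_in.astype(np.int64) * multiplier   # 64-bit accumulator
    total_shift = 31 - shift
    half = np.int64(1) << (total_shift - 1)
    # FE_AWAY_FROM_ZERO (std::round): round |acc| half-up, then restore sign.
    q_out = np.sign(acc) * ((np.abs(acc) + half) >> total_shift)
    return np.clip(q_out, -128, 127).astype(np.int8)

q = np.array([-100, -3, 3, 100], dtype=np.int32)
print(requantize(q, input_scale=0.5, output_scale=1.0))  # [-50 -2 2 50]
```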
- 07 Aug, 2019 1 commit
* Add LayerNorm op (formula sketched after this entry)
* update
* fix
* Add mean_std and mean_variance
* add std and update doc
* add license
* x
* lint
* x
* fix
* fix doc
Haichen Shen committed
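For reference, the layer normalization computation being added, sketched in NumPy (gamma/beta/eps are the conventional parameter names, assumed here rather than taken from the PR):

```python
import numpy as np

def layer_norm(x, gamma, beta, axis=-1, eps=1e-5):
    # Normalize each sample over the feature axis, then apply the
    # learned affine transform.
    mean = x.mean(axis=axis, keepdims=True)
    var = x.var(axis=axis, keepdims=True)
    return (x - mean) / np.sqrt(var + eps) * gamma + beta

x = np.random.randn(2, 8).astype("float32")
y = layer_norm(x, gamma=np.ones(8, "float32"), beta=np.zeros(8, "float32"))
assert np.allclose(y.mean(axis=-1), 0.0, atol=1e-5)
```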
- 06 Aug, 2019 3 commits
* [Relay] Rewrite pass. This pass transforms an expression into another expression. It has many use cases:
  - Replace an expr with another expr that has better performance.
  - For ASICs, we might want to modify the inputs to adapt to the HW support.
  - Alter op layout can work in conjunction with this pass.
  The motivating use case is the Intel i8 x i8 conv. Intel HW supports u8 x i8 conv. Using this pass, we can replace an i8 x i8 conv with a sequence of operators where one of the operators is now u8 x i8 conv. This will also help automatic quantization performance. (A schematic legalize callback is sketched after this entry.)
* Better API name.
* Removing the conv2d legalization for x86. Will send a separate PR.
* Test name changes.
* Registering one function to register FTVMLegalize.
* Better comments.
Animesh Jain committed
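A schematic sketch of what an FTVMLegalize callback looks like (the exact signature and the data shift shown here are assumptions for illustration, not taken from the PR):

```python
from tvm import relay

def conv2d_legalize(attrs, inputs, arg_types):
    """Return a replacement expression, or None to keep the op unchanged."""
    data, kernel = inputs
    if arg_types[0].dtype == "int8":
        # Shift i8 activations into u8 so a hardware u8 x i8 convolution
        # can be used. A real legalization must also compensate for the
        # +128 shift in the conv output and carry the original conv2d
        # attrs over; both are omitted here for brevity.
        shifted = relay.cast(data, "int32")
        shifted = relay.add(shifted, relay.const(128, "int32"))
        shifted = relay.cast(shifted, "uint8")
        return relay.nn.conv2d(shifted, kernel)
    return None
```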
* fix * fix interpreter
Haichen Shen committed
* add build gcn tutorial
* add transpose operator for square sparse matrices
* remove extra files
* change loop tag
* comply with lint
* comply with lint -- line too long
* comply with lint
* lint check
* lint check
* lint check
* apply Marisa and Thierry's reviews
Yulun Yao committed
- 05 Aug, 2019 1 commit
* save lint some lint lint add charrnn save save save remove debug remove debug remove space refactor save rewrite dce
* reset files
* join -> meet
* lint
* address review comment
* wordsmith
雾雨魔理沙 committed
- 03 Aug, 2019 1 commit
* Fix gather_nd in Relay * Add test cases for gather_nd.
Huilin Qu committed
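For context, a NumPy model of the gather_nd semantics being fixed (Relay's indices-first convention; a sketch, not the patch itself):

```python
import numpy as np

def gather_nd(data, indices):
    # Relay convention: indices has shape (M, Y0, ..., Yk); each "column"
    # of M integers picks one element (or slice) along data's first M axes.
    m = indices.shape[0]
    out_shape = indices.shape[1:] + data.shape[m:]
    out = np.empty(out_shape, dtype=data.dtype)
    for y in np.ndindex(*indices.shape[1:]):
        out[y] = data[tuple(indices[(slice(None),) + y])]
    return out

data = np.array([[0, 1], [2, 3]])
idx = np.array([[1, 0], [0, 1]])  # picks data[1, 0] and data[0, 1]
assert (gather_nd(data, idx) == [2, 1]).all()
```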
- 02 Aug, 2019 2 commits
* fix * lint
雾雨魔理沙 committed
* [Relay][Quantization] Support floating-point scale
* [Relay][Quantization] KL-divergence calibration on dataset (idea sketched after this entry)
* Fix unhandled LeftShift case in QuantizeRealize
* Fix lint
* drop QBias
* fix lint
* address comments
* address comments
* Update comments
* address comments
* lint
* kQIdentity = 0
Wuwei Lin committed
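KL-divergence calibration follows the well-known idea of picking the clipping threshold whose quantized histogram best matches the FP32 activation histogram. A heavily simplified NumPy sketch of that idea (not this PR's code; the real pass handles edge cases this skips):

```python
import numpy as np
from scipy.stats import entropy  # entropy(p, q) computes KL(p || q)

def kl_calibrate(samples, num_bins=2040, levels=255):
    # Histogram of FP32 activation magnitudes gathered over the dataset.
    hist, edges = np.histogram(np.abs(samples), bins=num_bins)
    best_t, best_kl = edges[-1], np.inf
    # Only consider thresholds where the bins divide evenly into `levels`
    # quantized slots, to keep the sketch simple.
    for i in range(levels, num_bins + 1, levels):
        p = hist[:i].astype(np.float64)
        p[-1] += hist[i:].sum()  # outliers clip into the last bin
        width = i // levels
        # Simulate int8 resolution: merge `width` bins into one quantized
        # slot, then expand back so p and q are comparable distributions.
        q = np.repeat(hist[:i].reshape(levels, width).sum(axis=1), width)
        kl = entropy(p + 1e-12, q + 1e-12)
        if kl < best_kl:
            best_kl, best_t = kl, edges[i]
    return best_t  # the quantization scale is then roughly best_t / 127

print(kl_calibrate(np.random.randn(100000)))
```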
- 01 Aug, 2019 2 commits
The patch adds support for the Tensorflow operators log1p, cos, and sin.
Tensorflow log1p is described at https://www.tensorflow.org/api_docs/python/tf/math/log1p
Tensorflow cos is described at https://www.tensorflow.org/api_docs/python/tf/math/cos
Tensorflow sin is described at https://www.tensorflow.org/api_docs/python/tf/math/sin
alexgl-github committed
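A minimal TF 1.x graph exercising the three ops; a construction like this is what the frontend can now import (illustrative snippet, not taken from the frontend tests):

```python
import numpy as np
import tensorflow as tf  # TF 1.x style graph, as used by the frontend tests

x = tf.placeholder(tf.float32, shape=(4,), name="x")
y = tf.math.log1p(tf.math.cos(x) + tf.math.sin(x))

with tf.Session() as sess:
    out = sess.run(y, feed_dict={x: np.zeros(4, dtype=np.float32)})
    # log1p(cos(0) + sin(0)) = log(2)
    np.testing.assert_allclose(out, np.log(2.0), rtol=1e-6)
```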
* add fatal lint lint lint do make completeness check an error lint remove fatal
* fix test
* reset parser file
* remove unneeded import
* Update python/tvm/relay/adt.py
  Co-Authored-By: Steven S. Lyubomirsky <slyubomirsky@gmail.com>
* Update include/tvm/relay/adt.h
  Co-Authored-By: Steven S. Lyubomirsky <slyubomirsky@gmail.com>
* Eliminate trailing whitespace (my fault)
雾雨魔理沙 committed
- 31 Jul, 2019 1 commit
* relay vm serialization * fix lint * load params, fix stream * lint * fix typo
Zhi committed
- 25 Jul, 2019 1 commit
Lianmin Zheng committed
- 24 Jul, 2019 2 commits
- 23 Jul, 2019 3 commits
internally and externally, interested in replacing standard dense layers with block-sparse matrix multiplication layers. The motivations are generally:

* higher performance (due to the reduction in FLOPs and memory bandwidth/cache footprint)
* enabling larger models (e.g. fitting more layers in a given memory budget)

Some public work along these lines:

* https://openai.com/blog/block-sparse-gpu-kernels/
* https://openai.com/blog/sparse-transformer/
* https://arxiv.org/abs/1802.08435
* https://arxiv.org/abs/1711.02782

Various groups have been able to successfully train models with reasonable levels of sparsity (90%+) with marginal accuracy changes, which suggests substantial speedups are possible (as this implies a >10x reduction in FLOPs). It is fairly straightforward to realize these theoretical speedups; see e.g. TVM benchmarks for Intel CPUs in https://gist.github.com/ajtulloch/e65f90487bceb8848128e8db582fe902, CUDA results in https://github.com/openai/blocksparse, etc.

* https://github.com/openai/blocksparse (CUDA)
* https://software.intel.com/en-us/mkl-developer-reference-c-mkl-bsrmm (MKL BSRMM)
* https://docs.scipy.org/doc/scipy-0.14.0/reference/generated/scipy.sparse.bsr_matrix.html (SciPy BSR representation)

This is extracted from a patch we've been using internally. There are various extensions possible (int8/fp16/bf16, CUDA/other GPU architectures), but this is a reasonable starting point. It needs more thorough unit test coverage, however. We follow the conventions established by scipy.sparse.bsr_matrix and other libraries; see the unit tests for details.

For folks interested in experimenting with scheduling/AutoTVM etc., https://gist.github.com/ajtulloch/e65f90487bceb8848128e8db582fe902 is a useful starting point.
Andrew Tulloch committed
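For orientation, the BSR (block compressed sparse row) layout these kernels follow, shown via scipy.sparse.bsr_matrix (a small standalone example, not the new TOPI operator itself):

```python
import numpy as np
import scipy.sparse as sp

# A 4x6 weight matrix with 2x2 blocks, where only two blocks are nonzero.
dense = np.zeros((4, 6), dtype="float32")
dense[0:2, 0:2] = 1.0
dense[2:4, 4:6] = 2.0

w = sp.bsr_matrix(dense, blocksize=(2, 2))
# BSR stores just the nonzero blocks plus CSR-style block indices:
#   w.data    -> (num_blocks, 2, 2) array of block values
#   w.indices -> block-column index of each stored block
#   w.indptr  -> offsets into w.indices, one entry per block-row boundary
assert w.data.shape == (2, 2, 2)

x = np.random.rand(6, 8).astype("float32")
np.testing.assert_allclose(w.dot(x), dense.dot(x), rtol=1e-5)
```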
= Motivation

It's useful to expose the tvm::reinterpret functionality to Relay/TOPI users, as this allows them to build (fused) operators leveraging the bitwise reinterpretation of an operand. An example is approximate transcendental functions, which can be implemented similarly to:

```python
def C(x):
    return relay.expr.const(x, "float32")

def approx_exp(x):
    x = relay.minimum(relay.maximum(x, C(-88.0)), C(88.0))
    x = C(127.0) + x * C(1.44269504)
    xf = relay.floor(x)
    i = relay.cast(xf, "int32")
    x = x - xf
    Y = C(0.99992522) + x * (C(0.69583354) + x * (C(0.22606716) + x * C(0.078024523)))
    exponent = relay.left_shift(i, relay.expr.const(23, "int32"))
    exponent = relay.reinterpret(exponent, "float32")
    return exponent * Y

def approx_sigmoid(x):
    # <2.0e-5 absolute error over [-5, 5]
    y = approx_exp(x)
    return y / (y + C(1.0))

def approx_tanh(x):
    # <4.0e-5 absolute error over [-5, 5]
    x = x * C(2.0)
    y = approx_exp(x)
    return (y - C(1.0)) / (y + C(1.0))
```

See unit tests for implementations of these approximate transcendentals.
Andrew Tulloch committed
雾雨魔理沙 committed
- 21 Jul, 2019 1 commit
Tianqi Chen committed
- 19 Jul, 2019 2 commits
- 18 Jul, 2019 1 commit
雾雨魔理沙 committed
- 17 Jul, 2019 3 commits
* [Relay][VM] Fix debug statement * Change debug statement
Wei Chen committed
* Fix build error * comments
Yinghai Lu committed
Haichen Shen committed
- 16 Jul, 2019 1 commit
* tmp
* Port vm and object to python
* clean up
* update vm build module
* update
* x
* tweak
* cleanup
* update
* fix rebase
* Rename to VMCompiler
* fix
Haichen Shen committed
- 12 Jul, 2019 1 commit
* [Relay][Quantization] Fix issue introduced in #3135 * Recover StopFusion * Fix fmultiref * Fix lint
Wuwei Lin committed
- 11 Jul, 2019 1 commit
* [INFA][IR] Build and Evolve Low-level IR. Remove dep from HalideIR.
* Update include/tvm/node/ir_functor.h
  Co-Authored-By: Jared Roesch <roeschinc@gmail.com>
* Update include/tvm/node/ir_functor.h
  Co-Authored-By: Jared Roesch <roeschinc@gmail.com>
Tianqi Chen committed
- 10 Jul, 2019 4 commits
lint; update; address comment; comment out breaking test
雾雨魔理沙 committed
* Implement type checking for Any
  Remove code generation related changes
  Remove compile changes
  Remove more
  Remove unification hack
  Add some code back that was needed, and clean up test
  Refactor test cases
  WIP
  Implement TypeHint AST
  Add test case which should fail
  Remove unification changes, and fix bug with let rec
  Restore unification for shapes
  Improve error reporting while debugging
  All examples type check
  All examples type check
  WIP
  First version that works with hints, needs clean up
  Remove dead code
  Tweaks
  Remove type hint
  Remove unnecessary type hint stuff
  Remove more type hints
  Clean up
  Expose Any expression node
  Address CR
  Fix
  Fix solver
  Kill unnecessary code
  Fix PyLint
  Fix
  Relocate loops
  Fix license and test
  Lint again
  Lint again
  Fix loops
  Fix docstring
  Fix template error
  Fix compiler issue
  Fix compile err
  Remove more runtime changes
  Restore buffer
  Fix segfault
  Fix
  Fix arange
* Address feedback
* Fix typo
* Fix arange
* Fix op level3
* Fix issue with Python wrapper
Jared Roesch committed
Tianqi Chen committed
* First pass at Relay-to-Python converter testing utility
* Indicate astor as a dependency
* Add astor dep to host as well
* Typos and small bugs
* Handle ADTs and matching in Python conversion
* Remove any dependency on ast.parse
* Eliminate unnecessary type var field in Python version of ConstructorValue (already gone on C++ side)
* Update constructor value, fix syntax errors
* Don't forget keywords arg on Call nodes
* Fix some incorrect calls to ast nodes
* Fix more calls, a little more cleaning up
* Missing cases in attr conversion
* Lower op calls instead of running them through interpreter, as in @MarisaKirisame's AoT compiler
* We do still need the module
* Remove changes to op attrs: Will PR separately
* Smoke test and corrections
* More tests and fixes
* Ensure imports are properly global in generated Python code
* Add unit tests for refs
* Add unit test for tuple indexing
* Add unit test for if expression
* Remove astor dependency
* Remove astor from meta.yaml too
* Fix if test and add basic local function test
* Add global function test, refactor earlier tests
* Correct 'clause' field in ADT so Python and C++ field names match
* More fixes and tests for matching and constructors
* Dramatically simplify matching: no need for a thunk
* Improve ref writing test
* Ensure local recursion works
* cleanup
* Add test for global recursion
* Add test for higher-order calls
* Get ops working, add basic tests
* Remove accidentally duplicated test
* More docstrings to appease pylint
* Forgot to fix a test using constructor values
* Reduce optimization level in fusion and fix tuple input to operators
* Test op with tuple output, fix tuple output code
* Add unit test for batch norm
* Add a couple more tricky test cases
* Correct nat constructor to drop unnecessary field
* Fix the op attrs file (accidentally reduced it)
* Address review comments
* Adapt to new ConstructorValue representation (no more runtime dep on module)
* Use pass manager and updated interfaces. Extend module.from_expr to accommodate necessary demands
* Use sequential return value
* Lift out nested conditionals
* Replace triple single quotes with triple double quotes
* Use main variable instead of entry_func
Steven S. Lyubomirsky committed
- 09 Jul, 2019 3 commits
* [Relay][VM] Compiling pattern matching
* Fix lint
* Remove debug code
* Move TreeNode definition
* merge ifi and selecti, todo: remove them
* fix lint
* remove ifi and selecti
* rename GetTagi to GetTag
* fix dltype
* fix more dltype
* Generalize If and Select, and rename to Ifi and Selecti
* Fix lint
* Rename Ifi to If
* Change register default to match value
* Remove bad specialization for Move
* Stop using Select
* Remove Select
* TreeNode refactor
* Change entry_func name
* Remove Cmp due to rebase issue
Wei Chen committed
- Weight dtype can be different from idtype, so the weight tensor is used to set the weight dtype.
- For the conv2d NCHWc operator, the weight can be of any dimension. For int8 computation on Intel, it can be 7D. Relaxing the weight type checking accordingly.
Animesh Jain committed
* init fix rebase lint fix cmake try again fix ci
* add gitignore
* fix format
* do not include .interp and .tokens
雾雨魔理沙 committed