Commits · d93e1dca81f3bcf122148e29d0b5e72af97811b1 · wenyuanbo / tic

06 Jan, 2020 2 commits

[CONV] Asymmetric padding (#4511) · 34b98eb7

* [CONV] Asymmetric padding

* fix lint error

* update for legalize, rocm and cudnn

* add more test cases

* change more symmetric padding

* change conv2d winograd tests according to original cases

* remove 'alter_op_layout.h' header in bitserial.cc

committed 5 years ago

[Topi]Allow empty tensor for reshape, tile and strided_slice (#4618) · 8e2f229a
```
* Support empty tensor

* Fix schedule

* Refactor

* Minor fix

* Fix pylint

* Merge cpp and python is_empty_shape
```
Yao Wang committed 5 years ago

03 Jan, 2020 1 commit

[TOPI, Relay] Add half_pixel option to Resize op (#4610) · e8a2c9b3

* add onnx resize converter

* update frontends

* updating topi

* adding onnx resize tests

* fixed NHWC test by casting size dtype to int32

* fix tests

* fix lint

* update existing test cases

* fix tensorflow frontend

* fix lint

* remove NHWC stuff

* update topi resize test for half_pixel

* update doc

* fix doc

* remove onnx resize bits

committed 5 years ago

01 Jan, 2020 1 commit
- [FRONTEND][TF] Add conv3d (#4604) · 1ef1605a
```
* [FRONTEND][TF] Add conv3d

* fix high rtol
```
  optima2005 committed 5 years ago
27 Dec, 2019 1 commit

[TOPI] add 3D upsampling Op. (#4584) · c3deec19

* [TOPI] add 3D upsampling Op.

* fix lint issues

* change align_corners to coordinate_transformation_mode

* fix resize3d half_pixel

* make a simple function and clean up trilinear_resize3d_python

* fix doc

committed 5 years ago

24 Dec, 2019 1 commit

[Relay/Topi][Op] Added native DepthToSpace and SpaceToDepth Operators (#4566) · 9b92c539

* Added tvm function stencil for subpixel operations to topi.

* Topi subpixel operators added and tested.

* Added subpixel attrs.

* Added depth_to_space relay attributes.

* depth_to_space fully working.

* Fixed NHWC shape bug.

* SpaceToDepth in and all tests passing.

* lint fixes.

* Added string include

* Fixed topi formatting.

* Added DCR/CRD mode to depthtospace operator.

committed 5 years ago

23 Dec, 2019 1 commit
- [Relay] add max_pool3d in relay and TF converter (#4551) · f277da76
```
* [Relay] add max_pool3d in relay and TF converter

* fix comments
```
  Yong Wu committed 5 years ago
18 Dec, 2019 1 commit
- Implement 1d deconvolution (#4476) · d430fbb5
  Alex Gladkov committed 5 years ago
  
12 Dec, 2019 1 commit

[TOPI] implement pool3d op (#4478) · 41959ed2

* [TOPI] implement pool3d op

* use PoolInferCorrectLayout for both 2d and 3d pooling

* unify MakeMaxPool and MakeAvgPool

committed 5 years ago

04 Dec, 2019 1 commit

implement conv3d op (#4400) · 7e32f373

* implement conv3d op

* add back conv2d_output_shape that was removed by mistake

* fix typo and docs, add topi test

* rebase to master and merge 2d/3d unification

* use cudnn.conv_forward

committed 5 years ago

03 Dec, 2019 1 commit
- [TOPI][Relay][OP] Add a strided_set operation. (#4303) · 6d88c987
  abergeron committed 5 years ago
  
21 Nov, 2019 1 commit
- [TOPI] Fix flaky testcase for floor div (#4382) · 1562eaeb
```
* [TOPI] Fix flaky testcase for floor div

* avoid check at 0.0
```
  Yizhi Liu committed 5 years ago
18 Nov, 2019 1 commit
- [Frontend]Add TensorFlow FloorMod (#4308) · a226973b
```
* Add tf FloorMod

* Add floor_div/mod into topi and relay

* Add to rst

* Fix test
```
  Yao Wang committed 5 years ago
13 Nov, 2019 1 commit
- [TOPI][OP] Support Faster-RCNN Proposal OP on CPU (#4297) · 8cd5ccea
```
* Support Proposal operator on CPU.

* PyLint space issue

* PyLint space issue

* Pylint singleton-comparison issue
```
  Zhao Wu committed 5 years ago
06 Nov, 2019 1 commit
- [TOPI] Fix bug in Winograd on CUDA (#4260) · 7211c277
```
* fix winograd

* move get padding after kernel transform
```
  Cody Hao Yu committed 5 years ago
30 Oct, 2019 1 commit
- [Relay][Topi][TensorFlow][ONNX][Lang] Add support for Any op (#4205) · b07b1952
```
* Add support for Any op

* Support ONNX frontend

* Add doc

* Add to relay docs

* Dummy change to retrigger CI
```
  Jon Soifer committed 5 years ago
28 Oct, 2019 2 commits
- [Relay][Op] Enhance Upsample Operator to support float scales (#4206) · 8b1fb4d5
```
* add scale2 for upsample

* update unit test for upsampling

* support latest upsample op for multiple frontend

* fix lint

* fix lint

* fix lint

* fix lint

* update scale description and rebase
```
  Xingyu Zhou committed 5 years ago
- [TOPI] Fix flaky testcase for check round (#4211) · 2e07447e
  Tianqi Chen committed 5 years ago
  
24 Oct, 2019 1 commit
- [TOPI] Tunable Template for Conv2D HWCN on CUDA (#4168) · 4ab73634
```
* support conv2d HWCN in AutoTVM and Relay

* fix lint

* fix comments and unit tests
```
  Cody Hao Yu committed 5 years ago
11 Oct, 2019 1 commit

[codegen] Add multiple operands and function support when using fp16 compilation (#4056) · ce72e9b5

* overload half operators for cuda codegen

* add float16 te test_op_level1

* fix test_op_level1.py

* fix lint

* disable fp16 test if gpu does not support

* disable fp16 test if gpu does not support

* bypass float16 test if gpu does not support float16

committed 5 years ago

10 Oct, 2019 1 commit

[TOPI] FIFO buffer op, to accelerate sequence modeling with dilated convolutions (#4039) · aa424139

* Add FIFO buffer op to enable explicit computation re-use in convolution

* Add a test

* Add end-to-end test with 1D convolution

* Add a stub in MXNet frontend

* Address reviewer comments

* Add back stub for MXNet frontend

committed 5 years ago

02 Oct, 2019 1 commit
- [RELAY/PASS] Fix the extent for the post_stmt in the loop partition (#3734) · a7873b0a
  Umang Yadav committed 5 years ago
  
22 Sep, 2019 1 commit
- Add operator `isnan` (#3979) · 16d4da4d
```
* add expr `isnan`

* move to intrinsic

* doc & add to topi

* fix error from ci
```
  Huang, Guangtai committed 5 years ago
20 Sep, 2019 1 commit

Add support for MXNet pad operator. (#3739) · 719d6d47

MXNet pad is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.pad

Add support for parameter 'None' in MXNet slice operator.

MXNet 'slice' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.slice

Add support for MXNet cos, sin, arctan

MXNet 'cos' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.cos

MXNet 'sin' is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.sin

MXNet arctan is described at
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.arctan

Add support for MXNet 1D Convolution and 1D Deconvolution

MXNet convolution is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.Convolution

MXNet Deconvolution is described at:
https://mxnet.incubator.apache.org/api/python/symbol/symbol.html#mxnet.symbol.Deconvolution

committed 5 years ago

19 Sep, 2019 1 commit

[TOPI] Add proper scheduling for dense on CUDA (#3923) · bec08fec

* add proper scheduling for dense on CUDA

* add fallback config and fix unit test

* fix corner cases

* refactoring

* fix bias and add testcase

* let fusion happen

committed 5 years ago

16 Sep, 2019 1 commit

[TOPI] operator support: logical_and, logical_or, logical_not (#3929) · ab1853c2

* [TOPI] operator support: logical_and, logical_or, logical_not

* [TOPI] operator support: logical_and, logical_or, logical_not

* [TOPI] fix test cases for operator support: logical_and, logical_or, logical_not

* [TOPI] fix test cases for operator support: logical_not

committed 5 years ago

09 Sep, 2019 1 commit

[Relay/TOPI][Op] Add erf intrinsic and op (#3702) · 2f5b155a

* add more ops

* stop vectorization for erf

* x

* cleanup

* fix

* add whitelist for vectorizable intrin

* add tf converter

* fix dense

* fix

* add missing intrin

* fix mxnet frontend

* fix nvptx

committed 5 years ago

08 Sep, 2019 1 commit
- change docker install script (#3524) · 184fa484
  雾雨魔理沙 committed 5 years ago
  
01 Sep, 2019 1 commit

[Relay][Any] Add shape func for dynamic shape (#3606) · eef35a57

* init shape func in interpreter and vm compiler

* Update interpreter

* fix

* lint

* lint

* fix

* remove hack

* update

* fix

* fix

* update

* address comments & update for shape_of

* fix lint

* update

* fix hybrid

* lint

* fix bug & add take shape func

* lint

* lint

* update

* fix flaky test

* add todo

committed 5 years ago

22 Aug, 2019 2 commits

[TOPI][Relay][TensorFlow] Add OneHot operator (#3781) · 554df211

* Add one-hot to Relay

* topi implementation

* Working

* add topi test

* Add TF test

* Fix check

* fix linting issues

* fix documentation

* Fix documentation

* Add support for on_value, off_value, axis, dtype

* Add full support for axis

* Fix compute and update test_forward

* Move on_value and off_value to inputs

* Add topi test

* Update tests

* Update docs

* Fix style

* re-enable tests

* Add one_hot to mxnet converter

committed 5 years ago

Changed topi cc resize to python implementation with new features. (#3788) · 7264cb6a
Josh Fromm committed 5 years ago

06 Aug, 2019 1 commit

[Relay] [TOPI] `{relay,topi}.nn.sparse_transpose` for **Square** CSR matrices (#3707) · 3b287c4d

* add build gcn tutorial

* add transpose operator for square sparse matrices

* remove extra files

* change loop tag

* comply with lint

* comply with lint -- line too long

* comply with lint

* lint check

* lint check

* lint check

* apply marisa and thierry's reviews

committed 5 years ago

01 Aug, 2019 1 commit

Add support for Tensorflow operators log1p, cos, sin (#3614) · d72cdfa6

The patch adds support for Tensorflow operators log1p, cos, and sin.
Tensorflow log1p is described at https://www.tensorflow.org/api_docs/python/tf/math/log1p
Tensorflow cos is described at https://www.tensorflow.org/api_docs/python/tf/math/cos
Tensorflow sin is described at https://www.tensorflow.org/api_docs/python/tf/math/sin

committed 5 years ago

31 Jul, 2019 1 commit
- [TOPI][CUDA] schedule for group_conv2d (#3663) · 11da1ca3
```
* [TOPI][CUDA] schedule for group_conv2d

* Fix #flops
```
  Wuwei Lin committed 5 years ago
30 Jul, 2019 1 commit

[TOPI] Fix traverse function not inline zero-input op (#3623) · 9d583cf5

* Fix traverse_inline not inlining zero-input ops properly

* Add where to python and set tag to broadcast

* Fix inline

* test

* fix test target

* fix

committed 5 years ago

28 Jul, 2019 1 commit
- Hotfix for issue #3641. (#3644) · 026162ad
  Balint Cristian committed 5 years ago
  
26 Jul, 2019 1 commit
- [TOPI][CUDA] Schedule for pool_grad (#3622) · f1ede9a9
```
* [TOPI][CUDA] Schedule for pool_grad

* Relay test

* Fix fused op

* doc

* Remove set scope local
```
  Wuwei Lin committed 5 years ago
25 Jul, 2019 1 commit
- Add Winograd matrices computation. (#3553) · 97e333ca
  Balint Cristian committed 5 years ago
  
24 Jul, 2019 1 commit
- [TOPI][Relay] max_pool2d & avg_pool2d gradient (#3601) · 5c410037
  Wuwei Lin committed 5 years ago
  
23 Jul, 2019 1 commit

We observe multiple groups across a range of domains (ASR, NMT, LM, etc), (#3566) · d6dcd6c5

internally and externally, interested in replacing standard dense layers with
block-sparse matrix multiplication layers. The motivations are generally: higher
performance (due to reduction in FLOPs, memory bandwidth/cache footprint),
enabling larger models (e.g. fitting more layers in a given memory budget).

Some public work along these lines:

* https://openai.com/blog/block-sparse-gpu-kernels/
* https://openai.com/blog/sparse-transformer/
* https://arxiv.org/abs/1802.08435
* https://arxiv.org/abs/1711.02782

Various groups have been able to successfully train models with reasonable
levels of sparsity (90%+) with marginal accuracy changes, which suggests
substantial speedups are possible (as this implies a >10x reduction in FLOPs).
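
The FLOP estimate above can be checked with a small sketch. It assumes the ideal case in which work scales with the number of nonzero weights; the function name and the 1000 x 1000 layer size are illustrative only and do not come from the patch.

```python
from fractions import Fraction


def flop_reduction(total_weights, nonzero_weights):
    """Ideal FLOP reduction if only nonzero weights are multiplied (assumption)."""
    return Fraction(total_weights, nonzero_weights)


# Hypothetical 1000 x 1000 weight matrix.
total = 1000 * 1000

# 90% sparsity keeps 100,000 nonzeros; 95% sparsity keeps 50,000.
ratio_90 = flop_reduction(total, 100_000)
ratio_95 = flop_reduction(total, 50_000)

print(ratio_90, ratio_95)
```

Exactly 90% sparsity gives a 10x reduction under this assumption, and anything sparser gives more. Real speedups also depend on memory access patterns and block size.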

It is fairly straightforward to realize these theoretical speedups, see e.g. TVM
benchmarks for Intel CPUs in
https://gist.github.com/ajtulloch/e65f90487bceb8848128e8db582fe902, and CUDA
results in https://github.com/openai/blocksparse, etc.

* https://github.com/openai/blocksparse (CUDA)
* https://software.intel.com/en-us/mkl-developer-reference-c-mkl-bsrmm (MKL BSRMM)
* https://docs.scipy.org/doc/scipy-0.14.0/reference/generated/scipy.sparse.bsr_matrix.html (SciPy BSR representation)

This is extracted from a patch we've been using internally. There are
various extensions possible (int8/fp16/bf16, CUDA/other GPU architectures), but
this is a reasonable starting point. However, this needs more thorough unit test
coverage.

We follow the conventions established by scipy.sparse.bsr_matrix and other
libraries, see the unit tests for details.
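
For readers unfamiliar with the BSR layout referenced above, here is a minimal pure-Python sketch of the `data` / `indices` / `indptr` convention used by `scipy.sparse.bsr_matrix`. The helper names `bsr_to_dense` and `bsr_matvec` are hypothetical and are not the TVM operator from this patch; they only illustrate how the three arrays encode a block-sparse matrix.

```python
def bsr_to_dense(data, indices, indptr, shape, blocksize):
    """Expand BSR arrays into a dense list of rows.

    data[k] is the k-th stored R x C block, indices[k] is its block-column,
    and indptr[bi]:indptr[bi + 1] is the range of blocks in block-row bi.
    """
    rows, cols = shape
    r, c = blocksize
    dense = [[0] * cols for _ in range(rows)]
    for bi in range(len(indptr) - 1):
        for k in range(indptr[bi], indptr[bi + 1]):
            bj = indices[k]
            for i in range(r):
                for j in range(c):
                    dense[bi * r + i][bj * c + j] = data[k][i][j]
    return dense


def bsr_matvec(data, indices, indptr, shape, blocksize, x):
    """Compute y = W @ x, touching only the stored blocks of W."""
    rows, _ = shape
    r, c = blocksize
    y = [0] * rows
    for bi in range(len(indptr) - 1):
        for k in range(indptr[bi], indptr[bi + 1]):
            bj = indices[k]
            for i in range(r):
                for j in range(c):
                    y[bi * r + i] += data[k][i][j] * x[bj * c + j]
    return y


# A 4 x 4 matrix with 2 x 2 blocks; only the two diagonal blocks are stored.
data = [[[1, 2], [3, 4]], [[5, 6], [7, 8]]]
indices = [0, 1]
indptr = [0, 1, 2]

W = bsr_to_dense(data, indices, indptr, (4, 4), (2, 2))
y = bsr_matvec(data, indices, indptr, (4, 4), (2, 2), [1, 1, 1, 1])
```

Here the product skips the two all-zero off-diagonal blocks entirely, which is where the FLOP savings come from.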

For folks interested in experimenting with scheduling/AutoTVM etc,
https://gist.github.com/ajtulloch/e65f90487bceb8848128e8db582fe902 is a useful
starting point.

committed 5 years ago
