Commits · a44ac185f90e60eaf015cfd3d094b98bac6ff756 · wenyuanbo / tic

25 Nov, 2019 1 commit

[Perf] Enhance cudnn and cublas backend and enable TensorCore (#4353) · dabde40f

* add half and mix precision support to cublas backend

* add TensorCore support in CuDNN

* enhance CuDNN support

* address comments and fix lint

* fix

* add fp16 test

committed 5 years ago

dabde40f Browse Directory

23 Nov, 2019 1 commit
- [Relay][Legalize] Legalize conv2d_transpose for NHWC (#4399) · 9049d669
  Alexander Pivovarov committed 5 years ago
  
  9049d669 Browse Directory
21 Nov, 2019 1 commit
- [QNN] Lowering for Depthwise Convolution. (#4351) · 464ebb13
  Animesh Jain committed 5 years ago
  
  464ebb13 Browse Directory
19 Nov, 2019 1 commit
- [PERF] Parallelize reduction for CPU (#4158) · af52eba1
```
* [PERF] parallel reduction in cpu

* fix

* x

* update

* lint

* fix
```
  Haichen Shen committed 5 years ago
  af52eba1 Browse Directory
18 Nov, 2019 2 commits
- [SOURCE] Add ASF header to __init__.py files (#4359) · 00521fab
  Tianqi Chen committed 5 years ago
  
  00521fab Browse Directory
- [Frontend]Add TensorFlow FloorMod (#4308) · a226973b
```
* Add tf FloorMod

* Add floor_div/mod into topi and relay

* Add to rst

* Fix test
```
  Yao Wang committed 5 years ago
  a226973b Browse Directory
16 Nov, 2019 1 commit
- Fix docstring in topi.nn.fifo_buffer (#4349) · 0d891bf3
  Philip Hyunsu Cho committed 5 years ago
  
  0d891bf3 Browse Directory
15 Nov, 2019 3 commits
- fix inconsistent tag name (#4134) · a8e6ee9b
  ziyu-guo committed 5 years ago
  
  a8e6ee9b Browse Directory
- imp module is deprecated (#4275) · 9e6371fb
  Jian Weng committed 5 years ago
  
  9e6371fb Browse Directory
- [Contrib] Add MKL DNN option (#4323) · 72821b20
```
* [Contrib] Add MKL DNN

* update

* update
```
  Haichen Shen committed 5 years ago
  72821b20 Browse Directory
13 Nov, 2019 1 commit
- [TOPI][OP] Support Faster-RCNN Proposal OP on CPU (#4297) · 8cd5ccea
```
* Support Proposal operator on CPU.

* PyLint space issue

* PyLint space issue

* Pylint singleton-comparison issue
```
  Zhao Wu committed 5 years ago
  8cd5ccea Browse Directory
11 Nov, 2019 2 commits

Add More Shape Functions (#4179) · 62521453

* Add shape functions

* Fix get_const_tuple

* Fix cpplint

* Fix pylint

* Fix pylint

* rebase and fix

* Check Any for infer type

* Fix expand_dim shape func for zero rank input

* Fix pooling infer type

* Address comment

* Register layout transform attr

committed 5 years ago

62521453 Browse Directory

[TOPI][AlterOpLayout][ARM] Enabling NHWC to NCHW layout transformation. (#4249) · 1d243664
Animesh Jain committed 5 years ago

1d243664 Browse Directory

08 Nov, 2019 1 commit
- [TOPI][CUDA] Fix Winograd Kernel Size Support (#4276) · 76b79671
```
* fix_winograd_cuda_kernel_size

* add unit test
```
  Cody Hao Yu committed 5 years ago
  76b79671 Browse Directory
07 Nov, 2019 1 commit

[AutoTVM] Add batch_matmul to tunable operations (#4242) · 14a5a358

* Batch matmul tuning running but with errors.

* Default x86 schedule as good as before.

* Code Cleanup

* Remove unused argument.

* improved template documentation.

* Silly lint fix

* Removed leftover comment.

* Moved cfg declaration to schedule for batch_matmul

* Moved x86 dense cfg declaration to schedule.

* lint fix

* Removed duplicate cfg declaration in dense.

* Reverted changes to dense.

committed 5 years ago

14a5a358 Browse Directory

06 Nov, 2019 2 commits
- [TOPI] Fix bug in Winograd on CUDA (#4260) · 7211c277
```
* fix winograd

* move get padding after kernel transform
```
  Cody Hao Yu committed 5 years ago
  7211c277 Browse Directory
- [DOCS] Update link loc (#4257) · 86b844b9
  Tianqi Chen committed 5 years ago
  
  86b844b9 Browse Directory
30 Oct, 2019 1 commit
- [Relay][Topi][TensorFlow][ONNX][Lang] Add support for Any op (#4205) · b07b1952
```
* Add support for Any op

* Support ONNX frontend

* Add doc

* Add to relay docs

* Dummy change to retrigger CI
```
  Jon Soifer committed 5 years ago
  b07b1952 Browse Directory
28 Oct, 2019 1 commit

[Relay][Op] Enhance Upsample Operator to support float scales (#4206) · 8b1fb4d5

* :add scale2 for upsample

* update unit test for upsampling

* support latest upsample op for multiple frontend

* fix lint

* fix lint

* fix lint

* fix lint

* update scale description and rebase

committed 5 years ago

8b1fb4d5 Browse Directory

25 Oct, 2019 1 commit
- [TOPI][x86] Legalize - Support int8xint8 convolution to use VNNI instructions. (#4196) · 493c98d3
  Animesh Jain committed 5 years ago
  
  493c98d3 Browse Directory
24 Oct, 2019 3 commits

TensorCore Support using Intrinsic (#4136) · 324a9607

* add tensor core support

* avoid memory bank conflict

* fix thread sync & better performance

* better performance

* add schedule test for conv2d

* extend into BatchMatMul

* support config fragment shape and layout using intrinsic

* add TensorCore tutorial

* add int support and fix lint

* address comment

* add 32*16*8 TensorCore test

* fix wmma include logic

committed 5 years ago

324a9607 Browse Directory

[TOPI] Tunable Template for Conv2D HWCN on CUDA (#4168) · 4ab73634
```
* support conv2d HWCN in AutoTVM and Relay

* fix lint

* fix comments and unit tests
```
Cody Hao Yu committed 5 years ago
4ab73634 Browse Directory
Split adaptive_pool2d_avg into sum and div (#4186) · c9aa55cd
Yao Wang committed 5 years ago

c9aa55cd Browse Directory

22 Oct, 2019 1 commit
- [TOPI] Added support for Mali Bifrost target (#4047) · ecb0a7ea
  mbarrett97 committed 5 years ago
  
  ecb0a7ea Browse Directory
17 Oct, 2019 1 commit
- [TOPI][x86] Cascade lake support. (#4123) · 972f019c
```
* [TOPI][x86] Cascade lake support.

* Jenkins test debug 1.

* Testing cascade lake alone.
```
  Animesh Jain committed 5 years ago
  972f019c Browse Directory
15 Oct, 2019 1 commit
- [Relay][Topi] Disable conv NHWC pack int8. (#4038) · 68472596
  Animesh Jain committed 5 years ago
  
  68472596 Browse Directory
10 Oct, 2019 3 commits
- [TOPI] FIFO buffer op, to accelerate sequence modeling with dilated convolutions (#4039) · aa424139
```
* Add FIFO buffer op to enable explicit computation re-use in convolution

* Add a test

* Add end-to-end test with 1D convolution

* Add a stub in MXNet frontend

* Address reviewer comments

* Add back stub for MXNet frontend
```
  Philip Hyunsu Cho committed 5 years ago
  aa424139 Browse Directory
- correct error (#4093) · f3122887
  Leyuan Wang committed 5 years ago
  
  f3122887 Browse Directory
- Fixing tensor not found issue in bitserial operator (#4095) · 283afac0
  Aniket Rangrej committed 5 years ago
  
  283afac0 Browse Directory
09 Oct, 2019 2 commits
- [TOPI] Add valid auto tvm for Intel Graphics (#4078) · 4d875d1f
```
* add valid autotune

* fix pylint
```
  Leyuan Wang committed 5 years ago
  4d875d1f Browse Directory
- [TOPI][X86] Pool operator parallel support. (#4090) · 3a32729c
  Animesh Jain committed 5 years ago
  
  3a32729c Browse Directory
08 Oct, 2019 2 commits
- [topi] enable fp16 sort for arm (#4084) · 1c56c722
  Yizhi Liu committed 5 years ago
  
  1c56c722 Browse Directory
- [AlterOpLayout][x86] NHWC to NCHWc conv support. (#4080) · 153fd7ff
  Animesh Jain committed 5 years ago
  
  153fd7ff Browse Directory
01 Oct, 2019 2 commits

[TOPI]Add op argwhere (#3994) · fa4d3ec6

* Add op argwhere

* Move shape func to _algorithm.py

* Add lint rule

* Raise exception if rank is not supportted

* move argwhere to transform

* Add argwhere example

* Fix lint

* Add 1-d support

* cleanup

* Add more dtype support

* CR comment

* Improve error message

* Docs

* raise exception

committed 5 years ago

fa4d3ec6 Browse Directory

[topi] add ARM v8.2 udot (uint8) support (#3978) · 5cc17649

* [topi] add ARM v8.2 udot (uint8) support

* fix test case

* fix common conv2d schedule

* add back fp32_time in test

* fix lint

* fix doc, add support for int32_lanes=4, signed int

* fix lint

* add ic_bn % 4 checker in schedule

committed 5 years ago

5cc17649 Browse Directory

30 Sep, 2019 1 commit
- [ARITH] migrate indexdiv/mod to floordiv/mod (#4008) · f5f2feea
  Tianqi Chen committed 5 years ago
  
  f5f2feea Browse Directory
28 Sep, 2019 1 commit
- [ARITH] cleanup the indexmod/div on python side (#4028) · f98035b0
  Tianqi Chen committed 5 years ago
  
  f98035b0 Browse Directory
27 Sep, 2019 1 commit
- [ARITH] Use explicit div mode in python. (#4014) · 2ded2d8c
  Tianqi Chen committed 5 years ago
  
  2ded2d8c Browse Directory
26 Sep, 2019 1 commit

[TOPI][x86] Introduce schedule_injective_from_existing and unify external… · b330d301

[TOPI][x86] Introduce schedule_injective_from_existing and unify external schedules for all targets (#3983)

* Fix extern schedule for x86

* Register x86::schedule_extern

* Fix

* Fix

* Replace extern.py with extern.h

* Introduce new generic function schedule_injective_from_existing

* Fix

* Fix

* Add back to C++

* Fix style

* Injective schedule calls local schedule_injective_from_existing

* Fix

* Remove target arg from schedule_injective_from_existing

* Fix docs

* Try to fix unit test

* Fix test

* Fix other tests

* Fix bug

committed 5 years ago

b330d301 Browse Directory

25 Sep, 2019 1 commit

[TOPI] Move conv2d spatial pack schedule to dedicated file (#3972) · f1d2d46b

More schedules are making the conv2d.py file too large, so
we'd like to move the spatial pack schedule to dedicated file
before introducing NHWC schedule. No logic change in this patch.

committed 5 years ago

f1d2d46b Browse Directory