Commits · f23ac96905b3a434d2ee3b8bcc912a24b3e63eba · wenyuanbo / tic

21 Feb, 2020 1 commit

[CODEGEN] Support cuda tensorcore subbyte int data type in auto tensorcore (#4546) · f23ac969

* support cuda tensorcore subbyte int data type in auto tensorcore

* add lisence

* pass cpplint

* fix code review comments

* merge the int4/int1 codegen tutorial into the existing auto tensorcore tutorial

* using master's new API

* disable tuning when cuda is not enabled

* address cr comment

* do not run the tuning

* fix test failure

* fix cpplint error

* fix bool type reduction bug

* 1. fix a index bug 2. fix returned bytes value of int1/int4/uint4

* fix typo

committed Feb 20, 2020

f23ac969 Browse Files

20 Feb, 2020 3 commits
- [DOCS] Fix Sphinx Warnings (RST indent, cross-ref, and image scale) (#4920) · 98e7709f
```
* fix indents

* Fix image scale and cross-ref
```
  Cody Yu committed Feb 20, 2020
  98e7709f Browse Files
- [Relay] Fix an assertion exposed by loop vectorizer (#4916) · efd35e86
```
- Allows uniform conditions for select expressions (the same as halide)
  exposed by the loop vectorizer.

Signed-off-by: Wei Pan <weip@nvidia.com>
```
  wpan11nv committed Feb 20, 2020
  efd35e86 Browse Files
- [DOCS] Fix sphinx warnings (#4917) · fd6d7837
```
* Fix Python docstrings

* More fixes

* Fix lint
```
  Cody Yu committed Feb 20, 2020
  fd6d7837 Browse Files
19 Feb, 2020 3 commits
- [REFACTOR] Polish ffi convention. (#4912) · 18295b27
```
* [REFACTOR] Polish ffi convention.

- Remove the src/api, keep registration local to the c++ function.
- Remove the api_internal as it is no longer needed.

* Update the codebase walk through
```
  Tianqi Chen committed Feb 19, 2020
  18295b27 Browse Files
- [RELAY][FRONTEND][TF] Fix FuseBatchNorm output cast error if need_cast is True (#4894) · fccf2268
  hcyang committed Feb 18, 2020
  
  fccf2268 Browse Files
- Fix tvm.target.generic_func runtime detection (#4910) · 406b5f76
  Andrew committed Feb 18, 2020
  
  406b5f76 Browse Files
18 Feb, 2020 8 commits
- [DOCS] Update API docs to reflect the status after the refactor. (#4907) · d2ae8c95
  Tianqi Chen committed Feb 18, 2020
  
  d2ae8c95 Browse Files
- [Relay] Expose FunctionGetAttr to Python (#4905) · 41835d17
```
* [Relay] Expose FunctionGetAttr to Python

* add test

Co-authored-by: Jon Soifer <jonso@microsoft.com>
```
  Jon Soifer committed Feb 18, 2020
  41835d17 Browse Files
- [Relay][Frontend][Keras] NHWC import support. (#4899) · 9d646543
```
* Basic test working

* Almost all tests working.

* all tests passing.

* Fixed lint.

* Improved Style.
```
  Josh Fromm committed Feb 18, 2020
  9d646543 Browse Files
- [REFACTOR][PY] Establish tvm.arith (#4904) · d1e1ac49
  Tianqi Chen committed Feb 18, 2020
  
  d1e1ac49 Browse Files
- [CI] Add autodocsum as dep (#4902) · 38d1dd24
  Tianqi Chen committed Feb 17, 2020
  
  38d1dd24 Browse Files
- [CI] Update ci docker to add autodocsumm (#4903) · 8310b252
  Tianqi Chen committed Feb 17, 2020
  
  8310b252 Browse Files
- Fixed bugs that occured when using bitwise operators on floating point type… · 976c08ad
```
Fixed bugs that occured when using bitwise operators on floating point type expressions. Further crash when using ops <<, >>, %. Finally added regression tests for both types of bug. (#4892)
```
  pankratz committed Feb 17, 2020
  976c08ad Browse Files
- [REFACTOR][PY] Establish tvm.te and tvm.driver (#4900) · 08338dd5
```
- Move the related files to tvm.te
- Move build_module.py to tvm.driver
```
  Tianqi Chen committed Feb 17, 2020
  08338dd5 Browse Files
17 Feb, 2020 5 commits
- [Relay][Pass] Fix bug in re-processing call node in MergeComposite pass (#4879) · 27a02844
```
* Fix bug in re-processing call node

* Add test

* Add to main

* temp changes to work from another machine

* fix rest of tests

* fix test_reuse_call_merge

* fix merge

Co-authored-by: Jon Soifer <jonso@microsoft.com>
```
  Jon Soifer committed Feb 17, 2020
  27a02844 Browse Files
- [DOCS] Introduce how to add hardware backend to FAQ (#4898) · 0b2d11a5
  Tianqi Chen committed Feb 17, 2020
  
  0b2d11a5 Browse Files
- Fast exponent (#4790) · 13140916
  Alex Gladkov committed Feb 17, 2020
  
  13140916 Browse Files
- Update faq.md (#4893) · a43e326f
```
various minor editorial updates - style, grammar, typos.
```
  Baden Hughes committed Feb 16, 2020
  a43e326f Browse Files
- Fix alpha_equal bug (#4897) · 95de08ba
  Zhi committed Feb 16, 2020
  
  95de08ba Browse Files
16 Feb, 2020 3 commits

[CI] Cleanup logfile before tutorial runs (#4896) · e7be8bf4
Tianqi Chen committed Feb 16, 2020

e7be8bf4 Browse Files
[Relay] Fix VM compiler for while loop with free vars (#4889) · 529ee1fe
```
* add additional switch to handle nested call node

* Fix VM compiler for while loop with free var
```
masahi committed Feb 15, 2020
529ee1fe Browse Files

[CodeGen][CUDA] Fix issues in cuda codegen (#4876) · d50ba721

- Do not emit __shared__ etc. as part of type for casting

- Fix fp16 reduction kernels with compiler errors:

  "no operator "+" matches these operands, volatile half + volatile half

  This patch inserts casts to remove volatile type qualifier following
  volatile loads (fp16 only). CUDA fp16 library headers should add
  volatile member functions.

- Update have_fp16 to include compute 6.1 GPUs, which do support fp16,
  although their fp16 throughput is low. Updated tests.

Signed-off-by: Wei Pan <weip@nvidia.com>

committed Feb 15, 2020

d50ba721 Browse Files

15 Feb, 2020 3 commits
- improve antlr import error message (#4888) · 7e9ec735
  masahi committed Feb 15, 2020
  
  7e9ec735 Browse Files
- [AutoTVM] Support range in index based tuners (#4870) · feda150e
```
* Support range in index based tuners

* Address comments

* Remove __*state__

* trigger CI
```
  Cody Yu committed Feb 14, 2020
  feda150e Browse Files
- [QNN] Add support for per channel weight scale in dense op (#4880) · a5e54b1d
```
* add test case for per channel dense

* add unit arg in tflite frontend

* update qnn legalize test

* fix output dim index
```
  masahi committed Feb 15, 2020
  a5e54b1d Browse Files
14 Feb, 2020 3 commits

[QNN] More doc fix on quantize and convolution (#4874) · 24c53a34
```
* [QNN] Doc fix on quantize and convolution

* update test
```
masahi committed Feb 13, 2020
24c53a34 Browse Files

[TOPI][CUDA] Enable vectorization on fp16 type (#4867) · 7013fc9a

- This allows to better utilize the memory bandwidth

- Note that not all cases are vectorized for fp16 datatype. For
  instance, when the size is not a multiple of 1024, the inner loop
  may be an expression that cannot be vectorized. In this case, a
  small inner loop is still benefical for latency hidding.

Signed-off-by: Wei Pan <weip@nvidia.com>

committed Feb 13, 2020

7013fc9a Browse Files

[REFACTOR][PY] Establish tvm.tir · b787ffa3

- Move related files into the corresponding location as in C++
- Keep the top-level TVM API backward compatible to make minimum changes in topi

committed Feb 13, 2020

b787ffa3 Browse Files

13 Feb, 2020 6 commits

Update docs/dev/virtual_machine.rst · a6c42b34
```
Co-Authored-By: Wei Chen <ipondering.weic@gmail.com>
```
Zhi committed Feb 13, 2020
a6c42b34 Browse Files
Update docs/dev/virtual_machine.rst · 243071ad
```
Co-Authored-By: Wei Chen <ipondering.weic@gmail.com>
```
Zhi committed Feb 13, 2020
243071ad Browse Files
fix vm doc · c8e17dd2
Zhi Chen committed Feb 13, 2020

c8e17dd2 Browse Files
Optimize x86 conv3d_ndhwc using data packing approach. (#4866) · 8d945872
```
Add tuneable conv3d_ndhwc schedule
```
Alex Gladkov committed Feb 12, 2020
8d945872 Browse Files

[FRONTEND][TFLITE] Add support for TFLite_Detection_PostProcess (#4543) · 70c63829

* [FRONTEND][TFLITE] Add support for TFLite_Detection_PostProcess

This adds support for the custom operator
TFLite_Detection_PostProcess which is commonly used in
object detection networks such as SSD Mobilenet. It
only adds support for when use_regular_nms = False.

Change-Id: I819b253c0eb6f0fa55da65d2634e09359b888828

* Added a test for the tflite custom op

Change-Id: Ie5baa092deae9a8bcffd2ebd9f6d346b90e58afd

* Removed trailing comma

Change-Id: Ib08f02b5f1a59a883048bfb36e4321152cd2e7f2

* Added spaces between divide

Change-Id: If1171fc03d211a809cedeb800804394972af4060

* Formatted comment

Change-Id: I3ce7e69b8d2c73aec57369c1c64ea1eec07f087b

* Reduced line length in test

Change-Id: I49eaafc3369070f8f3e85fbb965ad20972096c68

* Set random seed for test

Change-Id: I542a787d11422ea83c52147b2cb1144fcef0dd77

* Fixes to style

Change-Id: I2971b8ecebe08c882b2481a99f67cfbe515e0b1f

* Assert for incorrect number of inputs

Change-Id: I393f3b3b62be73e427498d98456fb1d5a214e0af

* Change comparison to pass linting

The linter was updated, so I needed to fix
a small style issue as a result.

Change-Id: Ia3c954565a00de92e7fb1912eae9ed9875d60c7c

committed Feb 13, 2020

70c63829 Browse Files

[REFACTOR][PY][API-CHANGE] Establish tvm.target · 51a265af

Move the related target modules into tvm.target.

API change:
- tvm.target.current_target -> tvm.target.Target.current
- tvm.datatype -> tvm.target.datatype

committed Feb 12, 2020

51a265af Browse Files

12 Feb, 2020 4 commits

[JVM] Update the runtime PackedFunc for module · 79cfab00
tqchen committed Feb 12, 2020

79cfab00 Browse Files
Fix optimize · aaf62e47
tqchen committed Feb 12, 2020

aaf62e47 Browse Files
[DOCS][PY] Sphinx docs about tvm.ir · 176ffe50
tqchen committed Feb 12, 2020

176ffe50 Browse Files

[REFACTOR][PY][API-CHANGE] establish tvm.ir, migrate corresponding files (#4862) · a5661611

* [REFACTOR][PY][API-CHANGE] establish tvm.ir, migrate corresponding relay files.

This PR establishes tvm.ir and migrates the corresponding relay
files into the new folder.

API Change:
- relay.Module -> tvm.IRModule

* Update with ADT

* Migrate transform

* address comments

* Migrate module

* Migrate json_compact

* Migrate attrs

* Move LoweredFunc to stmt temporarily

* temp migrate container

* Finish migrate container

committed Feb 11, 2020

a5661611 Browse Files

11 Feb, 2020 1 commit
- [Topi] Missing header (#4865) · 15df204f
  hlu1 committed Feb 11, 2020
  
  15df204f Browse Files