- 10 Nov, 2019 1 commit

Yizhi Liu committed
- 09 Nov, 2019 1 commit

* Add Auto TensorCore Unit Test
* Rebase to tvm master branch & Add auto tensor core
* Code Refine
* Add tensor core switch by pragma
* Add pragma in tensor core example code
* Get real tile size to replace hard-coded 16
* Support more than 2 dimensions (e.g. batchmatmul) for buffer bind scope
* Support batch matmul
* Move cuda env check to tensor_core.cc
* Code refine for tensor_core.cc
* Refine comments
* Some refinements of code and comments
* Update TensorCore UT to pass the CPU test
* Remove redundant code
* matmul's storage align for different layouts
* Add support for different positions of type cast
* Add formal tutorial for auto tensorcore codegen
* Move tensorcore check up to tutorial code
* Code and doc refine
* Comment out tune_and_evaluate in tutorial
* Fix cpplint error
Minmin Sun (孙敏敏) committed
- 08 Nov, 2019 2 commits

fix the problem that android_rpc compilation failed
peike committed

* fix_winograd_cuda_kernel_size
* add unit test
Cody Hao Yu committed
- 07 Nov, 2019 2 commits

Jon Soifer committed

* Batch matmul tuning running but with errors.
* Default x86 schedule as good as before.
* Code cleanup
* Remove unused argument.
* Improved template documentation.
* Silly lint fix
* Removed leftover comment.
* Moved cfg declaration to schedule for batch_matmul
* Moved x86 dense cfg declaration to schedule.
* Lint fix
* Removed duplicate cfg declaration in dense.
* Reverted changes to dense.
Josh Fromm committed
- 06 Nov, 2019 4 commits

* fix winograd
* move get padding after kernel transform
Cody Hao Yu committed

* [Contrib] Fix error message at callback_get_section_size()
* Trigger notification
Neo Chien committed

* Update TensorUtil.scala
* Update test_vta_insn.py
Liangfu Chen committed

Tianqi Chen committed
- 05 Nov, 2019 2 commits

zhuochen committed

LLVM 8 will crash when loading the bitcodes. This is a runtime check, as the file will be compiled in even when USE_ROCM is OFF in the configuration, if ROCM is installed in the default location.
Fixes: #4087
Thomas Viehmann committed
- 04 Nov, 2019 4 commits

Tianqi Chen committed

* Add StopGradient
* Add batch_dims attr to ignore list for GatherV2
* Trigger CI
Trevor Morris committed

Kim committed

XFPlus committed
- 02 Nov, 2019 2 commits

* [VTA] Performance optimization: remove unnecessary contiguous memory use.
  Issue: Uop maintains a cache vector to copy uop data into contiguous DRAM memory for FPGA/Simulator use, but this cache vector does not get cleared after the FPGA/Simulator core runs. In the Resnet18 case, if we print the cache size in the UopQueue::ReadBarrier function, we can see that the cache size keeps increasing, which causes useless data copies and unnecessary contiguous DRAM memory allocation.
  Analysis: The issue is caused by the cache_ vector not being cleared when uop_queue_.Reset() is called.
  Solution: Override the BaseQueue Reset function in UopQueue and add cache_ clearing logic.
* Address review comments, remove spacing.
Hua Jiang committed
* Support reshape for dynamic shape in tf converter
* Only allow reshape directly after shape function for symbolic input shape
* Fix lint
Yao Wang committed
- 01 Nov, 2019 7 commits

* [NODE][REFACTOR] Rename IRFunctor->NodeFunctor, use function pointers for dispatching.
  Previously we used std::function for the functor dispatching. It introduces additional overhead and problems during dll destruction (of std::function). This PR changes the std::function to function pointers. This adds some restrictions around set_dispatch that we can work around, but it improves general efficiency by removing one level of indirection in the std::function. We also no longer need special macros to register functions to the Functor.
Tianqi Chen committed
Jared Roesch committed

Wei Chen committed

* [Relay][Pass] Avoid FoldConstant folding some ops
* rename
Wuwei Lin committed

Kim committed

Sergei Grechanik committed

Signed-off-by: qinqiuping <autumnqin@126.com>
autumnqin committed
- 31 Oct, 2019 6 commits

Tianqi Chen committed

Tianqi Chen committed

* [CI] Update the ci-gpu to use cuda10
* [CI] Enforce tensorcore gpu for unittest
Tianqi Chen committed

KoolKoffee committed

* [CI] Move gpu docker binary to cuda10
* Fix the gcn tutorial
Tianqi Chen committed

* [Doc] Update ANTLR instruction
* Update docs/install/from_source.rst
Wei Chen committed
- 30 Oct, 2019 9 commits

Wei Chen committed

* [CI] use llvm9 for the gpu tests
* Update Docker script to support new nvidia docker
Tianqi Chen committed

* Add support for Any op
* Support ONNX frontend
* Add doc
* Add to relay docs
* Dummy change to retrigger CI
Jon Soifer committed

* Added slice v10
* Added constantofshape operation and small refactor.
* Finished one_hot implementation.
* Reshape working across all bert layers.
* Fixed constantofshape and removed code duplication.
* onnx model fully ingested.
* Working on improving onnx tests.
* Changed onnx testing to use onnxruntime instead of caffe2; also formatted.
* Add arbitrary output nodes to onnx frontend.
* Added v6 tiling for bert squad 8 support.
* Small syntax fixes
* Reduced code duplication in split opset versions.
* Added batch matmul test
* Added unstack split testing.
* Added onehot test; needs a little cleanup probably.
* Replaced deprecated constant fill with constantofshape and updated tests accordingly.
* Added tests for new opset versions of slice and tile.
* Lint cleanup
* Lint fixes
* Changed onnx dependency
* Went back to caffe2 runtime for CI integration.
* Rebase and small typo/syntax changes.
* Added hard casting of onehot attributes to int.
Josh Fromm committed

Tianqi Chen committed

Sergei Grechanik committed

* [QNN] Improving Dense lowering.
* Moving get_shape method to util; finalizing the test cases and the code structure for optimized dense computation.
* Fixing cpplint.
* Addressing review comments.
* Renaming the variables correctly.
* Renaming the variables correctly.
shoubhik committed

Bohan Hou committed

* Add Python type functor and tests
* Lint roller
Logan Weber committed