Commits · c846d17c65ffd0d0cd9f9c3be321af0ad1da13f3 · wenyuanbo / tic

16 Sep, 2019 5 commits

[TOPI] Improve conv2d_transpose schedule on X86 and CUDA (#3948) · c846d17c

* improve conv2d_transpose x86 performance by reusing conv2d schedule

* parallelize across batches to make large-batch conv2d and conv2d_transpose faster

* improve doc for autotvm.task.space.FallbackConfigEntity.fallback_with_reference_log

* add fallback schedule for schedule_conv2d_transpose_nchw_cuda

* fix pylint

* fix pylint

* unify conv2d_transpose declaration in topi.nn and topi.x86

committed Sep 16, 2019

c846d17c Browse Files

[Graph Tuner] Fix benchmark layout in graph tuner (#3926) · b577171d
```
* Fix graph tuner benchmarking layout transform

* Add test
```
Yao Wang committed Sep 17, 2019
b577171d Browse Files
[tvm][codegen] Make buffer auto broadcast independent to the order of input args (#3956) · 8577c81b
```
* [tvm][codegen] Make buffer auto broadcast independent to the order of the input arg

* fix indent
```
Zhi committed Sep 16, 2019
8577c81b Browse Files

[TOPI] operator support: logical_and, logical_or, logical_not (#3929) · ab1853c2

* [TOPI] operator support: logical_and, logical_or, logical_not

* [TOPI] operator support: logical_and, logical_or, logical_not

* [TOPI] fix test cases for operator support: logical_and, logical_or, logical_not

* [TOPI] fix test cases for operator support: logical_not

committed Sep 16, 2019

ab1853c2 Browse Files

[QNN] Legalization for Intel x86 QNN Conv2D (#3896) · 26eaea4a
```
* QNNLegalize for conv2d

* [QNN] Legalization for Intel x86 QNN Conv2D
```
Animesh Jain committed Sep 16, 2019
26eaea4a Browse Files

15 Sep, 2019 3 commits

Enable miopen transpose convolution and fp16 support (#3952) · 9e4f07b4
```
* Enable miopen transpose convolution and fp16 support

* linter
```
Peter Yeh committed Sep 16, 2019
9e4f07b4 Browse Files

[Relay][TensorFlow] Add support for SquaredDifference (#3930) · 0482623e

* Add support for SquaredDifference and StopGradient; minor fix in BatchMatMul

* Remove stopgradient change

* Resolve PR comment

* Dummy change to retrigger CI

* dummy change to retrigger CI

committed Sep 15, 2019

0482623e Browse Files

[AutoTVM] Enhance tuning space of split (#3949) · da039794

* Refine policies for define_split

- Rename policy "all" to "factors"
- Add policy "verbose" and "power2"

* Refine search space

* add doc

committed Sep 14, 2019

da039794 Browse Files

14 Sep, 2019 1 commit
- trivial (#3954) · e35e1cc2
  Junru Shao committed Sep 14, 2019
  
  e35e1cc2 Browse Files
13 Sep, 2019 6 commits
- 1) Add EQ op to the deduce_bound and add unittests for the same (#3775) · 4b431c67
```
2) Add EQ support in the loop partition and add test for the same
3) Change typo truc to trunc
```
  Umang Yadav committed Sep 13, 2019
  4b431c67 Browse Files
- Vulkan2 Runtime API (#3849) · 2536465c
  Andrew Tulloch committed Sep 13, 2019
  
  2536465c Browse Files
- [VTA] RPC path update. (#3924) · 06aecc60
```
Issue:
RPC path get changed into "vta_rpc" from "pynq_rpc", but related
document still use old informaiton.

Solution:
Update RPC path information.
```
  Hua Jiang committed Sep 13, 2019
  06aecc60 Browse Files
- Add AVX512VNNI support for TVM (#3388) · bb82e09f
  Jianyu Huang committed Sep 13, 2019
  
  bb82e09f Browse Files
- Refactoring x86 conv2d_NCHWc (#3944) · eb220d92
  Animesh Jain committed Sep 13, 2019
  
  eb220d92 Browse Files
- Fix CUDA int8x4 vectorize (#3928) · 195973c0
```
* Fix int8x4 vectorize

* Fix gpu shared/local memory accumulate

* Add test_shared_memory for int8x4

* Adjust test format

* Fix cpplint
```
  noituIover committed Sep 12, 2019
  195973c0 Browse Files
12 Sep, 2019 4 commits

Do type checking for the input and kernel in the qnn conv2d (#3904) · 880c2603

* [QNN] Convolution 2D Implementation.

Rebasing. Empty commit.

Clang-format styling.

* Reformatting code.

* Fixing lint issues.

committed Sep 12, 2019

880c2603 Browse Files

[TOPI][CUDA] Support cuBLAS BatchMatMul (#3936) · 88f9bfd4
```
* Support cuBLAS BatchMatMul

* Add test and check target name
```
Jon Soifer committed Sep 12, 2019
88f9bfd4 Browse Files

[RFC] [Contrib] Minimal runtime (~12kb .text on ARMv7/x86) for subset of TVM models (#3567) · 1de52bb0

This is an alternative implementation of a subset of the TVM runtime API (and
graph runtime) that focuses entirely on reducing code size, at the expense of
functionality (no tvm.extern(..) calls via PackedFunc, CPU only, etc). It might
be worth incrementally expanding the surface area if there's interest.

The motivation for this work was seeing what the minimal useful subset of the
TVM runtime is. This is relevant for e.g. super code-size constrained
applications in e.g. embedded/mobile. The current runtime is more like O(100KiB)
or so, so this might be compelling for some users.

The smaller surface area for auditing might make this relevant for
https://github.com/dmlc/tvm/issues/3159, or the usecases I was thinking about in
https://github.com/dmlc/tvm/issues/2523#issuecomment-459165815 re: the Rust
runtime.

The symbols in the tvm::minimalruntime space (i.e. excluding std:: and
picojson::) are about 5KiB, so I think there's a bunch of room here (i.e. we
could replace picojson:: with [`jsmn`](https://zserge.com/jsmn.html) or
something, and we could replace more of the `std::unordered_map` usage, etc with
custom primitives as well (similar to the `DynArray`).

committed Sep 13, 2019

1de52bb0 Browse Files

[Relay][Module] Refactor the way we interface between different modules of Relay. (#3906) · 4e2d707f

* Module refactor

* Add load module

* Add support for idempotent import

* Tweak load paths

* Move path around

* Expose C++ import functions in Python

* Fix import

* Add doc string

* Fix

* Fix lint

* Fix lint

* Fix test failure

* Add type solver

* Fix lint

committed Sep 11, 2019

4e2d707f Browse Files

11 Sep, 2019 4 commits
- [Community] Add reviewer Balint Cristian (#3935) · c31e7771
  Lianmin Zheng committed Sep 11, 2019
  
  c31e7771 Browse Files
- [Arm] parallel batch axis (#3931) · eb3a7382
```
* support LLVM trunk

* guard with USE_LLVM in if condition for c++14

* GREATER_EQUAL -> GREATER

* [Arm] parallel batch axis
```
  Yizhi Liu committed Sep 11, 2019
  eb3a7382 Browse Files
- [TFLite] Support depthwise convolution multiplier greater than 1 (#3922) · 968ffef6
  Zhao Wu committed Sep 10, 2019
  
  968ffef6 Browse Files
- [Relay] fix exponential blowup in interpreter (#3559) · 54dbcc28
  雾雨魔理沙 committed Sep 10, 2019
  
  54dbcc28 Browse Files
10 Sep, 2019 2 commits
- [Relay][Frontend][Keras] Fix ReLU in Keras Converter missed the case (#3917) · 5bff6cce
```
* [Relay][Frontend][Keras] Fix ReLU in Keras Converter missed the case

* [Relay][Frontend][Keras] Add test case for ReLU in Keras Converter missed the case

* [Relay][Frontend][Keras] Add test case for ReLU in Keras Converter missed the case
```
  Neo Chien committed Sep 10, 2019
  5bff6cce Browse Files
- [CODEGEN] Remove incorrect check for LLVM in C codegen test (#3921) · 42195a48
  Pratyush Patel committed Sep 10, 2019
  
  42195a48 Browse Files
09 Sep, 2019 4 commits
- [Relay][Training] Add gradient for max. (#3915) · 0f4c151f
```
* save

* save
```
  雾雨魔理沙 committed Sep 09, 2019
  0f4c151f Browse Files
- [VTA][Config] hotfix denano10 (#3918) · 83d2418a
  Luis Vega committed Sep 09, 2019
  
  83d2418a Browse Files
- Numpy compatible dtype inference for `tvm.convert` and `tvm.const` (#3861) · 63a91ebf
```
* numpy compatible type inference

* update

* try to fix

* fix

* try to fix

* fix lint

* Update nn.h

* cast to int32

* try to fix

* fix again

* retrigger ci
```
  Xingjian Shi committed Sep 10, 2019
  63a91ebf Browse Files
- [Relay/TOPI][Op] Add erf intrinsic and op (#3702) · 2f5b155a
```
* add more ops

* stop vectorization for erf

* x

* cleanup

* fix

* add whitelist for vectorizable intrin

* add tf converter

* fix dense

* fix

* add missing intrin

* fix mxnet frontend

* fix nvptx
```
  Haichen Shen committed Sep 09, 2019
  2f5b155a Browse Files
08 Sep, 2019 2 commits
- [Relay][Training] Add gradient for cast (#3894) · 6a377f77
```
save

fix

fix grad
```
  雾雨魔理沙 committed Sep 07, 2019
  6a377f77 Browse Files
- change docker install script (#3524) · 184fa484
  雾雨魔理沙 committed Sep 08, 2019
  
  184fa484 Browse Files
07 Sep, 2019 7 commits

[Fix] Fix blas cmake for mac os (#3898) · 7a15aedf
```
* fix cmake for mac os

* rename
```
Haichen Shen committed Sep 08, 2019
7a15aedf Browse Files

Support LLVM trunk (#3907) · 8c50469c

* support LLVM trunk

* guard with USE_LLVM in if condition for c++14

* GREATER_EQUAL -> GREATER

committed Sep 08, 2019

8c50469c Browse Files

Fix a typo (#3913) · 6604593b
noituIover committed Sep 07, 2019

6604593b Browse Files
Add .hsaco save/load for ROCm target (#3852) · e8c6adc6
```
fix lld
```
Peter Yeh committed Sep 07, 2019
e8c6adc6 Browse Files
add luis as reviewer (#3909) · 54150cd5
Haichen Shen committed Sep 07, 2019

54150cd5 Browse Files

[VTA] Support TLPP in function simulator. (#3555) · 50c4546f

* [VTA] Support TLPP in function simulator.
Issue:
currently vta function simulator just doing serialized instruction
execution, the dependency logic of runtime ISA which use for task
level pipe line parallelism can not get verified by function simulator.

Solution:
make the simulator driver to be multiple thread and support TLPP.

Benefit:
TLPP support VTA function simulator would make VTA logic testing/debug
/change more easy.

replace boost lockfree queue

add configure control for simulator tlpp enable or disable.

change code tyle into google style.

Wrap queue read/write and sync logic to make function call more simple.

Add some comments.

Remove MT logic, change into Single thread mode.

address review comments.

code style change to match google code style and add comments.

add cmake macro to enable/disable simulator tlpp logic.

submodule update.

correct file name mentioned in comments.

* remove USE_VTA_FSIM_TLPP.

committed Sep 06, 2019

50c4546f Browse Files

[TOPI] Intel graphics conv2d autotvm template added (#3839) · 70042b78

* update lint

* lint fixed

* lint updated

* lint fixed

* lint fixed

* lint fixed

* updates

* add intel graphics as a package

* remove print info

* depthwise conv2d schedule added for intel graphics

* asdf

* fix lint

* fix lint

* fix ci

* add channels

committed Sep 06, 2019

70042b78 Browse Files

06 Sep, 2019 2 commits
- save (#3901) · 02ddb5a9
  雾雨魔理沙 committed Sep 06, 2019
  
  02ddb5a9 Browse Files
- [Relay][Op] Make Type Relation catch more errors (#3899) · 19f8c123
```
* save

* init

* move type_relations
```
  雾雨魔理沙 committed Sep 06, 2019
  19f8c123 Browse Files