Commits · e3ddc8dae10b13459d740da02ec2cf980f0c577b · wenyuanbo / tic

19 Jan, 2018 1 commit
- simplify expr in get_const_tuple (#795) · 079e2307
```
* fix upsampling output shape

* simplify expr in get_const_tuple
```
  masahi committed 7 years ago
  079e2307 Browse Directory
16 Jan, 2018 5 commits

[TOPI] Basic x86 schedules (#775) · 3df42cd7

* add basic x86 schedules

* parallelize & vectorize batchnorm + relu

* fuse conv into bn + relu

* move rc loop to outer

* add nhwc conv

* change weight layout to hwcf

* conv + bn + relu fusion for nhwc conv

* fix conv_nhwc schedule when no fusion

* clean up default parallel schedules

* simplify elemwise parallel

* fix elemwise parallel for batch == 1

* update nhwc conv test

* fix and add comment

* fix lint

* remove redundant import

* remove default multithreading for some ops

* remove default multithreading for global pool

committed 7 years ago

3df42cd7 Browse Directory

fix mali topi for python3 (#789) · 7ca44d7a
Lianmin Zheng committed 7 years ago

7ca44d7a Browse Directory
fix (#788) · b9a6c091
Xingjian Shi committed 7 years ago

b9a6c091 Browse Directory
[TOPI] add schedule for ARM Mali GPU (#786) · 16694815
```
* add schedule for ARM Mali GPU

* fix lint

* fix lint
```
Lianmin Zheng committed 7 years ago
16694815 Browse Directory
[CODEGEN] fix vector conversion for opencl (#783) · 8d263e37
```
* support more argument type in depthwise_conv2d

* mark all pointer as 'restrict' & fix vector conversion for opencl
```
Lianmin Zheng committed 7 years ago
8d263e37 Browse Directory

15 Jan, 2018 1 commit
- try to fix test (#784) · 3ff2d958
```
try to fix

fix
```
  Xingjian Shi committed 7 years ago
  3ff2d958 Browse Directory
12 Jan, 2018 1 commit
- [LLVM] Enable same target option in JITModule (#778) · ee6f22ab
```
* [LLVM] Enable same target option in JITModule

* not set mcpu explicitly
```
  Tianqi Chen committed 7 years ago
  ee6f22ab Browse Directory
11 Jan, 2018 1 commit

[TOPI] Upsampling op support (#772) · be457348

* add upsampling cpu op

* add upsampling gpu schedule

* add doc for upsampling op

add more doc

* cleanup upsampling test

* add doc

* fix lint

* fix lint

* fix lint

* remove unused import

* remove skimage dependency

* remove skimage import

* remove schedule_upsampling

committed 7 years ago

be457348 Browse Directory

04 Jan, 2018 1 commit
- correct conv2d workload for resnet18 (#750) · 29226a5f
  Yizhi Liu committed 7 years ago
  
  29226a5f Browse Directory
03 Jan, 2018 1 commit
- [CONTRIB] rocBLAS integration (#751) · a407ec15
```
* rocblas integration

* fix include

* fix lint
```
  masahi committed 7 years ago
  a407ec15 Browse Directory
02 Jan, 2018 1 commit

[CONTRIB] cuBLAS integration (#744) · 3d5032ae

* add cublas support

* integrate cublas to topi dense

* add cublas error check

* minor fix

* fix lint

* remove topi import from contrib unittest

committed 7 years ago

3d5032ae Browse Directory

29 Dec, 2017 1 commit
- Let CUDNN choose the best algo (#734) · 66fa0c3d
```
* use cudnn findalgo to choose the best algo

* fix lint
```
  masahi committed 7 years ago
  66fa0c3d Browse Directory
27 Dec, 2017 2 commits
- [TOPI]Support dim-0 tensor in topi broadcast/reduce (#731) · 2a8e0746
```
* support dim-0 tensor in topi ops

revert transform

* revert
```
  Xingjian Shi committed 7 years ago
  2a8e0746 Browse Directory
- [TOPI] CUDNN integration (#730) · 85e4058c
```
* add target.libs to target str representation

* integrate cudnn into topi cuda

* append target.libs to target.options
```
  masahi committed 7 years ago
  85e4058c Browse Directory
26 Dec, 2017 1 commit

[TOPI] add extern schedule for cudnn and miopen (#724) · cdb2f873

* add extern schedule for miopen

* fix comment

* optionally dispatch to miopen from topi

* fix lint

* check if current target is None

* use generic dispatch for rocm conv2d

* fix lint

* fix workspace bug

* remove blank line

* remove blank line

* remove blank line

committed 7 years ago

cdb2f873 Browse Directory

25 Dec, 2017 1 commit

[TOPI] 1bit dense operator on x86_64 (#629) · 36b34738

* add x86_64 target

* add binary dense operator

* rebase

* improve schedule

* remove x86 target

* improve schedule

committed 7 years ago

36b34738 Browse Directory

04 Dec, 2017 1 commit
- Support rank-0 tensor (#687) · f2b91392
```
* Support rank-0 tensor

* fix lint
```
  Tianqi Chen committed 7 years ago
  f2b91392 Browse Directory
27 Nov, 2017 1 commit
- [TOPI] Fix for pooling (#673) · 2e3f8e74
  ziheng committed 7 years ago
  
  2e3f8e74 Browse Directory
25 Nov, 2017 1 commit
- [PASS] Allow compact checking when strides is available (#669) · b55361b4
```
* [PASS] Allow compact checking when strides is available

* remove assert compact
```
  Tianqi Chen committed 7 years ago
  b55361b4 Browse Directory
19 Nov, 2017 1 commit

Fixed nnvm issue #239 (#660) · 72992208

* scheduler tweaked for super resolution perf

* conv2d_transpose schedule error fixed

* nnvm issue #239 fixed

committed 7 years ago

72992208 Browse Directory

16 Nov, 2017 1 commit

Conv2d scheduler tweaked for super resolution perf (#652) · 7d620be4

* scheduler tweaked for super resolution perf

* lint error fixed

* lint error fixed

* conv2d_transpose schedule error fixed

committed 7 years ago

7d620be4 Browse Directory

14 Nov, 2017 2 commits
- [TOPI] Add out_dtype argument for conv2d; Add x86 schedules (#646) · c6a1241e
```
* [TOPI] Add out_dtype argument for conv2d; Add x86 schedules

* Fix

* Fix lint

* Fix
```
  ziheng committed 7 years ago
  c6a1241e Browse Directory
- conv2d perf improved for conv2d_56_64_128, super resolution workloads added (#643) · afc693dc
```
* conv2d perf improved for conv2d_56_64_128, test name added to differentiate workloads

* fix lint error
```
  Leyuan Wang committed 7 years ago
  afc693dc Browse Directory
13 Nov, 2017 1 commit

Fix conda packages (#642) · a908b831

* Make the tvm conda package build with in-place source and use cmake from conda.

* Add a package for topi.

committed 7 years ago

a908b831 Browse Directory

09 Nov, 2017 1 commit
- android gemm for topi/recipe (#628) · 35485307
  Yizhi Liu committed 7 years ago
  
  35485307 Browse Directory
08 Nov, 2017 1 commit
- conv2d_56_64_128 mark==1 bug fixed (#624) · 25847a4f
  Leyuan Wang committed 7 years ago
  
  25847a4f Browse Directory
06 Nov, 2017 1 commit
- [TOPI] fix weight layout in conv2d_transpose (#616) · c1008ec4
  Yuwei Hu committed 7 years ago
  
  c1008ec4 Browse Directory
03 Nov, 2017 1 commit
- [TOPI] modify conv2d_transpose schedule (#613) · a152a9cb
  Yuwei Hu committed 7 years ago
  
  a152a9cb Browse Directory
30 Oct, 2017 1 commit
- vgg16 workload error fixed (#598) · 3c895464
  Leyuan Wang committed 7 years ago
  
  3c895464 Browse Directory
27 Oct, 2017 1 commit
- [TOPI] Support ceil_mode in pooling (#593) · 88662130
  Tianqi Chen committed 7 years ago
  
  88662130 Browse Directory
26 Oct, 2017 1 commit
- add helpful message to topi test (#592) · 2f2170f4
  masahi committed 7 years ago
  
  2f2170f4 Browse Directory
25 Oct, 2017 1 commit
- [TOPI] add conv2d_transpose_nchw (#586) · 5f79521b
  Yuwei Hu committed 7 years ago
  
  5f79521b Browse Directory
23 Oct, 2017 1 commit

Update topi/cuda schedules to use target.max_num_threads (#577) · 12218358

* update topi/cuda schedules to use target.max_num_threads

* allow num_thread to be larger than cuda.max_num_threads

* remove get_max_num_threads and make it inline

committed 7 years ago

12218358 Browse Directory

22 Oct, 2017 1 commit
- [PASS] More robust UnrollLoop configuratin (#576) · 0f1e0ff0
  Tianqi Chen committed 7 years ago
  
  0f1e0ff0 Browse Directory
15 Oct, 2017 1 commit
- [CODEGEN] Bugfix multiple condition generation (#558) · 163c4795
  Tianqi Chen committed 7 years ago
  
  163c4795 Browse Directory
14 Oct, 2017 2 commits
- [Refactor] Introduce target generic dispatch system (#556) · eb761f36
```
* [TVM] Introduce target generic dispatch system

* fix target warning
```
  Tianqi Chen committed 7 years ago
  eb761f36 Browse Directory
- enable rocm target for topi/recipes. add timing util to gemm test. (#554) · c3cac464
  masahi committed 7 years ago
  
  c3cac464 Browse Directory
13 Oct, 2017 2 commits
- Add rocm target to topi tests (#548) · 85c545c7
```
* add masahi to contributors

* enable rocm target in topi tests
```
  masahi committed 7 years ago
  85c545c7 Browse Directory
- [TOPI] Fix declaration for different dtypes (#546) · b20678b0
  ziheng committed 7 years ago
  
  b20678b0 Browse Directory