Commits · c493baf3a47a43b10e45f64c6fb6aa0e30fa273f · wenyuanbo / tic

21 Aug, 2017 1 commit
- Update installation guide of windows (#364) · c493baf3
```
* update installation guide of windows

* update installation doc of windows
```
  Xingjian Shi committed Aug 20, 2017
20 Aug, 2017 6 commits
- [CODEGEN][LLVM] Refactor cpu runtime related code to CodeGenCPU (#361) · 72d64520
  Tianqi Chen committed Aug 20, 2017
  
- [iOS] Better RPC guide and bug fix (#357) · 7d5d9ec9
  ziheng committed Aug 20, 2017
  
- [DOC] Add install prerequisites (#358) · 422bf824
```
Add install prerequisites of customized building
```
  Shuai Yuan committed Aug 20, 2017
- [BUILD][LLVM] Support LLVM mainline 5.0 6.0 (#356) · 622be047
```
* [BUILD][LLVM] Support LLVM mainline 5.0 6.0

* Reduce parallelism
```
  Tianqi Chen committed Aug 20, 2017
- [Python] Dist wheel tools (#348) · c6ebb5a1
  ziheng committed Aug 19, 2017
  
- changed makefile to build rocm backend (#355) · fa232720
  Aditya Atluri committed Aug 19, 2017
  
19 Aug, 2017 1 commit
- modify schedule_depthwise_conv2d_nchw (#350) · fa53dbdf
  Yuwei HU committed Aug 19, 2017
  
18 Aug, 2017 2 commits
- conv_nchw parameter updated to the one which generates mobilenet benchmarks, doc typo fixed (#345) · ed9f3897
```
* conv_nchw parameter updated to the one which generates mobilenet benchmarks, doc typo fixed

* removed unused variables
```
  Leyuan Wang committed Aug 18, 2017
- update depthwise convolution api (#344) · b5c6b993
  Yuwei HU committed Aug 17, 2017
  
17 Aug, 2017 6 commits
- Add tutorial for convolution in CUDA (#343) · 8ca12d87
  Haichen Shen committed Aug 17, 2017
  
- [DOC] Add link to release blog (#342) · d2a98a05
  Tianqi Chen committed Aug 17, 2017
  
- [SUBMODULE] switch to https (#341) · 8b247607
  Tianqi Chen committed Aug 17, 2017
  
- [DOC] Release note (#340) · 7ffcff2d
  Tianqi Chen committed Aug 17, 2017
  
- Fix CUDA library search (#339) · ae6abe82
  William Moses committed Aug 17, 2017
  
- Allow install-dev to include all necessary header files (#338) · 48ec5445
  William Moses committed Aug 17, 2017
  
16 Aug, 2017 3 commits

- [PASS] RewriteUnsafeSelect lowers unsafe select to condition expr (#335) · 090468aa
  Tianqi Chen committed Aug 15, 2017
- [NNPack] Support for threadpool (#334) · 25ded693
```
* [NNPack] Support for threadpool

* fix lint

* fix lint

* Use static class function
```
  ziheng committed Aug 15, 2017
- [WIP] [TOPI] Depth wise Conv for NHWC (#325) · 989e99e6
  * rename the nchw and pass the unit test; going to do it for nhwc depthwise
  * bug with fusion
  * nchw works fine; nhwc float32 problem remains
  * still cannot bind them together
  * fusion works
  * syntax fix
  * all bugs fixed; test cases pass
  * minor fix on nn.h

  committed Aug 15, 2017

15 Aug, 2017 11 commits
- [Contrib] CuDNN v7 Support (#311) · 64870ffb
```
* [Contrib] CuDNN v7 Support

* Add test
```
  ziheng committed Aug 15, 2017
- [BUILD] Enable cudnn in gpu build (#333) · 0ccc281d
  Tianqi Chen committed Aug 15, 2017
  
- [TOPI] Isolate padding option, improve decl of depthwise/conv2d/pool (#332) · 7196c791
  Tianqi Chen committed Aug 15, 2017
  
- [TOPI] Improve dilate (#330) · abccd9cd
  Tianqi Chen committed Aug 15, 2017
  
- [TOPI] Fix conv2d for small input channels (#331) · 9ac46bea
```
* __init__ updated

* pull request updated

* build_module added

* typo fixed

* another typo fixed

* conv2d gpu scheduler for two layouts moved to tvm

* changes made according to CR

* conv2d_nchw formatting updated, conv2d_hwcn tests updated

* lint error fixed

* element wise operator schedule fusing fixed for conv2d

* conv2d_nchw topi test added, all resnet workloads now pass

* conv compute lint error fixed

* fixed python 3 compatibility problem

* conv2d tensor input support added, test typo fixed, ir_pass.Simplify changed to util.get_const_int

* fixed channel number < 4 error, also made sure other splitting factor wouldn't be 0
```
  Leyuan Wang committed Aug 15, 2017
- [TOPI] Add ops compute (#323) · 0ad590c0
```
* [TOPI] Add ops compute

Remove 'compute' and add assert for safety

Add document

fix lint

fix softmax

* fix batch norm
```
  ziheng committed Aug 14, 2017
- [DOC] Document update (#329) · ce18b565
  Tianqi Chen committed Aug 14, 2017
  
- update depthwise_conv2d schedule and testing (#328) · 07e56b9a
  Yuwei HU committed Aug 14, 2017
  
- [TOPI] Move ewise.h -> elemwise.h (#327) · 8edd047b
```
* [TOPI] Move ewise.h -> elemwise.h

* fix test
```
  Tianqi Chen committed Aug 14, 2017
- [TOPI] Add broadcast and reduce operators (#267) · 760475f9
```
[TOPI] Add broadcast and reduce operators
```
  Xingjian Shi committed Aug 14, 2017
- [BUILD] Simplify build process (#326) · a59774e3
  Tianqi Chen committed Aug 14, 2017
  
14 Aug, 2017 5 commits

- [TOPI] C++ doc (#320) · cbdd14f1
  Nicolas Vasilache committed Aug 14, 2017
- [TOPI] add dilation operators (#316) · b0c42f3b
  * add dilation operators
  * fix pylint
  * dilate testcases success
  * n-D tensor dilation
  * support arbitrary dimension

  committed Aug 14, 2017
- [DOC] Include TOPI in doxygen (#321) · ba6664a3
```
* [DOC] Include TOPI in doxygen

* update
```
  Tianqi Chen committed Aug 14, 2017
- [TOPI] conv2d nchw gpu scheduler (#315) · cbff637f
  * __init__ updated
  * pull request updated
  * build_module added
  * typo fixed
  * another typo fixed
  * conv2d gpu scheduler for two layouts moved to tvm
  * changes made according to CR
  * conv2d_nchw formatting updated, conv2d_hwcn tests updated
  * lint error fixed
  * element wise operator schedule fusing fixed for conv2d
  * conv2d_nchw topi test added, all resnet workloads now pass
  * conv compute lint error fixed
  * fixed python 3 compatibility problem
  * conv2d tensor input support added, test typo fixed, ir_pass.Simplify changed to util.get_const_int

  committed Aug 13, 2017
- [TOPI] Move topi.nn.util to topi.util (#319) · d76712d1
```
* [TOPI] Move topi.nn.util to topi.util

* update the path
```
  Tianqi Chen committed Aug 13, 2017

13 Aug, 2017 3 commits

- [WIP] C++ topi contributions (#312) · f08de2b6
  * [WIP] C++ topi contributions

    Summary:
    This diff implements C++ topi contributions for:
      - relu with parametric threshold
      - pad with generic padBefore / padAfter specification
      - matmult with transposes
      - conv2d_nchw, conv2d_hwcn with runtime constant padding and strides
      - depthwise_conv2d_nchw with runtime constant padding and strides
      - group_conv2d_ngchw with runtime constant padding and strides
      - broadcast_to a broadcastable shape
      - broadcast_bop where bop is a usual binary op (+ - * / %)

    Convolution padding is implemented using the pad operation.
    To avoid extra memory consumption, it is generally recommended to inline the padding with the autoinliner.
    Unfortunately, in its current form the elemwise checks are too restrictive to allow inlining.
    So this diff also proposes an extension to LHS injective (i.e. no reduction axis in the current IR design).

    Test Plan:
    Tested in C++ testsuite in a separate repository; I am looking for suggestions to quickly spin up some tests for tvm.

    Reviewers: tqchen
    Subscribers:
    Tasks:
    Tags:
    Blame Revision:

  * Review + Lint + GSG C++

  committed Aug 13, 2017
- [PASS][PRAGMA] Allow pragma debug_skip_region to skip region of computation (#318) · a3776ba5
  Tianqi Chen committed Aug 13, 2017
- [PASS] Memory barrier detection, storage access lower. (#317) · 79e482bc
  Tianqi Chen committed Aug 13, 2017

12 Aug, 2017 1 commit
- [PASS] More improvement of canonical (#314) · afa20869
  Tianqi Chen committed Aug 11, 2017
  
11 Aug, 2017 1 commit
- minor fix (#313) · 3c2569a0
  Yuwei HU committed Aug 11, 2017
  