Commits · cedd3900621ada1813d0465e234a0fecfdf76bff · wenyuanbo / tic

08 Nov, 2017 2 commits
- Support vector operations for AMD (llvm IR) (#623) · cedd3900
```
* Support vector operations for AMD (llvm IR)

* fix whitespace

* update comments, docstring
```
  eqy committed Nov 08, 2017
- conv2d_56_64_128 mark==1 bug fixed (#624) · 25847a4f
  Leyuan Wang committed Nov 08, 2017
  
07 Nov, 2017 1 commit

- remove minimum 32-bit restriction (#621) · 08e4d085

  Change the minimum bit-width restriction for floating point types from 32-bit to 8-bit.
  This enables reduced-precision types that may use vector operations under the hood (cases where #lanes > 1, such as half4).

  committed Nov 06, 2017

06 Nov, 2017 2 commits
- add tanh dispatch (#619) · c7101537
  masahi committed Nov 06, 2017
  
- [TOPI] fix weight layout in conv2d_transpose (#616) · c1008ec4
  Yuwei Hu committed Nov 06, 2017
  
03 Nov, 2017 2 commits
- [DLPack] Upgrade dlpack to 0.2 (#609) · 8214d6ca
  Tianqi Chen committed Nov 03, 2017
  
- [TOPI] modify conv2d_transpose schedule (#613) · a152a9cb
  Yuwei Hu committed Nov 03, 2017
  
02 Nov, 2017 1 commit
- [INTRIN] Enable popcount (#606) · 685f78d0
```
* enable popcount intrin

* fix lint

* add test

* fix python3
```
  Yuwei Hu committed Nov 02, 2017
01 Nov, 2017 1 commit
- Fixed build with metal on MacOS with case-sensitive FS (#601) · 3bb2eef5
  Cyril Lashkevich committed Nov 01, 2017
  
30 Oct, 2017 1 commit
- vgg16 workload error fixed (#598) · 3c895464
  Leyuan Wang committed Oct 30, 2017
  
27 Oct, 2017 1 commit
- [TOPI] Support ceil_mode in pooling (#593) · 88662130
  Tianqi Chen committed Oct 27, 2017
  
26 Oct, 2017 4 commits
- add helpful message to topi test (#592) · 2f2170f4
  masahi committed Oct 26, 2017
  
- [ROCM] remove fma dispatch (#591) · 20144de2
```
* removed fma dispatch

* added comments to explain why remove fma

* fix lint

* use fmuladd intrin for fma dispatch
```
  masahi committed Oct 26, 2017
- [ROCM] View llvm ir and gcn asm with module.get_source(...) (#590) · 6a5d6165
```
* view llvm ir and gcn asm with module.get_source(...)

* fix lint
```
  masahi committed Oct 26, 2017
- [BUFFER] Smarter slice to detect compactness (#587) · a76851d7
```
* [BUFFER] Smarter slice to detect compactness

* move simplify of begins early
```
  Tianqi Chen committed Oct 25, 2017
25 Oct, 2017 1 commit
- [TOPI] add conv2d_transpose_nchw (#586) · 5f79521b
  Yuwei Hu committed Oct 25, 2017
  
24 Oct, 2017 3 commits
- [PYTHON] Allow no de-allocation when exit (#583) · 25f95766
  Tianqi Chen committed Oct 24, 2017
  
- [CODEGEN] Fix CPU compute attribute (#582) · da27cfec
  Tianqi Chen committed Oct 24, 2017
  
- [DOCS] Fix tag_scope example (#581) · 18e4a1bd
  Wei Chen committed Oct 24, 2017
  
23 Oct, 2017 1 commit

- Update topi/cuda schedules to use target.max_num_threads (#577) · 12218358

  * update topi/cuda schedules to use target.max_num_threads
  * allow num_thread to be larger than cuda.max_num_threads
  * remove get_max_num_threads and make it inline

  committed Oct 22, 2017

22 Oct, 2017 3 commits
- [PASS] More robust UnrollLoop configuration (#576) · 0f1e0ff0
  Tianqi Chen committed Oct 22, 2017
  
- add friendly tips when not found cl and link (#574) · 69759c0c
```
* add friendly tips when not found cl and link

* fix lint
```
  Hu Shiwen committed Oct 21, 2017
- [SCHEDULE] Detect duplicate IterVar in reorder (#575) · 1791b121
  Wei Chen committed Oct 21, 2017
  
20 Oct, 2017 1 commit

- [ROCM] Working math function support for ROCm backend, a bug fix in LLVM based codegen (#570) · 326edd76

  * added math function support
  * bug fix extern func call in llvm based codegen (lint fix; fix build)
  * moved rocm bitcodes detection to python

  committed Oct 19, 2017

19 Oct, 2017 1 commit

- [PYTHON] Improve equality wrapper (#567) · ab858e3f

  Use `object.__eq__` (the default object identity comparison) as the default
  implementation of same_as. This should be OK: since `EqualOp` and
  `NotEqualOp` are pure Python objects, `object.__eq__` is sufficient.

  committed Oct 18, 2017
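
As a rough illustration of the commit message above (not the project's actual code), the sketch below shows what binding `object.__eq__` as `same_as` does on a pure Python class in Python 3. The class name `EqualOpSketch` is hypothetical. When called directly, `object.__eq__` returns `True` for the same object and `NotImplemented` (not `False`) for two different objects.

```python
class EqualOpSketch:
    """Hypothetical pure-Python stand-in; not the project's EqualOp."""

    # Reuse the default identity comparison as same_as (Python 3).
    same_as = object.__eq__


x = EqualOpSketch()
y = EqualOpSketch()

same = x.same_as(x)       # True: both arguments are the same object
different = x.same_as(y)  # NotImplemented: object.__eq__ declines to compare
```

Callers that need a strict boolean may prefer to test the result with `is True` rather than relying on truthiness.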


17 Oct, 2017 2 commits
- [PYTHON] Improve equal sugar (#564) · 9a2f01ab
```
* [PYTHON] Improve equal sugar

* fix comment
```
  Tianqi Chen committed Oct 17, 2017
- [CODEGEN] Use correct math intrin for metal (#562) · 60510a47
  Tianqi Chen committed Oct 16, 2017
  
16 Oct, 2017 3 commits
- [ARITH] More canonical simplify (#561) · 621337d5
```
* [ARITH] More canonical simplify

* [DEBUG] Use HalideIR with trace logging
```
  Tianqi Chen committed Oct 16, 2017
- [FIX] Fix target warning (#560) · 9e8bae25
```
* [FIX] Fix target warning

* [FIX] Deduplicate options

* Fix

* Fix
```
  ziheng committed Oct 16, 2017
- [CODEGEN] Allow link additional module (#559) · 6894d42b
```
* [CODEGEN] Allow link additional module

* fix py3

* add register back
```
  Tianqi Chen committed Oct 15, 2017
15 Oct, 2017 2 commits
- [CODEGEN] Bugfix multiple condition generation (#558) · 163c4795
  Tianqi Chen committed Oct 15, 2017
  
- [CODEGEN] Force not inline compute core for better debug (#557) · 10faa893
```
* [CODEGEN] Force not inline compute core for better debug

* also support llvm4
```
  Tianqi Chen committed Oct 14, 2017
14 Oct, 2017 3 commits
- [Refactor] Introduce target generic dispatch system (#556) · eb761f36
```
* [TVM] Introduce target generic dispatch system

* fix target warning
```
  Tianqi Chen committed Oct 14, 2017
- enable rocm target for topi/recipes. add timing util to gemm test. (#554) · c3cac464
  masahi committed Oct 14, 2017
  
- [CODEGEN] Detect broadcast(cast(x)) pattern in FMA (#551) · 592a1f65
```
* [CODEGEN] Detect broadcast(cast(x)) pattern in FMA

* [CODEGEN] Improve

* [CODEGEN] Fix
```
  ziheng committed Oct 13, 2017
13 Oct, 2017 5 commits

- Add same_as to NodeBase (#550) · fde9b570

  * Add same_as to NodeBase

  1. Most classes inherited from NodeBase (Schedule, Stage, etc.) still have
  the convenience of using '==' for object identity, which is the right
  behavior for non-Expr classes.
  2. Subclasses of ExprOp now create an EQ expression when '==' is used.

  `__nonzero__` and `__bool__` in EQ and NE are a compromise: in some cases
  object identity semantics are still useful, for example in unit tests. For instance:
````
assert a == b
````

  "a == b" will create an EQ expression; assert then calls `__nonzero__` of the
  result expression. `Expr.__nonzero__` throws an exception since it prohibits
  evaluating an IR expression.

  A more complex case:
````
assert a in b # b is dict
````

  This calls `__eq__` on a and all keys of b, then `__bool__` on the resulting
  expression. This could not easily be done with same_as.

  * Retain __hash__ from NodeBase in Python3

  committed Oct 13, 2017
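
The behavior described in the commit message above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the project's implementation: the classes below are simplified stand-ins that only borrow the names `NodeBase`, `ExprOp`, and `EqualOp`, and `Var` is a hypothetical expression type added for the demo.

```python
class NodeBase:
    """Simplified stand-in: '==' means object identity for non-Expr nodes."""

    def same_as(self, other):
        # Identity check that does not depend on any '==' overload.
        return self is other

    def __eq__(self, other):
        return self.same_as(other)

    # Defining __eq__ clears the inherited __hash__ in Python 3, so restore it.
    __hash__ = object.__hash__


class EqualOp:
    """Deferred result of 'a == b' on expressions (stand-in for an EQ node)."""

    def __init__(self, a, b):
        self.a = a
        self.b = b

    def __bool__(self):
        # The compromise: truthiness falls back to object identity.
        return self.a.same_as(self.b)

    __nonzero__ = __bool__  # Python 2 spelling of the same hook


class ExprOp(NodeBase):
    """Simplified stand-in: '==' builds an expression instead of a bool."""

    def __eq__(self, other):
        return EqualOp(self, other)

    __hash__ = NodeBase.__hash__


class Var(ExprOp):
    """Hypothetical expression type used only for this demo."""

    def __init__(self, name):
        self.name = name


a, b = Var("a"), Var("b")
cond = a == b                  # builds an EqualOp instead of returning a bool
identity_holds = bool(a == a)  # truthiness falls back to same_as
found = a in [b, a]            # membership goes through __eq__ and __bool__
table = {a: 1}                 # hashing still works because __hash__ is retained
```

With this split, `same_as` stays available for explicit identity checks, while `assert a == b` and `in` keep working in tests through `__bool__`.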


- added support for rocm gpu autodetect (#549) · ed783689

  * added support for rocm gpu autodetect
  * changed type casting from old style to static_cast
  * fixed code to generate gfx specific code object
  * fixed namespaces

  committed Oct 13, 2017

- add msvc in cc (#531) · 87c929f5
  Hu Shiwen committed Oct 13, 2017
- Add rocm target to topi tests (#548) · 85c545c7
```
* add masahi to contributors

* enable rocm target in topi tests
```
  masahi committed Oct 13, 2017
- [CODEGEN] Skip unrolled hint, export symbol on win32 (#547) · 74b0ca86
  Tianqi Chen committed Oct 12, 2017