Commits · 4c0a53dc5bef49797ccbb5f05b0c61ca832d7c85 · wenyuanbo / tic

10 Apr, 2020 1 commit

[REFACTOR][IR] Move to runtime::String (#5276) · 5da361d3

* Use runtime::String

* move string to tvm namespace

* add const char* constructor

* implicit cast from std::string

committed 4 years ago

5da361d3 Browse File

21 Feb, 2020 1 commit

[CODEGEN] Support cuda tensorcore subbyte int data type in auto tensorcore (#4546) · f23ac969

* support cuda tensorcore subbyte int data type in auto tensorcore

* add lisence

* pass cpplint

* fix code review comments

* merge the int4/int1 codegen tutorial into the existing auto tensorcore tutorial

* using master's new API

* disable tuning when cuda is not enabled

* address cr comment

* do not run the tuning

* fix test failure

* fix cpplint error

* fix bool type reduction bug

* 1. fix a index bug 2. fix returned bytes value of int1/int4/uint4

* fix typo

committed 5 years ago

f23ac969 Browse File

19 Jan, 2020 1 commit

[REFACTOR] Establish tir (#4740) · cf59b206

TIR is the new namespace for low-level IR
for tensor-level optimizations and loop transformations.

This PR establishes the namespace and files.

- lowered_func.h,buffer.h,data_layout.h -> tir/buffer.h,tir/data_layout.h,tir/lowered_func.h
- ir.h -> tir/expr.h, tir/stmt.h
- ir_functor_ext.h -> tir/expr_functor.h, tir/stmt_functor.h

committed 5 years ago

cf59b206 Browse File

16 Jan, 2020 1 commit

[REFACTOR][ARITH] Unified IR, introduce arith subfolder. (#4722) · c7a83199

Spread the arithmetic.h into several components and move
into arith subfolder.

The arith namespace will be used for arithmetic expression
pattern detections and simplifications.

committed 5 years ago

c7a83199 Browse Directory

15 Jan, 2020 1 commit

[REFACTOR][IR] Unify IntImm and UIntImm (#4706) · ce807fe8

* [REFACTOR][IR] Unify IntImm and UIntImm

This PR unifies UIntImm and IntImm to simplify the codebase.
Unsigned integer constants will also be stored as IntImm.

For uint constant that does not fit into int64(rare case), we introduced
an intrinsic tvm_big_uint_imm to construct such intgers by its
lower and higher 32bits.

* [REFACTOR][IR] Remove UIntImm to use IntImm

* rename big->large

committed 5 years ago

ce807fe8 Browse Directory

09 Jan, 2020 1 commit

[REFACTOR][IR] tvm::Expr -> PrimExpr(Primitive Expr) (#4669) · d6a23cf5

* [REFACTOR][IR] tvm::Expr -> PrimExpr(Primitive Expr)

As part of unified IR, we will need to unify relay::Expr
and the current tvm::Expr under the same base type.

From the techinical point of view. tvm::Expr is a "primitive"
expression that only contains POD types and handles and does
not do life-cycle management.

This PR renames Expr->PrimExpr to clarify that.
We will send a subsequent PR to introduce the base expr class.

* Remove legacy VarExpr and ExprHash/Equal

committed 5 years ago

d6a23cf5 Browse Directory

08 Jan, 2020 1 commit

[REFACTOR][IR] Add Node suffix to low-level IR nodes (#4649) · f4c5f93b

* [REFACTOR][IR] Variable -> VarNode

* [REFACTOR][IR] Add/Sub/Mul/Div -> AddNode/SubNode etc.

* [REFACTOR][IR] Min/Max/FloorDiv/FloorMod -> MinNode/MaxNode etc.

* [REFACTOR][IR] EQ/NE/LT/LE/GT/GE/Select -> EQNode/NENode etc.

* [REFACTOR][IR] Add Node suffix to Select/Call/Load/Ramp/Shuffle/Let

* [REFACTOR][IR] Add node suffix to IntImm/UIntImm/FloatImm/StringImm

* [REFACTOR][IR] Add Node suffix to Any, AttrStmt, AssertStmt

* [REFACTOR][IR] Add Node suffix to Store/Provide/Allocate/Free

* [REFACTOR][IR] Add Node suffix to ProducerConsumer

* Fix lint

* style updates, test fixes

committed 5 years ago

f4c5f93b Browse Directory

06 Jan, 2020 1 commit

[REFACTOR][IR] Introduce SeqStmt to replace ir::Block (#4627) · 3595cbe0

* [REFACTOR][IR] Introduce SeqStmt to replace Block

ir::Block was used to represent a sequence of Stmts in the original low-level IR.
The nested ir::Block structure is not really friendly for recursive visits,
especially when the statements are unrolled.

This PR introduce a SeqStmt that directly stores a sequence of statements in an Array container.
The new SeqStmt will be used as a replacement of the original Block structure.

* [REFACTOR] Migrate use of Block to SeqStmt.

* [REFACTOR] Remove Block

* Add more comments per yizhi's comment

committed 5 years ago

3595cbe0 Browse Directory

22 Dec, 2019 1 commit

[REFACTOR][DTYPE] Isolate dtype to runtime (#4560) · 7fa8aab5

dtype.h -> runtime/data_type.h

Changes:
- Rename all old reference of tvm::Type to DataType
- ExprNode.type -> ExprNode.dtype
- Expr.type() -> Expr.dtype()
- Change Expr related functions to expr_operator.
  - DataType::min() -> min_value(DataType)
  - DataType::max() -> max_value(DataType)
- Move type constructor Int, UInt, Float, Handle, Bool into DataType.
  - Int(bits) -> DataType::Int(bits)
  - UInt(bits) -> DataType::UInt(bits)

committed 5 years ago

7fa8aab5 Browse Directory

21 Oct, 2019 1 commit

[REFACTOR][NODE][RUNTIME] Move Node to the new Object protocol. (#4161) · 7895adb2

* [REFACTOR][NODE][RUNTIME] Move Node to the new Object protocol.

This PR removes the original node system, and make node as a subclass of Object.
This is a major refactor towards a better unified runtime object system.

List of changes in the refactor:

- We now hide data_ field, use Downcast explicitly to get a sub-class object.
- Removed the node system FFI in python.
- Removed the node C API, instead use PackedFunc for list and get attrs.
- Change relay::Op::set_attr_type_key(attr_key_name) to relay::Op::set_attr_type<AttrType>().
  - This change was necessary because of the new Object registration mechanism.
  - Subsequent changes to the op registrations
  - The change revealed a few previous problems that is now fixed.
- Patched up a few missing node type registration.
  - Now we will raise an error if we register object that is not registered.
- The original node.h and container.h are kept in the same location.
- Calling convention: kObjectHandle now equals the old kNodeHandle, kNodeHandle is removed.
- IRFunctor now dispatches on ObjectRef.
- Update to the new type checking API: is_type, derived_from are replaced by IsInstance.
- Removed .hash member function, instead use C++ convention hasher functors.

* Address review comments

committed 5 years ago

7895adb2 Browse Directory

25 Sep, 2019 2 commits

[ARITH] Refactor to use explicit div/mod functions instead of operators. (#4000) · f0079a57
```
* [ARITH] Use explicit div/mod functions instead of operators.

* fix pooling case
```
Tianqi Chen committed 5 years ago
f0079a57 Browse Directory

Changes to make tensorize work. These changes also fix the previously broken test. (#3981) · b410df8c

* Changes to make tensorize work. These changes also fix the previously
broken test.

Summary:
Tensorize was breaking  for a few reasons.
1)
Assert at: src/op/tensorize.cc:234 CHECK(is_one(e.region[j]->extent))
In some cases this cannot be proven, e.g.:
expected shape=[16, 4], given region=[range(min=((ax1.outer*16)/16), ext=(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer)), range(min=((k.outer*4)/4), ext=(((((k.outer*4) + 3)/4) + 1) - k.outer)), range(min=0, ext=16), range(min=0, ext=4)]
The unprovable one is: ext=(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer)).
This can be simplified but it is not because to simplify divide, it must
prove ax1.outer > 0 and since it is var it cannot. The fix for this to
just find all the vars in expr in relace them with some const value.

2) Equivalence between tensorized expr and one being asked to tensorize. For example,
the error would be.
TVMError: Check failed: Equal(lhs, rhs):
Failed to match the compute with TensorIntrin tensor_intrin's declaration
provided= reduce(combiner=comm_reducer(result=[(x + y)], lhs=[x], rhs=[y], identity_element=[(int16)0]), source=[(int16(data(k))*int16(kernel(((((((((k.outer.outer*64) + (k.outer.inner*2)) + k)/2)*128) + i) - (k.outer.inner*128)) - (k.outer.outer*4096)), ((((k.outer.outer*64) + (k.outer.inner*2)) + k) % 2))))], axis=[iter_var(k, range(min=0, ext=2))], where=(bool)1, value_index=0),
intrin=  reduce(combiner=comm_reducer(result=[(x + y)], lhs=[x], rhs=[y], identity_element=[(int16)0]), source=[(int16(data(k))*int16(kernel(i, k)))], axis=[iter_var(k, range(min=0, ext=2))], where=(bool)1, value_index=0)
Difference is mainly in the source part:
source=[(int16(data(k))*int16(kernel(((((((((k.outer.outer*64) + (k.outer.inner*2)) + k)/2)*128) + i) - (k.outer.inner*128)) - (k.outer.outer*4096)), ((((k.outer.outer*64) + (k.outer.inner*2)) + k) % 2))))]
source=[(int16(data(k))*int16(kernel(i, k)))], axis=[iter_var(k, range(min=0, ext=2))]
This was not being simpifiled due to compute_intrin_iter_space (map for
iter var to range) not containing leaf iter vars.

3) Here it fails with:
Check failed: is_one(Simplify(value->shape[i])): Argument b_buffer shape mismatch[16, 4] vs [(((((ax1.outer*16) + 15)/16) + 1) - ax1.outer), (((((k.outer*4) + 3)/4) + 1) - k.outer), 16, 4]
This is in buffer binding where it thinks expected and buffer bound
shape is different. Although if we could simplify expr, this would not
be the case.

Test Plan:
On skylake avx512 machine:
python tests/python/contrib/test_gemm_acc16.py

Reviewers:

Subscribers:

Tasks:

Tags:

* Implemented bounded analyzer which traverses tree and for reduce/for
statements binds the bound of the analyzer. Later this is used to
simplify expressions. Inspired from ir_mutator_with_analyzer

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

* Addressed comments.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

* Added ASF header + define macro for the header file: TVM_ARITHMETIC_IR_VISITOR_WITH_ANALYZER_H_
Some lint fixes as well.

* Relax the assumption that dom_map must always contain all leaf itervars.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

* Disable copy constructor and move to raw ptr.

Summary:

Test Plan:

Reviewers:

Subscribers:

Tasks:

Tags:

committed 5 years ago

b410df8c Browse Directory

17 Aug, 2019 1 commit
- Fix ArgBinder assert order (#3794) · f9d8d063
  Wuwei Lin committed 5 years ago
  
  f9d8d063 Browse Directory
06 Jul, 2019 1 commit
- [ARITH] Refactor: Remove un-necessary usage of ComputeExpr (#3503) · 59448fed
  Tianqi Chen committed 5 years ago
  
  59448fed Browse Directory
02 Jul, 2019 1 commit

[Codegen] Support broadcast op with symbolic shape (#3389) · 0af5c216

* [Codegen] Support broadcast op with symbolic shape

* fix case where last dim = 1

* use enum; simplify stride calculation; improve doc

* fix lint

* improve py doc

committed 5 years ago

0af5c216 Browse Directory

08 Apr, 2019 1 commit

[HEADER] Add Header to Comply with ASF Release Policy (#2982) · cffb4fba

* [HEADER] ASF header dir=include

* [HEADER] ASF Header dir=src

* [HEADER] ASF Header -dir=python

* [HEADER] ASF header dir=topi

* [HEADER] ASF Header dir=nnvm

* [HEADER] ASF Header -dir=tutorials

* [HEADER] ASF Header dir=tests

* [HEADER] ASF Header -dir=docker

* fix whitespace

* [HEADER] ASF Header -dir=jvm

* [HEADER] ASF Header -dir=web

* [HEADER] ASF Header --dir=apps

* [HEADER] ASF Header --dir=vta

* [HEADER] ASF Header -dir=go

* temp

* [HEADER] ASF Header --dir=rust

* [HEADER] Add ASF Header --dir=cmake

* [HEADER] ASF Header --dir=docs

* [HEADER] Header for Jenkinsfile

* [HEADER] ASF Header to toml and md

* [HEADER] ASF Header to gradle

* Finalize rat cleanup

* Fix permission

* Fix java test

* temporary remove nnvm onnx test

committed 5 years ago

cffb4fba Browse Directory

06 Oct, 2018 1 commit
- [LANG] Generalize compute to tensor region (#1476) · b90620ea
  ziheng committed 6 years ago
  
  b90620ea Browse Directory
23 Aug, 2018 1 commit
- Remove leading "./" from include paths (#1640) · b95b5958
  MORITA Kazutaka committed 6 years ago
  
  b95b5958 Browse Directory
26 Jul, 2018 1 commit
- Fix more type annotation (#1490) · b0ef376a
  Tianqi Chen committed 6 years ago
  
  b0ef376a Browse Directory
09 Mar, 2018 1 commit
- Assert dont crash on null strides (#976) · 537b70e4
  Chris Nuernberger committed 6 years ago
  
  537b70e4 Browse Directory
04 Dec, 2017 1 commit
- Support rank-0 tensor (#687) · f2b91392
```
* Support rank-0 tensor

* fix lint
```
  Tianqi Chen committed 7 years ago
  f2b91392 Browse Directory
25 Nov, 2017 1 commit
- [PASS] Allow compact checking when strides is available (#669) · b55361b4
```
* [PASS] Allow compact checking when strides is available

* remove assert compact
```
  Tianqi Chen committed 7 years ago
  b55361b4 Browse Directory
24 Jul, 2017 1 commit
- [STORAGE][BUFFER] Support access ptr for clear access pattern. (#266) · 7e3d9da4
```
* [STORAGE][BUFFER] Support access ptr for clear access pattern.

* fix lint
```
  Tianqi Chen committed 7 years ago
  7e3d9da4 Browse Directory
06 Jul, 2017 3 commits
- [C API] Make DSL API registerable, add copy from/to raw bytes (#222) · 28120f55
```
* [C API] Make DSL API registerable, add copy from/to raw bytes

* fix cython
```
  Tianqi Chen committed 7 years ago
  28120f55 Browse Directory
- [CODEGEN/PASS] add restricted, alignment option (#221) · 0a19b16a
```
* [CODEGEN/PASS] add restricted, alignment option

* fix lint

* Fix the alloca
```
  Tianqi Chen committed 7 years ago
  0a19b16a Browse Directory
- [IR] Add body to AssertStmt (#220) · 00506a62
```
* [IR] Add body to AssertStmt

* fix lint
```
  Tianqi Chen committed 7 years ago
  00506a62 Browse Directory
05 Jul, 2017 1 commit
- [PASS/OP/REFACTOR] IRDeepCompare, isolate computeop part, allow fuzzy bind (#218) · 8a66ac23
  Tianqi Chen committed 7 years ago
  
  8a66ac23 Browse Directory
04 Jul, 2017 1 commit
- [REFACTOR/PASS] Formalize argument bind and match util (#214) · 4bb3c35a
```
* [REFACTOR/PASS] Formalize argument bind and match util

* grammar
```
  Tianqi Chen committed 7 years ago
  4bb3c35a Browse Directory