Commits · 77bdd5f78a09854335f6fd8e4fb7bc7d45e2756f · wenyuanbo / tic

03 Dec, 2019 2 commits
- [MEMORY] Fix gcc 4.8 compat (#4461) · 119c5c9c
  Tianqi Chen committed 5 years ago
  
- Fix MSVC build error with container.h (#4455) · 239d4371
  jmorrill committed 5 years ago
  
01 Dec, 2019 1 commit
- [Runtime] Make ADTObject POD container type (#4346) · 2bf5fd2b
  Wei Chen committed 5 years ago
  
26 Nov, 2019 1 commit
- Allow Array/Map store objects that are not NodeRef (#4430) · e35ecae8
  Junru Shao committed 5 years ago
  
25 Nov, 2019 1 commit

[Perf] Enhance cudnn and cublas backend and enable TensorCore (#4353) · dabde40f

* add half and mix precision support to cublas backend

* add TensorCore support in CuDNN

* enhance CuDNN support

* address comments and fix lint

* fix

* add fp16 test

committed 5 years ago


24 Nov, 2019 4 commits
- [RUNTIME] rename allocator.make -> allocator.make_object for term consistency (#4416) · fbb2a356
  Tianqi Chen committed 5 years ago
  
- [LICENSE] clarify the blockingqueue license, update version to 0.6.0 (#4414) · 2f1685fe
  Tianqi Chen committed 5 years ago
  
- [LINT] Remove unnecessary copyright message for files with ASF header (#4409) · c8772288
```
* [LINT] Improve the check tool to handle ASF copyright message.

* [LINT] Remove unnecessary copyright message as per ASF requirement.

* Fix codegen hybrid

* [LINT] Broaden license checks to include html, xml

* [LINT] Fix rest of the files

* Fix notice

* [LINT] Improve check file type error message
```
  Tianqi Chen committed 5 years ago
- [Release] resolve license issues (#4408) · 8ba1d7d1
  Yizhi Liu committed 5 years ago
  
23 Nov, 2019 1 commit
- [RUNTIME] Move module export to the function level. (#4405) · 87bd799e
  Tianqi Chen committed 5 years ago
  
22 Nov, 2019 2 commits
- [TVM][RUNTIME] A minimum example to generate external library wrappers for DSOModule (#4280) · e0810512
  Zhi committed 5 years ago
  
- [Relay][VM] Clean up the VM and VM profiler code (#4391) · 122a4930
```
* [VM] add a few more API to vm

* [VM][Fix] fix vm convert args

* [VM] a few fixes

* rename fields

* update

* update vm profiler

* x

* add doc

* lint

* fix test

* address comments
```
  Haichen Shen committed 5 years ago
19 Nov, 2019 1 commit
- [Relay tests] AlterOpLayout - Temporary attr update (#4357) · 26eb4053
  Animesh Jain committed 5 years ago
  
16 Nov, 2019 1 commit

Retain qnn input kernel scales (#4292) · 3ba9dd09

* Add qnn conv2d attributes for input_tensor_scale and
kernel_tensor_scale.

The lowering in the tflite frontend loses the input_tensor_scale
and the kernel_tensor_scale by multiplying it and putting it into
the Requantize operation. This means that any graph partitioning
passes or other passes that need to access this information no longer
have it available in the qnn dialect.


* Store input tensor scale and Weight tensor scale for Dense as well

As for conv2d, the tflite frontend drops the input tensor
scale and the weight tensor scale from the relay op. Store
it as separate fields in there.

* Fix unintentional tab

* Rename input_tensor_scale to input_scale and kernel_tensor_scale
to kernel_scale for conv2d.

* input_tensor_scale -> input_scale weight_tensor_scale->weight_scale

* Rework dense testcase

And use input_scale and kernel_scale

* Be consistent in use of input_scale and kernel_scale values

* Fixup qnn conv2d tests for input_scale and kernel_scale

* Make pydoc identical between conv2d and dense for weight_tensor

* Fix up conv2d parameters to be in the same order between C++ and python

* Fix ordering of parameters for dense.

* Add input_scale and output_scale to try and satisfy ci gods

* Delete input_scale and kernel_scale.

nn.conv2d does not contain input_scale and kernel_scale. We need
to delete it when lowering it to nn.conv2d.

* Add input_scale and kernel_scale for qnn.conv2d
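
The first bullet's point, that the two scales are multiplied together and folded into the Requantize step, can be illustrated with plain arithmetic. This is a minimal sketch rather than TVM code: the helper names `quantize` and `folded_accumulator_scale` and all numeric values are assumptions made for illustration, and it assumes symmetric quantization with zero points of 0.

```python
def quantize(real_value, scale):
    """Map a real value to an integer, assuming a zero point of 0."""
    return round(real_value / scale)


def folded_accumulator_scale(input_scale, kernel_scale):
    """Scale of the integer accumulator after multiplying two quantized values.

    If only this product is kept, the two original factors are no longer
    recoverable by later passes.
    """
    return input_scale * kernel_scale


# Illustrative values only (hypothetical, not taken from any model).
input_scale, kernel_scale = 0.5, 0.25

x_q = quantize(3.0, input_scale)    # quantized input
w_q = quantize(1.5, kernel_scale)   # quantized kernel weight
acc = x_q * w_q                     # integer accumulator

acc_scale = folded_accumulator_scale(input_scale, kernel_scale)
real_result = acc * acc_scale       # dequantized product, equals 3.0 * 1.5
```

Keeping `input_scale` and `kernel_scale` as separate attributes, as the commit does, preserves information that `acc_scale` alone cannot give back.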

committed 5 years ago


15 Nov, 2019 3 commits
- [Relay][VM][Interpreter] Enable first-class constructors in VM and interpreter… · 2c5c4da6
```
[Relay][VM][Interpreter] Enable first-class constructors in VM and interpreter via eta expansion (#4218)

* Fix constructor pretty printing

* Make Module::HasDef name consistent with API

* Add VM constructor compilation via eta expansion

* Lint

* Fix CI

* Fix failing test

* Address comment

* Retrigger CI

* Retrigger CI
```
  Logan Weber committed 5 years ago
- [CodeGen] Add build config option disable_assert to control whether to generate assert (#4340) · b0b16a07
  Zhao Wu committed 5 years ago
  
- [RUNTIME] Add device query for AMD GcnArch (#4341) · 0235d283
```
* add gcnArch query

* kGcnArch query for cuda is a no-op
```
  Peter Yeh committed 5 years ago
11 Nov, 2019 2 commits

[RUNTIME][REFACTOR] Use object protocol to support runtime::Module (#4289) · f823c577

Previously runtime::Module was supported using shared_ptr.
This PR refactors the codebase to use the Object protocol.

It will open doors to allow easier interoperation between
Object containers and module in the future.

committed 5 years ago


[tutorial] Relay pass infra tutorial (#4083) · cff62bdb

* Add pass manager tutorial

* fix some examples

* retrigger ci

* Update tutorials/dev/relay_pass_infra.py

Co-Authored-By: 雾雨魔理沙 <lolisa@marisa.moe>

* Add ToANormalForm link

committed 5 years ago


09 Nov, 2019 1 commit

Auto TensorCore CodeGen (#4234) · d64bf6b5

* Add Auto TensorCore Unit Test

* Rebase to tvm master branch & Add auto tensor core

* Code Refine

* Add tensor core switch by pragma

* Add pragma in tensor core example code

* Get real tile size to replace hard coded 16

* support more than 2 dimensions (e.g. batchmatmul) for buffer bind scope

* support batch matmul

* Move cuda env check to tensor_core.cc

* Code refine for tensor_core.cc

* Refine comments

* Some refinements of code and comment

* Update TensorCore UT to pass the CPU test

* remove redundant code

* matmul's storage align for different layout

* Add support for different positions of type cast

* Add formal tutorial for auto tensorcore codegen

* move tensorcore check up to tutorial code

* code and doc refine

* comment out tune_and_evaluate in tutorial

* fix cpplint error

committed 5 years ago


01 Nov, 2019 2 commits

[NODE][REFACTOR] Rename IRFunctor->NodeFunctor, use func pointer (#4247) · 9a3d2ec9

* [NODE][REFACTOR] Rename IRFunctor->NodeFunctor, use function pointer for dispatching.

Previously we used std::function for the functor dispatching.
It introduces additional overhead and causes problems during DLL destruction (of std::function).

This PR changes the std::function to function pointers.
This change adds a few restrictions around set_dispatch that we can work around,
but it improves general efficiency by removing one level of indirection in the std::function.
We also no longer need special macros to register functions to the Functor.
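
The dispatch idea described above can be sketched as a flat table indexed by a per-type integer. This is an illustration in Python, not the C++ from the PR: `TypeIndexFunctor`, the `_type_index` attribute, and the `Add`/`Const` node classes are hypothetical names, and plain Python functions stand in for C++ function pointers.

```python
class TypeIndexFunctor:
    """Dispatch on a node's integer type index via a flat table."""

    def __init__(self):
        self._table = []  # slot i holds the handler for type index i

    def set_dispatch(self, node_cls, func):
        """Register a plain function as the handler for node_cls."""
        index = node_cls._type_index
        if index >= len(self._table):
            # Grow the table so that slot `index` exists.
            self._table.extend([None] * (index + 1 - len(self._table)))
        if self._table[index] is not None:
            raise ValueError(f"dispatch for {node_cls.__name__} is already set")
        self._table[index] = func
        return self  # allow chained registration

    def __call__(self, node, *args):
        index = type(node)._type_index
        func = self._table[index] if index < len(self._table) else None
        if func is None:
            raise TypeError(f"no dispatch registered for {type(node).__name__}")
        return func(node, *args)


class Add:
    _type_index = 0  # hypothetical type index

    def __init__(self, a, b):
        self.a, self.b = a, b


class Const:
    _type_index = 1  # hypothetical type index

    def __init__(self, value):
        self.value = value


def print_add(node):
    return f"({to_text(node.a)} + {to_text(node.b)})"


def print_const(node):
    return str(node.value)


to_text = TypeIndexFunctor()
to_text.set_dispatch(Add, print_add).set_dispatch(Const, print_const)
result = to_text(Add(Const(1), Const(2)))
```

Each call costs one table lookup plus one direct call, which is the kind of saving the commit message attributes to dropping the extra indirection of std::function.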

committed 5 years ago


Implement explicit IR representation of memory allocation (#3560) · 2083513f
Jared Roesch committed 5 years ago


30 Oct, 2019 3 commits

[Relay][Topi][TensorFlow][ONNX][Lang] Add support for Any op (#4205) · b07b1952
```
* Add support for Any op

* Support ONNX frontend

* Add doc

* Add to relay docs

* Dummy change to retrigger CI
```
Jon Soifer committed 5 years ago

Improve the lowering of Qnn Dense (#4213) · 2be444f9

* [QNN] Improving Dense lowering.

* - Moving get_shape method to util
- Finalizing the test cases and the code structure for optimized dense computation.

* - Fixing cpplint.

* - Addressing review comments.

* - Renaming the variables correctly.

* - Renaming the variables correctly.

committed 5 years ago


Fix typo in packed_func.h (#4219) · 50e4aa0d
Bohan Hou committed 5 years ago


28 Oct, 2019 1 commit

[Relay][Op] Enhance Upsample Operator to support float scales (#4206) · 8b1fb4d5

* add scale2 for upsample

* update unit test for upsampling

* support latest upsample op for multiple frontend

* fix lint

* fix lint

* fix lint

* fix lint

* update scale description and rebase

committed 5 years ago


27 Oct, 2019 1 commit
- [Relay][Params] Add APIs for storing and retrieving parameters from individual functions. (#4194) · 9cc78741
```
* Add support for attaching params

* Fix types

* Fix test
```
  Jared Roesch committed 5 years ago
24 Oct, 2019 4 commits

hotfix the ci (#4199) · decccd6c
Tianqi Chen committed 5 years ago


[NODE][REFACTOR] Refactor reflection system in node. (#4189) · 78ca6fc8

* [NODE][REFACTOR] Refactor reflection system in node.

- Removed the old Node, Node is now just an alias of runtime::Object
- Introduce ReflectionVTable, a new columnar dispatcher to support reflection
  - This allows us to remove vtable from most node objects
  - The VisitAttrs are registered via TVM_REGISTER_NODE_TYPE,
    they are no longer virtual.
- Consolidated serialization and reflection features into node.

* Explicit type qualification when calling destructor.

* Fix SPIRV, more comments

committed 5 years ago


TensorCore Support using Intrinsic (#4136) · 324a9607

* add tensor core support

* avoid memory bank conflict

* fix thread sync & better performance

* better performance

* add schedule test for conv2d

* extend into BatchMatMul

* support config fragment shape and layout using intrinsic

* add TensorCore tutorial

* add int support and fix lint

* address comment

* add 32*16*8 TensorCore test

* fix wmma include logic

committed 5 years ago


[Relay] Fix memory leak in the interpreter (#4155) · 2e0dbaa6
```
* save

lint

* address reviewer comment
```
雾雨魔理沙 committed 5 years ago

22 Oct, 2019 1 commit
- [relay][vm] Reuse allocated device memory (#4170) · 5a177070
  Zhi committed 5 years ago
  
21 Oct, 2019 1 commit

[REFACTOR][NODE][RUNTIME] Move Node to the new Object protocol. (#4161) · 7895adb2

* [REFACTOR][NODE][RUNTIME] Move Node to the new Object protocol.

This PR removes the original node system and makes Node a subclass of Object.
This is a major refactor towards a better unified runtime object system.

List of changes in the refactor:

- We now hide data_ field, use Downcast explicitly to get a sub-class object.
- Removed the node system FFI in python.
- Removed the node C API, instead use PackedFunc for list and get attrs.
- Change relay::Op::set_attr_type_key(attr_key_name) to relay::Op::set_attr_type<AttrType>().
  - This change was necessary because of the new Object registration mechanism.
  - Subsequent changes to the op registrations
  - The change revealed a few previous problems that are now fixed.
- Patched up a few missing node type registrations.
  - Now we will raise an error if we register object that is not registered.
- The original node.h and container.h are kept in the same location.
- Calling convention: kObjectHandle now equals the old kNodeHandle, kNodeHandle is removed.
- IRFunctor now dispatches on ObjectRef.
- Update to the new type checking API: is_type, derived_from are replaced by IsInstance.
- Removed .hash member function, instead use C++ convention hasher functors.

* Address review comments

committed 5 years ago


20 Oct, 2019 1 commit

[Refactor] Rename Datatype to ADT (#4156) · 32aad56c

We think it will reduce the confusion with the meaning.

https://discuss.tvm.ai/t/discuss-consider-rename-vm-datatype/4339

committed 5 years ago


18 Oct, 2019 1 commit

Add lift_if_then_else pass (#3865) · 687d4a83

* Add LiftIfThenElse pass

* Add more comments

* Rename and refactor

* Add description for internal data structure

* Rename a test

* Minor change

* Address comments

* Improve update_for

committed 5 years ago


17 Oct, 2019 1 commit

[relay][vm] Separate VM runtime with executable (#4100) · 4052de6d

* [relay][vm] Separate VM runtime with executable

* Address comments

* move ctx back to vm

* make only vm related fields and methods protected

* integrate serialization/deserialization to executable

* create stream

committed 5 years ago


16 Oct, 2019 2 commits

[RUNTIME] Refactor object python FFI to new protocol. (#4128) · 02c1e117

* [RUNTIME] Refactor object python FFI to new protocol.

This is a pre-req to bring the Node system under object protocol.
Most of the code reflects the current code in the Node system.

- Use new instead of init so subclasses can define their own constructors
- Allow registration via name, besides type index
- Introduce necessary runtime C API functions
- Refactored Tensor and Datatype to directly use constructor.

* address review comments
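
The "use new instead of init" bullet can be sketched as follows. This is a minimal illustration, not the FFI code from the PR: `ObjectBase`, `from_handle`, `Tensor`, and the integer handle are hypothetical. The assumption is that a binding layer creates wrapper objects with `__new__`, bypassing `__init__`, so each subclass remains free to give `__init__` whatever signature suits its users.

```python
class ObjectBase:
    """Hypothetical wrapper around an opaque runtime handle."""

    @classmethod
    def from_handle(cls, handle):
        # Allocate the wrapper without running __init__, then attach the handle.
        obj = cls.__new__(cls)
        obj.handle = handle
        return obj


class Tensor(ObjectBase):
    def __init__(self, shape):
        # User-facing constructor with its own signature.
        self.shape = tuple(shape)
        self.handle = None  # stand-in; a real binding would allocate here


# Path 1: a user calls the subclass constructor directly.
user_made = Tensor([2, 3])

# Path 2: the binding layer wraps an existing handle; __init__ is not called,
# so no `shape` argument is required.
wrapped = Tensor.from_handle(42)
```

If `from_handle` called `cls(handle)` instead, every subclass would be forced to accept a handle as its only constructor argument.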

committed 5 years ago


[QNN] Change default rounding to UPWARD. (#4131) · 1c0e7435
Animesh Jain committed 5 years ago


15 Oct, 2019 1 commit

[RFC][RUNTIME] Introduce new object protocol. (#4115) · a0bd3786

* [RUNTIME] Introduce new object protocol.

This PR introduces a new object protocol to unify the node and object.
We also updated the existing runtime::vm code to make use of the new system.

Update to the node will be done in a follow up PR.

Other changes:

- Remove object related code in json serializer as that code logic was not complete
  and we have a separate serializer for VM, can revisit later.

* address review comment

* Fix the child slot logic

committed 5 years ago


11 Oct, 2019 1 commit

[tvm][any] broadcast with values other than one (#3967) · 9d5cba20

* [tvm][any] broadcast with values other than 1

* Add test for incompatible runtime values

* Remove hybrid script compact buffer binding

* retrigger ci

committed 5 years ago
