Commits · 1b053ec00fdf2f9d74a3757839de41b358517287 · wenyuanbo / tic

04 Nov, 2019 1 commit
- Fix typo in err msg (#4251) · 1b053ec0
  XFPlus committed 5 years ago
  
01 Nov, 2019 4 commits

[NODE][REFACTOR] Rename IRFunctor->NodeFunctor, use func pointer (#4247) · 9a3d2ec9

* [NODE][REFACTOR] Rename IRFunctor->NodeFunctor, use function pointer for dispatching.

Previously we used std::function for the functor dispatching.
It introduces additional overhead and problems during DLL destruction (of std::function).

This PR changes the std::function to function pointers.
This change adds a few restrictions around set_dispatch that we can work around,
but it improves general efficiency by removing one level of indirection in std::function.
We also no longer need special macros to register functions to the Functor.
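
As a rough illustration of the idea described in this message (not the actual TVM code), a dispatch table of plain function pointers indexed by a type index might look like the sketch below. The names `Node`, `NodeFunctorSketch`, and `PrintAdd` are hypothetical and exist only for this example.

```cpp
#include <cassert>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical stand-in for a node type; not TVM's actual Object/Node.
struct Node {
  uint32_t type_index;
};

// Hypothetical functor: a table of raw function pointers indexed by type.
// Calling through a raw pointer avoids the extra indirection of std::function
// and has no non-trivial destructor to run at library unload.
class NodeFunctorSketch {
 public:
  using FPointer = std::string (*)(const Node&);

  // Register a handler for a type index. Only plain functions and
  // captureless lambdas convert to FPointer; this is the restriction
  // that comes with dropping std::function.
  void set_dispatch(uint32_t type_index, FPointer f) {
    if (table_.size() <= type_index) table_.resize(type_index + 1, nullptr);
    table_[type_index] = f;
  }

  // True if a handler is registered for the node's type index.
  bool can_dispatch(const Node& n) const {
    return n.type_index < table_.size() && table_[n.type_index] != nullptr;
  }

  // Look up the handler by type index and call it.
  std::string operator()(const Node& n) const {
    assert(can_dispatch(n));
    return table_[n.type_index](n);
  }

 private:
  std::vector<FPointer> table_;
};

// Example handler: an ordinary function, registered without any macro.
std::string PrintAdd(const Node&) { return "add"; }
```

In this sketch, `f.set_dispatch(0, PrintAdd)` registers the handler directly, and a lambda that captures state would fail to compile because it cannot convert to a function pointer.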

committed 5 years ago

Implement explicit IR representation of memory allocation (#3560) · 2083513f
Jared Roesch committed 5 years ago

[Relay][Pass] Avoid FoldConstant folding some ops (#4245) · aa49e851
```
* [Relay][Pass] Avoid FoldConstant folding some ops

* rename
```
Wuwei Lin committed 5 years ago
[ARITH] Fix lowering of FloorMod (#4236) · bafc675c
Sergei Grechanik committed 5 years ago

31 Oct, 2019 1 commit
- [CUDA] Fix fp16 intrin, disable bad fp16 vecadd test for now (#4239) · ebfcd28c
  Tianqi Chen committed 5 years ago
  
30 Oct, 2019 3 commits

[Relay][Topi][TensorFlow][ONNX][Lang] Add support for Any op (#4205) · b07b1952
```
* Add support for Any op

* Support ONNX frontend

* Add doc

* Add to relay docs

* Dummy change to retrigger CI
```
Jon Soifer committed 5 years ago
[ARITH] Fix the rule y < x && x <= y (#4220) · fc020b87
Sergei Grechanik committed 5 years ago

Improve the lowering of Qnn Dense (#4213) · 2be444f9

* [QNN] Improving Dense lowering.

* - Moving get_shape method to util
- Finalizing the test cases and the code structure for optimized dense computation.

* - Fixing cpplint.

* - Addressing review comments.

* - Renaming the variables correctly.

* - Renaming the variables correctly.

committed 5 years ago

29 Oct, 2019 2 commits

Optimizing autotvm task extraction speed (#4138) · 2386e74b

* Optimize task extraction speed

* correct pylint errors

* Delete unused function

* remove unnecessary argument

* resolve code review comments

* correct cpp lint errors

* remove one more graph_json return value

* fix test bugs

committed 5 years ago

[Relay][Quantize] Use fixed point multiplications (#4160) · e8899285
Wuwei Lin committed 5 years ago

28 Oct, 2019 2 commits
- [Relay][Op] Enhance Upsample Operator to support float scales (#4206) · 8b1fb4d5
```
* :add scale2 for upsample

* update unit test for upsampling

* support latest upsample op for multiple frontend

* fix lint

* fix lint

* fix lint

* fix lint

* update scale description and rebase
```
  Xingyu Zhou committed 5 years ago
- [Relay] Setting Legalize opt_level to 1. (#4198) · 780a9945
  Animesh Jain committed 5 years ago
  
27 Oct, 2019 3 commits
- [RUNTIME] Separate runtime related contrib into runtime/contrib (#4207) · dcc6af53
  Tianqi Chen committed 5 years ago
  
- [Relay][Params] Add APIs for storing and retrieving parameters from individual functions. (#4194) · 9cc78741
```
* Add support for attaching params

* Fix types

* Fix test
```
  Jared Roesch committed 5 years ago
- [Relay][Training] Add checkpoint annotation for checkpointing memory optimization (#4146) · 93d610a1
```
* add checkpoint annotation for checkpointing memory optimization

* add alpha-equivalence checkpoint test and fix gradient type issue

* fix build issues

* ignore checkpoint annotation when checking missing gradients

* refactor, fix checkpoint compute for tuple and add tests
```
  Altan Haan committed 5 years ago
25 Oct, 2019 2 commits
- [hotfix] missing include headers (#4204) · 7732873e
  Zhi committed 5 years ago
  
- [Relay] crossentropy_with_logits and its gradient (#4075) · 1ad6a2af
```
* save

* lint
```
  雾雨魔理沙 committed 5 years ago
24 Oct, 2019 4 commits

[NODE][REFACTOR] Refactor reflection system in node. (#4189) · 78ca6fc8

* [NODE][REFACTOR] Refactor reflection system in node.

- Removed the old Node, Node is now just an alias of runtime::Object
- Introduce ReflectionVTable, a new columnar dispatcher to support reflection
  - This allows us to remove vtable from most node objects
  - The VisitAttrs are registered via TVM_REGISTER_NODE_TYPE,
    they are no longer virtual.
- Consolidated serialization and reflection features into node.

* Explicit type qualification when calling destructor.

* Fix SPIRV, more comments

committed 5 years ago

TensorCore Support using Intrinsic (#4136) · 324a9607

* add tensor core support

* avoid memory bank conflict

* fix thread sync & better performance

* better performance

* add schedule test for conv2d

* extend into BatchMatMul

* support config fragment shape and layout using intrinsic

* add TensorCore tutorial

* add int support and fix lint

* address comment

* add 32*16*8 TensorCore test

* fix wmma include logic

committed 5 years ago

[TOPI] Tunable Template for Conv2D HWCN on CUDA (#4168) · 4ab73634
```
* support conv2d HWCN in AutoTVM and Relay

* fix lint

* fix comments and unit tests
```
Cody Hao Yu committed 5 years ago
[Relay] Fix memory leak in the interpreter (#4155) · 2e0dbaa6
```
* save

lint

* address reviewer comment
```
雾雨魔理沙 committed 5 years ago

23 Oct, 2019 2 commits

[rpc] use callback func to do send & recv (#4147) · 5408d3a3

* [rpc] use callback func to do send & recv. don't get fd from sock as it is deprecated in java

* fix java build

* fix min/max macro define in windows

* keep the old rpc setup for py

* add doc for CallbackChannel

committed 5 years ago

[Pass] Remove dead code (#4177) · a7404230
Wei Chen committed 5 years ago

22 Oct, 2019 2 commits
- add missing gradient check to gradient pass (#4169) · c3f02c4b
  Altan Haan committed 5 years ago
  
- [relay][vm] Reuse allocated device memory (#4170) · 5a177070
  Zhi committed 5 years ago
  
21 Oct, 2019 3 commits

[Relay][Pass] Count MAC for BatchMatMul (#4157) · e0d286a1
```
* count MAC for BatchMatMul

* update doc
```
Haichen Shen committed 5 years ago

Add support for quantized multiply to Relay (#4141) · e5835425

This patch adds multiply operator for quantized tensors.
The details of the quantized multiplication are outlined
in the code.

This builds on pull request 3927 and includes the changes
Animesh mentions in the comments on that request.

Change-Id: I555715b53d0266a91d5c03dc3dfe8fc31e7ce4e1

committed 5 years ago

[REFACTOR][NODE][RUNTIME] Move Node to the new Object protocol. (#4161) · 7895adb2

* [REFACTOR][NODE][RUNTIME] Move Node to the new Object protocol.

This PR removes the original node system and makes Node a subclass of Object.
This is a major refactor towards a better unified runtime object system.

List of changes in the refactor:

- We now hide data_ field, use Downcast explicitly to get a sub-class object.
- Removed the node system FFI in python.
- Removed the node C API, instead use PackedFunc for list and get attrs.
- Change relay::Op::set_attr_type_key(attr_key_name) to relay::Op::set_attr_type<AttrType>().
  - This change was necessary because of the new Object registration mechanism.
  - Subsequent changes to the op registrations
  - The change revealed a few previous problems that are now fixed.
- Patched up a few missing node type registrations.
  - Now we will raise an error if we register object that is not registered.
- The original node.h and container.h are kept in the same location.
- Calling convention: kObjectHandle now equals the old kNodeHandle, kNodeHandle is removed.
- IRFunctor now dispatches on ObjectRef.
- Update to the new type checking API: is_type, derived_from are replaced by IsInstance.
- Removed .hash member function, instead use C++ convention hasher functors.

* Address review comments

committed 5 years ago

20 Oct, 2019 2 commits
- [Runtime] Enable option to use OpenMP thread pool (#4089) · 97ea31c8
  Haichen Shen committed 5 years ago
  
- [Refactor] Rename Datatype to ADT (#4156) · 32aad56c
```
We think it will reduce the confusion with the meaning.

https://discuss.tvm.ai/t/discuss-consider-rename-vm-datatype/4339
```
  Wei Chen committed 5 years ago
18 Oct, 2019 3 commits

Add lift_if_then_else pass (#3865) · 687d4a83

* Add LiftIfThenElse pass

* Add more comments

* Rename and refactor

* Add description for internal data structure

* Rename a test

* Minor change

* Address comments

* Improve update_for

committed 5 years ago

Fix typo (#4144) · fe418ecd
Gus Smith committed 5 years ago

[Relay][Frontend][TF] Add tensor array ops (#3798) · 36a96773

* [Relay][Frontend][TF] Add tensor array ops

* rename

* delete test

* Move utility function

* Refactor

* fix tensor array ops

* fix test

* fix rebase

* Fix serializer bug

* Improve tf convert name lookup to use prelude api

* Fix lint

* Fix test

committed 5 years ago

17 Oct, 2019 3 commits

[relay][vm] Separate VM runtime with executable (#4100) · 4052de6d

* [relay][vm] Separate VM runtime with executable

* Address comments

* move ctx back to vm

* make only vm related fields and methods protected

* integrate serialization/deserialization to executable

* create stream

committed 5 years ago

[PATCH] Fix undefined __floatdihf in libtvmruntime.so on aarch64. (#4119) · cf046972

The Arm architecture provides optional FP16 floating point support in two alternative formats: IEEE and an alternative Arm format.

The ACLE (Arm C Language Extension) defined preprocessor symbol __ARM_FP16_FORMAT_IEEE can be used to distinguish between implementations providing IEEE and the Arm alternative format, but cannot, on its own, be used to determine whether FP16 HW support is actually present.

Testing this preprocessor symbol can lead to undefined __floatdihf at runtime on an aarch64 target where no FP16 HW is present.

The relevant preprocessor symbol to determine whether FP16 HW support is present in the target is __ARM_FEATURE_FP16_SCALAR_ARITHMETIC; this symbol implies __ARM_FP16_FORMAT_IEEE.

The relevant preprocessor symbols are defined by the ACLE standard, section 5.5.21 16-bit floating-point data processing operations, https://static.docs.arm.com/101028/0008/Q2-ACLE_2019Q2_release-0008.pdf
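
The distinction can be sketched with a compile-time check. This is a minimal illustration, not the patch itself: the constants `kHasIeeeFp16Format` and `kHasFp16ScalarArithmetic` and the function `Fp16Path` are hypothetical names, and only the two ACLE macros come from the message above.

```cpp
// Set when the compiler targets the IEEE half-precision *format*.
// On its own this says nothing about FP16 arithmetic hardware.
constexpr bool kHasIeeeFp16Format =
#if defined(__ARM_FP16_FORMAT_IEEE)
    true;
#else
    false;
#endif

// Set when the target actually provides FP16 scalar arithmetic instructions.
constexpr bool kHasFp16ScalarArithmetic =
#if defined(__ARM_FEATURE_FP16_SCALAR_ARITHMETIC)
    true;
#else
    false;
#endif

// Hypothetical helper: choose the code path from the hardware-feature macro,
// not from the format macro, so that no FP16 helper symbols are referenced
// on targets without FP16 hardware.
inline const char* Fp16Path() {
  return kHasFp16ScalarArithmetic ? "hardware" : "software-fallback";
}
```

On a target without FP16 hardware, `Fp16Path()` returns `"software-fallback"` in this sketch even when `kHasIeeeFp16Format` is true.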

committed 5 years ago

[Relay] Improve build error when no lowered funcs are produced (#4132) · 3185e4ad
```
* Improve build error when no lowered funcs

* Switch from fatal to warning
```
Logan Weber committed 5 years ago

16 Oct, 2019 3 commits

[RUNTIME] Refactor object python FFI to new protocol. (#4128) · 02c1e117

* [RUNTIME] Refactor object python FFI to new protocol.

This is a pre-req to bring the Node system under object protocol.
Most of the code reflects the current code in the Node system.

- Use new instead of init so subclass can define their own constructors
- Allow registration via name, besides type index
- Introduce necessary runtime C API functions
- Refactored Tensor and Datatype to directly use constructor.

* address review comments

committed 5 years ago

Adding support for dequantizing from int32 to float32. (#4130) · c1069108
shoubhik committed 5 years ago

[QNN] Change default rounding to UPWARD. (#4131) · 1c0e7435
Animesh Jain committed 5 years ago
