Commits · 6798ba80d288e7c6132b30606b3cb70812579fe8 · wenyuanbo / tic

29 Jan, 2020 1 commit

[AUTOTVM] Fix a bug in generating the search space (#4779) · 1b8522e4

- Do not use numpy.prod, which silently ignores 64-bit integer overflow.
  The overflow leads to an incorrect number of points in the search space.

committed 5 years ago
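
The overflow described in this commit can be illustrated with a minimal sketch. It deliberately does not call numpy: `wrapping_prod_int64` is a hypothetical helper that emulates a fixed-width signed 64-bit accumulator (the assumed failure mode), and the dimension sizes are made up for illustration.

```python
from functools import reduce
import operator


def to_int64(x):
    """Truncate an integer to signed 64-bit two's complement."""
    x &= (1 << 64) - 1
    return x - (1 << 64) if x >= (1 << 63) else x


def wrapping_prod_int64(dims):
    """Multiply dims, wrapping to 64 bits after every step (emulated)."""
    acc = 1
    for d in dims:
        acc = to_int64(acc * d)
    return acc


def exact_prod(dims):
    """Multiply dims with Python's arbitrary-precision integers."""
    return reduce(operator.mul, dims, 1)


# Hypothetical search space: four knobs with 2**16 choices each.
dims = [2**16] * 4
print("wrapped:", wrapping_prod_int64(dims))
print("exact:  ", exact_prod(dims))
```

With these sizes the exact count is 2**64, while the wrapped product comes out as 0, so a search space sized with the wrapped value would appear empty.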


28 Jan, 2020 1 commit
- Safe remove tmpdir (#4781) · d54036a9
  Cody Yu committed 5 years ago
  
15 Jan, 2020 2 commits

Revert "[Relay][TOPI]Fix meaning of conv2d_transpose output_padding parameter (#4318)" (#4708) · 81e03ee7
```
This reverts commit dcf7fbf1.
```
Haichen Shen committed 5 years ago

[REFACTOR][IR] Unify IntImm and UIntImm (#4706) · ce807fe8

* [REFACTOR][IR] Unify IntImm and UIntImm

This PR unifies UIntImm and IntImm to simplify the codebase.
Unsigned integer constants will also be stored as IntImm.

For a uint constant that does not fit into int64 (a rare case), we introduce
an intrinsic, tvm_big_uint_imm, that constructs such an integer from its
lower and higher 32 bits.

* [REFACTOR][IR] Remove UIntImm to use IntImm

* rename big->large

committed 5 years ago
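
The "lower and higher 32 bits" construction mentioned in this commit can be sketched in plain Python. This is only an illustration of the arithmetic, not TVM's intrinsic: `split_u64` and `join_u64` are hypothetical helper names.

```python
MASK32 = 0xFFFFFFFF
INT64_MAX = 2**63 - 1


def split_u64(value):
    """Split an unsigned 64-bit value into (low, high) 32-bit halves."""
    if not 0 <= value < 2**64:
        raise ValueError("value does not fit into uint64")
    return value & MASK32, value >> 32


def join_u64(low, high):
    """Rebuild the unsigned 64-bit value from its two 32-bit halves."""
    return (high << 32) | low


# A uint64 constant that is too large for int64.
value = 2**63 + 5
assert value > INT64_MAX
low, high = split_u64(value)
assert join_u64(low, high) == value
```

Each half fits comfortably in a signed 64-bit integer, which is why two such halves are enough to carry a constant that a single int64 cannot hold.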


11 Jan, 2020 1 commit

[Relay][TOPI]Fix meaning of conv2d_transpose output_padding parameter (#4318) · dcf7fbf1

* Add output_padding to generic

* Add output_padding to the reference impl

* Add output_padding to arm_cpu

* Add output_padding to the test

* Add output_padding for cuda

* Add output_padding for x86

* Make use of the new output_padding argument in Relay

* Adjust conv2d_transpose Relay test

* Fix lint errors

* Fix the VTA declaration of conv2d_transpose

* support for output padding in conv2d transpose

* some output padding will break IR pass

* Fix new conv2d_transpose test

* Update tophub

* Fix conv1d output_padding too.

* Fix the conv1d_transpose reference function.

* Fix the cuda impl

* fix the topi test for conv1d

* Update the versions in tophub.py

Co-authored-by: Thierry Moreau <tmoreau@octoml.ai>

committed 5 years ago


10 Jan, 2020 1 commit
- download fallback config file for search from tophub if it does not exist (#4671) · 06ce76b6
  Xingyu Zhou committed 5 years ago
  
09 Jan, 2020 1 commit

[Autotvm] Use VM compile to extract autotvm tasks (#4328) · baae28b2

* [AutoTVM] Use vm compile in extracting task from relay

* update

* restructure vm compiler to reduce task extraction time

* x

* fix

* update doc

* update doc

* lint

committed 5 years ago


27 Dec, 2019 1 commit
- [autotvm] fix typos in comment (#4591) · e6d9f89c
  Wang Yucheng committed 5 years ago
  
26 Dec, 2019 1 commit

[TOPI][AutoTVM] NHWC conv2d templates for ARM (#3859) · 672b0909

* [AutoTVM][TOPI] NHWC conv2d templates (spatial pack) for ARM

As some frontends (tflite for example) are using NHWC as the default
layout, we are enabling NHWC schedule templates in TOPI and AutoTVM.

* some comments fix

committed 5 years ago


22 Dec, 2019 1 commit
- [TEST] Remove nnvm related code in topi and test script (#4562) · e6ff3f70
```
* [TEST] Remove nnvm related code in topi and test script

* Remove docs dep
```
  Tianqi Chen committed 5 years ago
18 Dec, 2019 1 commit
- Implement 1d deconvolution (#4476) · d430fbb5
  Alex Gladkov committed 5 years ago
  
16 Dec, 2019 1 commit
- fix KeyError caused by empty config (#4520) · 8e3b5d39
  Cody Yu committed 5 years ago
  
26 Nov, 2019 1 commit

[AutoTVM] select model with the most tuned schedules (#4404) · accc7db8

* select model with the most tuned schedules

* change detect empty map method

* modify model description for load_reference_log

committed 5 years ago
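
The selection rule named in this commit title (pick the model with the most tuned schedules) can be sketched as follows. This is an assumption-laden illustration, not the code from the commit: `pick_model`, the record layout, and the model names are all hypothetical.

```python
from collections import Counter


def pick_model(records):
    """Return the model that has the most tuning records.

    records: iterable of (model, workload) pairs. Returns None when
    there are no records; ties go to the alphabetically first model.
    """
    counts = Counter(model for model, _ in records)
    if not counts:
        return None
    return max(sorted(counts), key=counts.__getitem__)


# Made-up reference log entries for two hypothetical device models.
records = [
    ("board_a", "conv2d_1"),
    ("board_b", "conv2d_1"),
    ("board_b", "conv2d_2"),
]
print(pick_model(records))
```

Handling the empty case explicitly mirrors the "detect empty map" concern in the commit message, although how the real code does this is not shown here.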


21 Nov, 2019 1 commit

add GPU checking before compilation for rocm (#4394) · 786d7998

Previously, we would rely on the later phases to error out
(often for using too much shared memory). This enables the
checks on the IR that already exist for CUDA and OpenCL also
for ROCm.

committed 5 years ago
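
The idea of rejecting a kernel before compilation, rather than waiting for a later phase to fail, can be sketched as below. Everything here is hypothetical: the class names, the `violations` helper, and the limit values are illustrative assumptions and do not reflect TVM's verification pass or any real device.

```python
from dataclasses import dataclass


@dataclass
class DeviceLimits:
    max_shared_memory_per_block: int  # bytes
    max_threads_per_block: int


@dataclass
class KernelStats:
    shared_memory_bytes: int
    threads_per_block: int


def violations(kernel, limits):
    """List the resource limits the kernel exceeds (empty if it is valid)."""
    problems = []
    if kernel.shared_memory_bytes > limits.max_shared_memory_per_block:
        problems.append("shared memory")
    if kernel.threads_per_block > limits.max_threads_per_block:
        problems.append("threads per block")
    return problems


# Illustrative limits only; real values depend on the target GPU.
limits = DeviceLimits(max_shared_memory_per_block=48 * 1024,
                      max_threads_per_block=1024)
candidate = KernelStats(shared_memory_bytes=64 * 1024, threads_per_block=256)
if violations(candidate, limits):
    print("skip compilation:", violations(candidate, limits))
```

Checking early is cheap compared with compiling, which matters when a tuner evaluates many candidate schedules.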


19 Nov, 2019 1 commit
- [nvcc] enable multiple arch in one fatbin (#4377) · f8f4ceb2
  Yizhi Liu committed 5 years ago
  
16 Nov, 2019 1 commit

AutoTVM: selecting tuning templates when extracting task (#4338) · ccde31f1

* AutoTVM: selecting tuning templates when extracting task

Make the procedure of trying new templates easier.

Test: tests/python/relay/test_autotvm_task_extraction.py

* Use dict to match key for topi ops

* fix lint issue

* be more pythonic :)

committed 5 years ago


15 Nov, 2019 1 commit
- Bump up CUDA log version in tophub.py (#4347) · 888a3c35
  Alex Gladkov committed 5 years ago
  
11 Nov, 2019 1 commit

Add More Shape Functions (#4179) · 62521453

* Add shape functions

* Fix get_const_tuple

* Fix cpplint

* Fix pylint

* Fix pylint

* rebase and fix

* Check Any for infer type

* Fix expand_dim shape func for zero rank input

* Fix pooling infer type

* Address comment

* Register layout transform attr

committed 5 years ago


07 Nov, 2019 1 commit

[AutoTVM] Add batch_matmul to tunable operations (#4242) · 14a5a358

* Batch matmul tuning running but with errors.

* Default x86 schedule as good as before.

* Code Cleanup

* Remove unused argument.

* improved template documentation.

* Silly lint fix

* Removed leftover comment.

* Moved cfg declaration to schedule for batch_matmul

* Moved x86 dense cfg declaration to schedule.

* lint fix

* Removed duplicate cfg declaration in dense.

* Reverted changes to dense.

committed 5 years ago


29 Oct, 2019 1 commit

Optimizing autotvm task extraction speed (#4138) · 2386e74b

* Optimize task extraction speed

* correct pylint errors

* Delete unused function

* remove unnecessary argument

* resolve code review comments

* correct cpp lint errors

* remove one more graph_json return value

* fix test bugs

committed 5 years ago


24 Oct, 2019 1 commit
- [TOPI] Tunable Template for Conv2D HWCN on CUDA (#4168) · 4ab73634
```
* support conv2d HWCN in AutoTVM and Relay

* fix lint

* fix comments and unit tests
```
  Cody Hao Yu committed 5 years ago
22 Oct, 2019 1 commit
- merge extract_from_program and extract_from_multiple_program (#4173) · a21904a5
  Cody Hao Yu committed 5 years ago
  
03 Oct, 2019 1 commit
- [Relay][TopHub] Add switch to disable TopHub download (#4015) · 81118023
  Jon Soifer committed 5 years ago
  
01 Oct, 2019 1 commit
- Fix split's last factor issue (#4044) · 2d537621
  Cody Hao Yu committed 5 years ago
  
28 Sep, 2019 1 commit
- [ARITH] cleanup the indexmod/div on python side (#4028) · f98035b0
  Tianqi Chen committed 5 years ago
  
18 Sep, 2019 1 commit
- [TVM][AutoTVM] cast filepath arguments to string (#3968) · f3abb3d8
  Neo Chien committed 5 years ago
  
16 Sep, 2019 3 commits

[TOPI] Setting up AutoTVM template for Intel Int8 conv2D (#3955) · 3edf5260
Animesh Jain committed 5 years ago


[TOPI] Improve conv2d_transpose schedule on X86 and CUDA (#3948) · c846d17c

* improve conv2d_transpose x86 performance by reusing conv2d schedule

* parallelize across batches to make large-batch conv2d and conv2d_transpose faster

* improve doc for autotvm.task.space.FallbackConfigEntity.fallback_with_reference_log

* add fallback schedule for schedule_conv2d_transpose_nchw_cuda

* fix pylint

* fix pylint

* unify conv2d_transpose declaration in topi.nn and topi.x86

committed 5 years ago


[Graph Tuner] Fix benchmark layout in graph tuner (#3926) · b577171d
```
* Fix graph tuner benchmarking layout transform

* Add test
```
Yao Wang committed 5 years ago

15 Sep, 2019 1 commit

[AutoTVM] Enhance tuning space of split (#3949) · da039794

* Refine policies for define_split

- Rename policy "all" to "factors"
- Add policy "verbose" and "power2"

* Refine search space

* add doc

committed 5 years ago
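
The three policy names in this commit can be illustrated with a small sketch. The semantics assumed here are: "factors" yields every divisor of the axis extent, "power2" yields every power of two not exceeding it, and "verbose" yields the union of both. The helper names are hypothetical and this is not the code from the commit.

```python
from math import isqrt


def get_factors(n):
    """All positive divisors of n, in ascending order."""
    return sorted(f for i in range(1, isqrt(n) + 1) if n % i == 0
                  for f in {i, n // i})


def get_pow2s(n):
    """All powers of two that are <= n, in ascending order."""
    return [2**k for k in range(n.bit_length()) if 2**k <= n]


def candidates(extent, policy):
    """Candidate split sizes for an axis of the given extent."""
    if policy == "factors":
        return get_factors(extent)
    if policy == "power2":
        return get_pow2s(extent)
    if policy == "verbose":
        return sorted(set(get_factors(extent)) | set(get_pow2s(extent)))
    raise ValueError(f"unknown policy: {policy}")


for policy in ("factors", "power2", "verbose"):
    print(policy, candidates(12, policy))
```

For an extent of 12, "power2" adds 8, which does not divide 12, so "verbose" gives the tuner a strictly larger set of candidates than "factors" alone.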


07 Sep, 2019 2 commits

Fix a typo (#3913) · 6604593b
noituIover committed 5 years ago


[TOPI] Intel graphics conv2d autotvm template added (#3839) · 70042b78

* update lint

* lint fixed

* lint updated

* lint fixed

* lint fixed

* lint fixed

* updates

* add intel graphics as a package

* remove print info

* depthwise conv2d schedule added for intel graphics

* asdf

* fix lint

* fix lint

* fix ci

* add channels

committed 5 years ago


05 Sep, 2019 1 commit
- Fix int32 range overflow by using int64 (#3870) · 98c99805
  kice committed 5 years ago
  
28 Aug, 2019 1 commit
- [AutoTVM] Fix database APIs (#3821) · 062f8cc4
```
* [AutoTVM] Fix database APIs

* Refactor the byte conversion
```
  Cody Hao Yu committed 5 years ago
11 Aug, 2019 2 commits
- Improve graph tuner dealing with Tuple (#3649) · 4f120464
```
* Improve graph tuner dealing with Tuple

* Add test case

* Move some data out of _base.py

* Fix lint
```
  Yao Wang committed 5 years ago
- [TOPI] Update tophub according to the fix in schedule (opencl and rocm) (#3752) · 3d4ba8d3
  Lianmin Zheng committed 5 years ago
  
06 Aug, 2019 1 commit
- Fix (2/2) [TOPI] conv2d schedule code (#3648) (#3717) · 831b32e7
```
* Fix the tile_rx and tile_ry issue.

    Note that this patch depends on pull request #9 in tvm-distro.
```
  mingwayzhang committed 5 years ago
02 Aug, 2019 1 commit
- [AutoTVM] Fix hang/crash issues on feature extraction (#3689) · 8ad36a17
```
* [AutoTVM] Fix hang/crash issues on feature extraction

* Update xgboost_cost_model.py

* fix lint
```
  Lianmin Zheng committed 5 years ago
29 Jul, 2019 1 commit

[VTA] Refactor to increase platform coverage (Ultra96 etc.) (#3496) · f55609b4

* hardware refactor for increased FPGA coverage, small optimizations

* fix header

* cleaning up parameters that won't be needed for now

* streamlining makefile, and simplifying tcl scripts

* moving parameter derivation into pkg_config.py, keeping tcl scripts lightweight

* refactoring tcl script to avoid global variables

* deriving AXI signals in pkg_config.py

* unifying address map definition for hardware and software drivers

* single channel design for ultra96 to simplify build

* enable alu by default, no mul opcode for now

* hardware fix

* new bitstream; vta version

* avoid error when env variable is not set

* ultra96 cleanup

* further cleaning up tcl script for bitstream generation

* preliminary rpc server support on ultra96

* rpc server tracker scripts

* ultra96 ldflag

* ultra96 support

* ultra96 support

* cleanup line

* cmake support for ultra96

* simplify memory instantiation

* cleaning up IP parameter initialization

* fix queue instantiation

* 2019.1 transition

* fix macro def

* removing bus width from config

* cleanup

* fix

* turning off testing for now

* cleanup ultra96 ps instantiation

* minor refactor

* adding comments

* upgrading to tophub v0.6

* model used in TVM target now refers to a specific version of VTA for better autoTVM scheduling

* revert change due to bug

* rename driver files to be for zynq-type devices

* streamlining address mapping

* unifying register map offset values between driver and hardware generator

* rely on cma library for cache flush/invalidation

* coherence management

* do not make buffer packing depend on data types that can be wider than 64 bits

* refactor config derivation to minimize free parameters

* fix environment/pkg config interaction

* adding cfg dump property to pkgconfig

* fix rpc reconfig

* fix spacing

* cleanup

* fix spacing

* long line fix

* fix spacing and lint

* fix line length

* cmake fix

* environment fix

* renaming after pynq since the driver stack relies on the pynq library - see pynq.io

* update doc

* adding parameterization to name

* space

* removing reg width

* vta RPC

* update doc on how to edit vta_config.json

* fix path

* fix path

committed 5 years ago


19 Jul, 2019 1 commit
- [AutoTVM]Improve graph tuner for multiple subgraphs (#3490) · be260836
```
* Improve boundary nodes in graph tuner

* Limit output node number

* Fix test

* Improve warning.

* Fix test
```
  Yao Wang committed 5 years ago