Commits · e91cc5aba8f99ffe216a6188edf6818e1b87237f · wenyuanbo / tic

23 Dec, 2019 1 commit
- [VTA][Chisel] End-to-end Inference with Chisel VTA (#4574) · dfc4009c
```
* [VTA][Chisel] End-to-end Inference with Chisel VTA

* Update TensorAlu.scala
```
  Liangfu Chen committed 5 years ago
  dfc4009c Browse Directory
21 Dec, 2019 1 commit
- [VTA] improved virtual memory mapping (#4545) · 44cb1054
```
* [VTA] improved virtual memory mapping

* Update virtual_memory.cc
```
  Liangfu Chen committed 5 years ago
  44cb1054 Browse Directory
16 Dec, 2019 1 commit
- fix crash issue in tsim backend (#4527) · 10392854
  Liangfu Chen committed 5 years ago
  
  10392854 Browse Directory
11 Dec, 2019 1 commit
- [VTA] Speedup TSIM by Multi-threading (#4491) · 599775f4
```
This PR tries to increase TSIM performance by introducing multi-threading support.
```
  Liangfu Chen committed 5 years ago
  599775f4 Browse Directory
09 Dec, 2019 1 commit

[VTA] Bringing group convolution support (#4421) · 6ab15806

* group conv operator support for VTA

* autotvm tuning script for group conv2d

* lint fix

* lint fix

* lint fix

* addressing comments

committed 5 years ago

6ab15806 Browse Directory

28 Nov, 2019 1 commit
- fix multiple transfer issue in loaduop (#4442) · 52160f99
  Liangfu Chen committed 5 years ago
  
  52160f99 Browse Directory
27 Nov, 2019 2 commits

[VTA] Enable streamlined GEMM execution (#4392) · 3a1c8c5d

* disable pipelined adder and enable streamlined gemm execution

* pipeline first layer of adder

* explain difference between pipeadder and adder

* add comment for explaining the hard-coded latency

committed 5 years ago

3a1c8c5d Browse Directory

[VTA][HotFix] Relay->VTA quantization fix (#4433) · 651bdf2f
```
* relay -> vta fix

* setting optlevel to 3 for quantization to fold batchnorm
```
Thierry Moreau committed 5 years ago
651bdf2f Browse Directory

26 Nov, 2019 1 commit
- removing nnvm dep from VTA sources (#4419) · aab65ad2
  Thierry Moreau committed 5 years ago
  
  aab65ad2 Browse Directory
24 Nov, 2019 3 commits

[License] move cma_api to 3rdparty. separate BSD 2-clause and 3-clause (#4410) · 6a9e6e4d
```
* [License] move cma_api to 3rdparty. separate BSD 2-clause and 3-clause

* add zlib license for blockingconcurrentqueue.h
```
Yizhi Liu committed 5 years ago
6a9e6e4d Browse Directory

[LINT] Remove unnecessary copyright message for files with ASF header (#4409) · c8772288

* [LINT] Improve the check tool to handle ASF copyright message.

* [LINT] Remove unnecessary copyright message as per ASF requirement.

* Fix codegen hybrid

* [LINT] Broaden license checks to include html, xml

* [LINT] Fix rest of the files

* Fix notice

* [LINT] Improve check file type error message

committed 5 years ago

c8772288 Browse Directory

[Release] resolve license issues (#4408) · 8ba1d7d1
Yizhi Liu committed 5 years ago

8ba1d7d1 Browse Directory

22 Nov, 2019 1 commit
- update_document_after_repository_renamed (#4398) · 030a1632
  tripley committed 5 years ago
  
  030a1632 Browse Directory
18 Nov, 2019 1 commit
- [SOURCE] Add ASF header to __init__.py files (#4359) · 00521fab
  Tianqi Chen committed 5 years ago
  
  00521fab Browse Directory
15 Nov, 2019 1 commit
- [VTA] Bug fix for padded load with large inputs (#4293) · 5b1ca85d
```
* bug fix for padded load with large inputs

* Update TensorLoad.scala

* Update test_vta_insn.py
```
  Liangfu Chen committed 5 years ago
  5b1ca85d Browse Directory
14 Nov, 2019 1 commit
- fix error when memory_id is VTA_MEM_ID_OUT (#4330) · dab7172a
  jason-song-dev committed 5 years ago
  
  dab7172a Browse Directory
11 Nov, 2019 1 commit

[RUNTIME][REFACTOR] Use object protocol to support runtime::Module (#4289) · f823c577

Previously runtime::Module was supported using shared_ptr.
This PR refactors the codebase to use the Object protocol.

It will open doors to allow easier interpolation between
Object containers and module in the future.

committed 5 years ago

f823c577 Browse Directory

06 Nov, 2019 2 commits
- [VTA] Hotfix for padded load test in Chisel VTA (#4264) · 1eca1ad1
```
* Update TensorUtil.scala

* Update test_vta_insn.py
```
  Liangfu Chen committed 5 years ago
  1eca1ad1 Browse Directory
- [DOCS] Update link loc (#4257) · 86b844b9
  Tianqi Chen committed 5 years ago
  
  86b844b9 Browse Directory
02 Nov, 2019 1 commit

[VTA] Performance optimize, remove unnecessary contigious memory use. (#4246) · 008aa838

* [VTA] Performance optimize, remove unnecessary contigious memory use.

Issue:
Uop maintain a cache vector to copy uop data into contigious DRAM memory for
FPGA/Simulator use, but this cache vector not get clear after FPGA/Simulator
core run, in Resnet18 case, if we printf the cache size in UopQueue::ReadBarrier
function, we can saw such cache size keep increase, this would cause
no use data copy and unnecessary contigous DRAM memory malloc.

Analysis:
This issue caused by not clear cache_ vector when do
uop_queue_.Reset().

Solution:
Override BaseQueue Reset function in UopQueue and add cache_ clear
logic.

* address review comments, remove spacing.

committed 5 years ago

008aa838 Browse Directory

27 Oct, 2019 1 commit

[VTA][Chisel] TSIM VTA Source Refactor (#4163) · 13b28566

* app init push

* fix on readme

* change name, add bit serial explanantion

* rm serialLoadMM, change doc

* syntax change for readme

* add parallel test functionality

* fix readme

* add python doc

* syntax

* init commit

* fix empty line

* fix typo

committed 5 years ago

13b28566 Browse Directory

24 Oct, 2019 1 commit

TensorCore Support using Intrinsic (#4136) · 324a9607

* add tensor core support

* avoid memory bank conflict

* fix thread sync & better performance

* better performance

* add schedule test for conv2d

* extend into BatchMatMul

* support config fragment shape and layout using intrinsic

* add TensorCore tutorial

* add int support and fix lint

* address comment

* add 32*16*8 TensorCore test

* fix wmma include logic

committed 5 years ago

324a9607 Browse Directory

10 Oct, 2019 1 commit

[VTA][TSIM] Serial GEMM Application Added (#4082) · 47e50e1e

* app init push

* fix on readme

* change name, add bit serial explanantion

* rm serialLoadMM, change doc

* syntax change for readme

* add parallel test functionality

* fix readme

* add python doc

* syntax

committed 5 years ago

47e50e1e Browse Directory

08 Oct, 2019 1 commit
- Fix wrong n_trial number in autotvm tutorials' progress bar (#4070) · 90b10b80
```
if n_trial is larger then config space.
```
  Attila Dusnoki committed 5 years ago
  90b10b80 Browse Directory
28 Sep, 2019 1 commit
- [ARITH] cleanup the indexmod/div on python side (#4028) · f98035b0
  Tianqi Chen committed 5 years ago
  
  f98035b0 Browse Directory
13 Sep, 2019 1 commit

[VTA] RPC path update. (#3924) · 06aecc60

Issue:
RPC path get changed into "vta_rpc" from "pynq_rpc", but related
document still use old informaiton.

Solution:
Update RPC path information.

committed 5 years ago

06aecc60 Browse Directory

09 Sep, 2019 1 commit
- [VTA][Config] hotfix denano10 (#3918) · 83d2418a
  Luis Vega committed 5 years ago
  
  83d2418a Browse Directory
07 Sep, 2019 1 commit

[VTA] Support TLPP in function simulator. (#3555) · 50c4546f

* [VTA] Support TLPP in function simulator.
Issue:
currently vta function simulator just doing serialized instruction
execution, the dependency logic of runtime ISA which use for task
level pipe line parallelism can not get verified by function simulator.

Solution:
make the simulator driver to be multiple thread and support TLPP.

Benefit:
TLPP support VTA function simulator would make VTA logic testing/debug
/change more easy.

replace boost lockfree queue

add configure control for simulator tlpp enable or disable.

change code tyle into google style.

Wrap queue read/write and sync logic to make function call more simple.

Add some comments.

Remove MT logic, change into Single thread mode.

address review comments.

code style change to match google code style and add comments.

add cmake macro to enable/disable simulator tlpp logic.

submodule update.

correct file name mentioned in comments.

* remove USE_VTA_FSIM_TLPP.

committed 5 years ago

50c4546f Browse Directory

05 Sep, 2019 3 commits

[VTA][TOPI] Conv2d transpose (deconvolution) operator support (#3777) · 23c22812

* initial conv2d_transpose

* correct select operator

* cleanup

* fix

* fix correcness check

* conv2d transpose declaration fix

* autotvm conv2d_transpose tuning script

* ir pass fix

* fix tuning script

* deriving params from env, adding bias

* removing bias comp from deconvolution

* lint

* fix

* lint

* lint

* turning off cpu

* lint, ops

* lint

* import fix

* removing hard coded values

* lint

committed 5 years ago

23c22812 Browse Directory

[VTA][Relay] Extending Vision model coverage compilation for VTA (#3740) · 028f47ce

* adding support for graphpack over multiply op

* increasing resnet model coverage

* fix indentation

* lint

* moving recursion limit fix into graphpack pass

* moving recursionlimit to relay init

* pooling on NCHWnc format

* adding more models

* deploy_resnet_on_vta.py

* trailing line

* generalizing to vision models

* merge conflicts

* fix, apply quantization to VTA only

* improving comments

* trimming models that have runtime issues for the moment

* lint

* lint

* lint

committed 5 years ago

028f47ce Browse Directory

[VTA] de10-nano driver (#3394) · 734df8d5

* rework;

* `de10-nano` -> `de10nano`;

* fix compilation error;

* bug fix;

* Update install.md

* Update install.md

* Update install.md

* update with current runtime;

* add debug messages;

* bug fix in cma kernel module;

committed 5 years ago

734df8d5 Browse Directory

04 Sep, 2019 2 commits
- [VTA][Chisel] add ISA BitPat generation (#3891) · f07fe80a
  Luis Vega committed 5 years ago
  
  f07fe80a Browse Directory
- [VTA][Chisel] add scalafmt and format existing scala codebase (#3880) · 5fe61fd1
```
* [VTA][Chisel] add scalafmt and format existing scala codebase

* change column width to 100

* add scalafmt conf file as a valid file type

* add asf header to scalafmt conf file and rerun formatter
```
  Luis Vega committed 5 years ago
  5fe61fd1 Browse Directory
03 Sep, 2019 1 commit

[VTA] Fix TSIM compile error in Linux (add missing -fPIC flag) (#3876) · f4a28c4b

* [VTA] Fix TSIM compile error in Linux (add missing -fPIC flag);

* [VTA] Fix TSIM compile error in Linux (add missing -fPIC flag);

* fix indentation problem;

committed 5 years ago

f4a28c4b Browse Directory

02 Sep, 2019 1 commit
- [VTA][Chisel] rename USE_TSIM macro with USE_VTA64 and cleanup runtime (#3872) · 4434a89c
  Luis Vega committed 5 years ago
  
  4434a89c Browse Directory
01 Sep, 2019 1 commit
- [VTA][TSIM] add virtual memory support to tsim example (#3868) · 9d880bd3
```
* [VTA][TSIM] add virtual memory support to tsim example

* fix identation

* remove USE_TSIM macro and use 32-bit addr instead
```
  Luis Vega committed 5 years ago
  9d880bd3 Browse Directory
29 Aug, 2019 1 commit

[VTA] Fix RewriteForceSerial Function logic issue. (#3854) · 187600da

Issue:
RewriteForceSerial is a debug function to force instructions
to be serialize instead of parrallel running, by doing so we
can isolate some parallel problem or do performance compare
between parallel and serialize. But this function have some
problem, once get enabled by set debug flag, vta would stuck
when running on pynq board.

Analysis:
once enable RewriteForceSerial, the dependency logic is different
with default one, but we still use same logic to generate FINISH
and other logic, this would cause dead lock.

Solution:
give a different dependency settings when enable RewriteForceSerial.

committed 5 years ago

187600da Browse Directory

27 Aug, 2019 1 commit
- [VTA] Parameterization and bug fix in TensorLoad module (#3841) · 347e3d9d
  Liangfu Chen committed 5 years ago
  
  347e3d9d Browse Directory
26 Aug, 2019 1 commit

[VTA][TSIM] Introduce Virtual Memory for TSIM Driver (#3686) · 92b6ca71

* initial virtual memory;

* initial integration;

* include the header file in cmake;

* implement allocation with virtual to logical address mapping;

* virtual memory for tsim_driver;

* implement the missing memory release function;

* readability improvement;

* readability improvement;

* address review comments;

* improved robustness in virtual memory allocation;

* remove VTA_TSIM_USE_VIRTUAL_MEMORY macro and use virtual memory for tsim by default;

* link tvm against vta library;

* merge with master

* build virtual memory system without linking tvm against vta;

* minor change;

* reuse VTA_PAGE_BYTES;

* using DRAM class from sim_driver as VirtualMemoryManager;

* satisfy linter;

* add comments in code;

* undo changes to Makefile

* undo changes to Makefile

* retrigger ci;

* retrigger ci;

* directly call into VirtualMemoryManager::Global()

committed 5 years ago

92b6ca71 Browse Directory

18 Aug, 2019 1 commit
- [VTA][TSIM] parallel TSIM hardware compilation with macOS and debug support (#3797) · 80fc943f
```
* [VTA][TSIM] parallel hardware compilation with macOS and debug support

* simplify
```
  Liangfu Chen committed 5 years ago
  80fc943f Browse Directory