Commits · 41e1d5f911493c62cf3ae39fe1420ed0ae17d62c · wenyuanbo / tic

10 Mar, 2020 2 commits

Revive the Rust + SGX refactor (#4976) · 41e1d5f9

* Add Nick's changes's squashed

* Fix frontend compilation

* Re-enable Rust CI

* Add changes with conflicted badly

* Restructure import_module! macro in order to avoid unstable features

* Kill old unstable feature enablement

* Refactor common to use new APIs

* Move the code to stable

* Fix warning

Co-authored-by: Nick Hynes <nhynes@oasislabs.com>

committed Mar 09, 2020

41e1d5f9 Browse Files

[REDO AFTER GH BUG] Add support for quantized models via QNN (#5016) · 93dff448
```
This reverts commit f346c602.
```
masahi committed Mar 10, 2020
93dff448 Browse Files

09 Mar, 2020 4 commits

Revert "[Torch, QNN] Add support for quantized models via QNN (#4977)" (#5013) · f346c602
```
This reverts commit fc7f0783.
```
Animesh Jain committed Mar 09, 2020
f346c602 Browse Files
typo (#5008) · 6ee9c2f8
雾雨魔理沙 committed Mar 09, 2020

6ee9c2f8 Browse Files

[Runtime] MISRA-C compliant TVM runtime (#3934) · 450f7163

* implement of MISRA-C compliant TVM runtime;

* working on bundle_deploy_c demo

* move header files into include dir

* fix compatibility issues

* fix compatibility issues

* resolve most of the warnings and errros

* implement c_backend_api

* introduce bridge

* working well

* move to header files and bundle.c into src/runtime/crt

* clean up

* satisfy linter

* clean up

* test with the cat image

* remove synset

* refactoring

* refactoring

* refactoring

* initial crt_runtime_api.c

* improved compatibility with g++

* using exposed API in c_runtime_api.h

* call from c_runtime_api.h

* clean up

* lint

* merge into apps/bundle_deploy directory

Change-Id: I51904db81b8589e65d107d8ca77b47452e3812b5

* make the demo runs in ci

Change-Id: I2c24f8b592508833d3555311c2b24d1931f19385

* address review comments

Change-Id: I027ddff15c31fb4da0bd0e461427dce619de1f93

* release

Change-Id: I5ad5bb8426468aac9fc8d074e56ddea358a7fd91

* fix ci testing

Change-Id: Ic2e82fb3051b6c254ef32a964f976b61e3e5fe4d

* add test case for misra c runtime

Change-Id: Ie0dfd0ade6be4665b4384db7d260a6c69b35010f

* fread files in testing to avoid calling xxd

Change-Id: Ie7fbc16b4b0b9509918d986a841f443900813bef

committed Mar 09, 2020

450f7163 Browse Files

[VTA][Chisel,de10nano] Chisel fixes and de10nano support (#4986) · 5b4cf5df

* [VTA][de10nano] Enable user defined target frequency.

Issue:
The VTA target frequency on the DE10-Nano is hardcoded to 50MHz
unnecessarily limiting performance.

Solution:
Add a PLL to the FPGA sub-system along with support for the
selection of a user specified frequency at build time. The board
successfully builds and runs at 100MHz.

* Added a PLL in the soc_system.tcl platform designer generator
  script.

* Modified the Makefile to automatically set the target frequency
  from that specified in the pkg_config.py file.

* Modified the Makefile to generate a bitstream with an RBF
  format that enables programming of the FPGA directly from
  the on-board processor. Specifically, the RBF is generated in
  FastParallel32 mode with compression, which corresponds to the
  default MSEL switch setting on the board, i.e. 01010.

* Added a false path override to file set_clocks.sdc to turn off
  unconstrained path warnings on the VTA pulse LED.

* [VTA][TSIM] Add more debug and tracing options.

* Modified Makefile to change default config to DafaultDe10Config.

* Added option in Makefile to produce more detailed tracing
  for extra observability in debugging complex scenarios.

* Added option in Makefile to produce traces in FST format which
  are 2 orders of magnitude smaller, although much slower to
  generate.

* Added option in Makefile to build the simulator with GCC address
  sanitizer.

* Modified Makefile to not lint the scala code by default avoiding
  unintended wrong indentation. Linting should be better performed
  manually on a per-need basis.

* [VTA][de10nano] Enable remote programming of FPGA.

Issue:
The Cyclone V FPGA on board of the DE10-Nano can only be programmed
using the JTAG port, which is a limiting option for users.

Solution:
Add support for the remote programming of the FPGA implementing
the FPGA programming manager protocol published in the Cyclone V
user manual.

* Added file de10nano_mgr.h implementing an FPGA manager class
  that supports handling of control and status registers as well
  as a push-button option to program the FPGA. The class can be
  easily extended to include more registers if needed.

* Used an instance of the FPGA manager to implement function
  VTAProgram also warning users when incompatible bitstream
  files are used.

* Registered VTAProgram as a global function and modified
  the program_bitstream python class to use it.

* [VTA][de10nano] Enhance de10nano runtime support.

Issue:
The de10nano target has incomplete, non-working support
for runtime reconfiguration, bitstream programming, and
examples of usage.

Solution:
Complete runtime support for the de10nano target.

* Modified VTA.cmake to comment out a default override for
  VTA_MAX_XFER to 21 bit wide.

* Modified VTA.cmake to add needed de10nano include dirs.

* Modified relevant files to support de10nano same way as
  other targets for VTA runtime reconfiguration and FPGA
  programming.

* Added test_program_rpc.py example as a runtime FPGA
  programming example. Note that unlike the pynq target
  no bitstream is either downloaded or programmed when
  the bitstream argument is set to None.

* Cosmetic changes to vta config files.

* [VTA][Chisel] LoadUop FSM bug fix.

Issue:
The LoadUop FSM incorrectly advances the address of the next
uop to read from DRAM when the DRAM data valid bit is deasserted
and asserted at the end of a read. This is caused by a mismatch
in the logic of the state and output portions of the FSM.
This is one of two issues that was gating the correct operation
of VTA on the DE10-Nano target.

Solution:
Modify the logic of the output section of the FSM to include
a check on the DRAM read valid bit or fold the output assignemnt
into the state section.

* Folded the assignemnt of the next uop address in the state
  section of the FSM.

* [VTA][Chisel] Dynamically adjust DMA tranfer size.

Issue:
In the DE10-Nano target and possibly in others, DMA transfers that
cross the boundaries of memory pages result in incorrect reads and
writes from and to DRAM. When this happens depending on different
input values, VTA loads and stores exhibit incorrect results for
DMA pulses at the end of a transfer. This is one of two issues that
were gating the DE10-Nano target from functioning correctly, but may
affect other Chisel based targets.

Solution:
Add support for dynamically adjustble DMA transfer sizes in load
and store operations. For a more elegant and modular implementation
the feature can be enabled at compile time with a static constant
that can be passed as a configuration option.

* Modified the load and store finite state machines to dynamically
  adjust the size of initial and stride DMA transfers. The feature
  is enabled by default by virtue of the static constant
  ADAPTIVE_DMA_XFER_ENABLE.

* [VTA][Chisel] Improve FSIM/TSIM/FPGA xref debug.

Issue:
Cross reference between FSIM, TSIM, and Chisel based FPGA traces
is an invaluable instrument that enables fast analysis on FSIM,
and analysis/debug on TSIM and FPGA, especially for complex flows
like conv2d or full inferences. Currently this cannot be done
easily since a suitable reference is missing. The clock cycle
event counter cannot be used since it is undefined in FSIM and
not reliable between TSIM and FPGA because of different latencies.

Solution:
Introduce a new event counter that preserves a program order across
FSIM, TSIM, FPGA. We propose adding the accumulator write event
counter in the Chisel EventCounter class and a simple instrumentation
in the FSIM runtime code. Note that this technique enabled finding the
Chisel issues reportes in the PR, which would have been otherwise
far more difficult.

* Added the acc_wr_count event counter and changed interfaces
  accordingly.

* [VTA][de10nano] Comply with linting rules.

* [VTA] Appease make lint.

* [VTA] Disable pylint import not top level error.

* [VTA][Chisel,de10nano] Linting changes.

* Use CamelCase class names.

* Use C++ style C include header files.

* Add comments to Chisel makefile.

* [VTA][de10nano]

* Reorder C and C++ includes in de10nano_mgr.h.

* Restore lint as default target in Chisel Makefile.

* [VTA][de10nano] Do not use f string in pkg_config.py.

* [VTA][de10nano] Remove overlooked f strings in pkg_config.py.

* [VTA][de10nano] Fixed typo.

* [VTA][TSIM] Check if gcc has align-new.

* [VTA][Chisel] Make adaptive DMA transfer default.

* [VTA][RPC] Renamed VTA_PYNQ_RPC_* to VTA_RPC_*.

Issue:
With more FPGA targets coming online the initial method of
using individual environment variables to specify target IP and port
does not scale well.

Solution:
Use a single VTA_RPC_HOST, VTA_RPC_PORT pair to be changed
every time a different target is used. For instance in a script
used to benchmark all targets.

* Replaced every instance of VTA_PYNQ_RPC_HOST and VTA_PYNQ_RPC_PORT
  with VTA_RPC_HOST and VTA_RPC_PORT, respectively.

* [VTA][Chisel] Comply with new linter.

committed Mar 09, 2020

5b4cf5df Browse Files

08 Mar, 2020 5 commits
- Docs and Readme updated as per new namespace change (#4989) · 78fa1d5e
  ANSHUMAN TRIPATHY committed Mar 08, 2020
  
  78fa1d5e Browse Files
- lower plevel of conv2d winograd on cuda (#4987) · 7eed17b9
  Haichen Shen committed Mar 08, 2020
  
  7eed17b9 Browse Files
- kill from tvm import te (#5007) · 87faaf12
```
Co-authored-by: Michal Jamroz <jamroz@chem.uw.edu.pl>
```
  Michal Jamroz committed Mar 07, 2020
  87faaf12 Browse Files
- [FRONTEND][TENSORFLOW] support multiply outputs (#4980) · cf0a7e28
```
* [FRONTEND][TENSORFLOW] support multiply outputs

* [TENSORFLOW][TEST] add tf_testing.AddShapesToGraphDef test

* update frontend test

* retrigger CI
```
  zhengdi committed Mar 07, 2020
  cf0a7e28 Browse Files
- Add BN support with run-time mean and variance calculation (#4990) · ba477865
  lfengad committed Mar 08, 2020
  
  ba477865 Browse Files
07 Mar, 2020 6 commits
- [VTA][Chisel] Change Scala Linter scalafmt => scalastyle (#4998) · 6a36fb40
```
* scalafmt => scalastyle

Change-Id: Ifc590e7cb63585f35dfdc9efcf3c6287b1afb1dd

* scalafmt => scalastyle

Change-Id: I8aff2632dadda05d2896e28bdaf6f780a160a15a

* add indentation constraint

Change-Id: Ibeb00c11a5718ea47322ea2b82e757828af8af91

* trigger ci again
```
  Liangfu Chen committed Mar 07, 2020
  6a36fb40 Browse Files
- [COMMUNITY] @optima2005 -> reviewer (#5004) · 62424611
  Tianqi Chen committed Mar 07, 2020
  
  62424611 Browse Files
- [relay][external codegen] outline and inline lifted functions for external codegen (#4996) · 28ee806d
```
* outline and inline lifted functions for external codegen

* add batch_norm test

* test batch_norm inline
```
  Zhi committed Mar 07, 2020
  28ee806d Browse Files
- fix ROCm strategy for winograd conv selection (#5001) · fcf8420a
  Thomas Viehmann committed Mar 07, 2020
  
  fcf8420a Browse Files
- [Frontend][Torch] Check graph inputs match expected (#4992) · de346493
```
* [Frontend][Torch] Check graph inputs match expected

* error/warn when missing/unused graph inputs

* Change to use get_graph_input_names
```
  Jeremy Johnson committed Mar 07, 2020
  de346493 Browse Files
- Fix stride default value None in torch.nn.functional.avg_pool (#4984) · de0869de
```
* fix unordered dictionary problem for python version 3.5

* modify style

* default value of stride in torch.nn.functional.avg_pool is None

* delete prev modifications

* add testcase for nn.functional.avg_pool2d
```
  pyjhzwh committed Mar 07, 2020
  de0869de Browse Files
06 Mar, 2020 3 commits

[Runtime] Export GraphRuntime in tvm_runtime.dll (#5002) · e5044cb9
```
Co-authored-by: Jon Soifer <jonso@microsoft.com>
```
Jon Soifer committed Mar 06, 2020
e5044cb9 Browse Files

[topi][relay] add operation tan to TVM (#4938) · d992468d

* Add relay operation relay.op.tan.

* Update tan implementation in TVM.

* Update tests.

* Add shape function for tan.

* Add missing main test to python/frontend/tensorflow/test_forward.

* Revert, back to sin/cos.

* Revert "Revert, back to sin/cos."

This reverts commit 4da5b503b921585ba9d80944b29136142b575c40.

* Fix implementation of tan in cuda. Do not support tan for float16.

Simplify topi/tests/python/test_topi_math. Add testing for tan with float32 and float64.

Try again to implement tan as sin/cos in llvm.

committed Mar 05, 2020

d992468d Browse Files

Adding Hua Jiang as reviewer. (#4993) · a198c9fd
Tianqi Chen committed Mar 05, 2020

a198c9fd Browse Files

05 Mar, 2020 4 commits
- hotfix gcn tutorial fail (#4994) · a2c7f52c
  Zhi committed Mar 05, 2020
  
  a2c7f52c Browse Files
- refactor build module to take IRModule (#4988) · f63b249d
  Zhi committed Mar 05, 2020
  
  f63b249d Browse Files
- Conditions updated to cover better user scenarios (#4951) · fe74b37a
```
* Conditions updated to cover better user scenarios

* [1] New test case added

* [2] New test case added

* [3] Proper variable name used

* [4] Review Comments handled

* [5] Review comments handled

* [6] Review comments handled
```
  Tianqi Chen committed Mar 04, 2020
  fe74b37a Browse Files
- Fix gpu not found when running TVM docker (#4975) · 7a06bbed
  Tianqi Chen committed Mar 04, 2020
  
  7a06bbed Browse Files
04 Mar, 2020 3 commits

[Torch, QNN] Add support for quantized models via QNN (#4977) · fc7f0783

* qnn support initial import

* fix upsampling num input

* imagenet tests added

* add qunatized module tests

* quantized module tests working

* imagenet test working

* fix lint

* remove top level torch import to fix ci error

* disable lint warning on outside toplevel import

* revert parse -> convert change

* add comments to qnn translation

* address comments, add sample outputs

* add more comments

* refactor bias add and requantize step

committed Mar 04, 2020

fc7f0783 Browse Files

Tighten split's extent (#4931) · 585f9ce6

* Set split node's range to minimum of ext and split factor or split nparts, but only when PassDownDomain is called with allow_missing == false, i.e. by InferBound.  Add a helper PassUpThreadBinding() to get a map telling whether an IterVar has at least one leaf IterVar deriving from it binding to a thread. Add two unit tests.

* Enhance LoopVectorizer for vectorizing by 0.  Found at least one case from testtopi/tests/python/test_topi_transform.py::test_tile.

* Revert changes vectorize_loop.cc; when parent's ext is zero, set split's range to the factor or nparts.

* Update with comments.

* Refactor the ext tightening predicate.

* Fix reference types.

* Integrate tvm.te changes.

* Trivial comment change to trigger CI.

* Trivial comment correction to trigger testing.

committed Mar 04, 2020

585f9ce6 Browse Files

[Torch] fix unordered dictionary problem for python version under 3.6 (#4982) · 5a0f39b5
```
* fix unordered dictionary problem for python version 3.5

* modify style
```
pyjhzwh committed Mar 04, 2020
5a0f39b5 Browse Files

03 Mar, 2020 2 commits

[Relay] Target annotation for external codegen (#4933) · 98b17590

* op based external compiler annotation

* Use TVM register directly

* Small fix

* test graph

Co-authored-by: Cody Yu <comaniac0422@gmail.com>

committed Mar 03, 2020

98b17590 Browse Files

Pin xgboost dependency version to 0.90 (#4965) · 09ddc3eb

* Sets xgboost dependency to be 0.90, preventing
   segfaults during TVM python unit tests execution

 * This is discussed in issue #4953

committed Mar 02, 2020

09ddc3eb Browse Files

02 Mar, 2020 4 commits
- [Frontend] [Tensorflow] ReadVariableOp operator support (#4952) · 8502691b
```
* tf frontend read variable op

* pylint fix

* tf frontend freezed graph pruned ops
```
  maheshambule committed Mar 02, 2020
  8502691b Browse Files
- [Relay][Pass] Add inline pass (#4927) · 0fb48360
```
* add inline pass

* IsInline -> IsMarkedInlined

* fix comment
```
  Zhi committed Mar 01, 2020
  0fb48360 Browse Files
- [Doc]refine the example description of max/min/sum/tag_scope (#4974) · 892dc91a
  Ethan-Yan27 committed Mar 01, 2020
  
  892dc91a Browse Files
- [TFLITE]FLOOR_MOD & FLOOR_DIV support (#4971) · 1c8e5b93
```
* TFLite Floor_div & floor_mod parsing code

* Review comment updated
```
  Samuel committed Mar 02, 2020
  1c8e5b93 Browse Files
01 Mar, 2020 3 commits

[Relay][FastMath] Relay pass to use fast exp/tanh (#4873) · 51af454a
```
* [Relay][FastMath] Relay pass to use fast exp/tanh

* Adding required_pass to the tests.

* FastMath test changes.
```
Animesh Jain committed Mar 01, 2020
51af454a Browse Files
[TOPI] fix docs errors (#4973) · 900d99cd
zhengdi committed Mar 01, 2020

900d99cd Browse Files

[Torch] Upsampling op support and enable registering a user defined op conversion map (#4961) · 92a24278

* add custom conversion map

* add roi align test using custom convert map

* refactor test

* add support for upsampling op and test on segmentation models

* remove redundant no_grad

* add upsampling test case

* make the default custom map None, instead of empty dict

* updated tests, remove packaging and drop PT 1.2 support

* add better support for aten::to and tests

* add a note on dilation in x86

committed Mar 01, 2020

92a24278 Browse Files

29 Feb, 2020 2 commits

Added CopyFromBytes and CopyToBytes convenience methods to NDArray. Fixed typos. (#4970) · 474c70d7

* Added CopyFromBytes and CopyToBytes convenience methods.  Fixed typos.

* Removed unneed argument check

* Use TVMArrayCopyFrom/ToBytes methods

* Moved CopyFrom/ToBytes to ndarray.cc

* CopyToBytes impl was using CopyFromBytes.  Fixed

* changed inline to TVM_DLL

* Used impl from TVMArrayCopyTo/FromBytes into NDArray CopyTo/FromBytes

* Move implementation of all CopyFrom/ToBytes into a common impls

* make arg const

* simplify method impl

committed Feb 29, 2020

474c70d7 Browse Files

[Frontend][TFLite] Add parser support for l2_normalization (#4966) · 2355caa8

* [Frontend][TFLite] Add parser support for l2_normalization

* TF doesn't provide uint8 support
* TFL does the normalization only if it's over the last axis
* TFL uses only the default value for expilon

* Change error message

committed Feb 29, 2020

2355caa8 Browse Files

28 Feb, 2020 2 commits

[DOCS] Fix sphinx precheck (#4967) · a449d8b1
```
* [DOCS] Fix sphinx precheck

* ignore keras warnings

* Remove more warnings
```
Tianqi Chen committed Feb 29, 2020
a449d8b1 Browse Files

[Relay, Torch] Clean up and refactor PyTorch frontend (#4944) · 7ccb4363

* The initial import of refactored implementation, all tests passed

* enable mobilenet v2 test

* minor cleanup

* reorg

* fix lint

* use input names that come with torch IR

* fix typo

* introduce parse_operators

* fix lint

* add _ prefix

committed Feb 28, 2020

7ccb4363 Browse Files