Commits · e3016371c4f3b3772c618f362498968d8b4e862a · wenyuanbo / tic

07 Jan, 2020 1 commit
- [QNN] Channel wise quantization - Quantize & Requantize (#4629) · 76efece3
  Animesh Jain committed 5 years ago
  
  76efece3 Browse File
03 Jan, 2020 1 commit
- {QNN] Making scale/zero_points as expr instead of attrs. (#4611) · 0720ed67
  Animesh Jain committed 5 years ago
  
  0720ed67 Browse File
14 Nov, 2019 1 commit
- [QNN] Use Int16 upcast in Fallback Conv2D. Fix test names. (#4329) · f34dea41
  Animesh Jain committed 5 years ago
  
  f34dea41 Browse File
02 Oct, 2019 1 commit
- [QNN][Relay] Calling Dialect passes from inside Relay Build API. (#3971) · 36201fe9
  Animesh Jain committed 5 years ago
  
  36201fe9 Browse File
20 Sep, 2019 1 commit
- [QNN] Renaming tests to follow the Relay nomenclature. (#3975) · 0840b064
  Animesh Jain committed 5 years ago
  
  0840b064 Browse File
02 Sep, 2019 1 commit
- [QNN] Requantize - Optimize lowering for some corner cases. (#3864) · 1bc83853
  Animesh Jain committed 5 years ago
  
  1bc83853 Browse Directory
30 Aug, 2019 1 commit
- [Relay][QNN] QNNtoRelay & QNNLegalize Pass utility using Relay Legalize API. (#3838) · 671421a8
  Animesh Jain committed 5 years ago
  
  671421a8 Browse Directory
16 Aug, 2019 1 commit

QNN quantize and dequantize operators. (#3745) · d3eb9cb8

* QNN quantize and dequantize operators.

* addressing review comments.

* addressing review comments.

* Adding new line at the end of the file.

* Adhering to styling guidelines.

* Adding name to contributors.

* Fixing lint issue.

* Fixing file name.

* Removing unnecessary code.

committed 5 years ago

d3eb9cb8 Browse Directory

08 Aug, 2019 1 commit

[QNN] Requantize operator (#3531) · a78adbd5

* [Relay] [Quantization] WIP - Common files for the qauntization work.

* [Relay] [Quantization] WIP - Prototyping requantize op.

* Requantize operator implementation.

Requantize converts one quantized tensor representation to another quantized
representation. The PR has following implementation features

- Requantize operator defined in qnn namespace - relay.qnn.requantize
- Lowering of the requantize to exisiting Relay operators
- Integer fixed point implementation of requantize
    - Two rounding modes - FE_UPWARDS (round towards infinity) and
    FE_AWAY_FROM_ZERO (std::round behavior)
- Floating point implementation as well, that can act as reference or can be
used for devices when FP32 computation is not used.
- Unit test cases

Relevant Issue - https://github.com/dmlc/tvm/issues/2351

Credit to TFLite and GemmLowp to provide reference implementations.

* Typo and lint fixes.

* Doc fix.

* Uncommenting the lint script (fixing mistake).

* Modifying the unit tests.

* Moving C++ files into src/relay/qnn

* Moving python files to python/tvm/relay/qnn. Some minor fixes.

* Moving the attrs.h inside the include directory.

* Pushing files that I forgot earlier. Changing util location.

* Incorporating comments. API change. Lint fixes.

* Modifying the GetFixedPointMultiplierShift API as per comments.

* Forgot the dialect change.

* Changing rewrite to qnn_lower.

* Renaming Quantize to Qnn for clarity.

* Remove use_int_domain.

* Incorportaing review comments.

* Adding API doc for QNN dialect.

* Move the qnn_lower pass to transform namespace.

* Moving from expr to module. Adding namespace in C++.

* Minor sentence rewrites. Added qnn namespace.

* Added the API doc.

* Chanding default out_dtype to int8. Adding a test with in/out_dtype as uint8.

* Style fixes. Better error messages.

* Adding documentation.

* More documentation fixes.

* Adding out dtype check for requantize.

* Adding corner case for FP32 to fixed point conversion.

* Adding extra line.

* Documentation fix.

* Adding static inline.

* Incorporating jackwish comment. Removed idtype from requantize lowering.

* Removing Quantize/Dequantize code. Restricting Requantize to (u)int8/int32.

* Style fixes.

* Fix the docs.

* Move to Legalize API.

committed 5 years ago

a78adbd5 Browse Directory