nn.py
34.4 KB
[Relay, Quantization, TOPI] int8 dense on CUDA & Dense op quantization (#2877) · cc09497e
* Quantize dense layers
* Add out_dtype argument to dense; Add dense_int8 on CUDA
* Add topi unittest of dense int8
* Fix relay
* Fix topi integration
* Fix quantization
* Update dense_rewrite
* Trigger CI
* Change qconfig quantize_dense to quantize_op
* Fix
* Remove quantize_op from qconfig
Wuwei Lin committed