[Perf] Enhance cudnn and cublas backend and enable TensorCore (#4353)
* add half and mix precision support to cublas backend * add TensorCore support in CuDNN * enhance CuDNN support * address comments and fix lint * fix * add fp16 test
Showing
This diff is collapsed.
Click to expand it.
Please
register
or
sign in
to comment