util.h
2.2 KB
-
[Perf] Enhance cudnn and cublas backend and enable TensorCore (#4353) · dabde40f
* add half and mix precision support to cublas backend * add TensorCore support in CuDNN * enhance CuDNN support * address comments and fix lint * fix * add fp16 test
Siyuan Feng committed