tests/python/unittest/test_target_codegen_cuda.py · b2a32ddfd3e60560d08e6ecfa25a96f219a744e6 · wenyuanbo / tic

[CodeGen][CUDA] Vectorization for intrinsics (#5101) · 05b0f7e0

- This allows to emit vectorized loads/stores
  for CUDA math intrinsics.

- A few intrinsics should be lowered as CUDAMath not CUDAFastMath ones.

- Fixed the code block identation.

committed Mar 22, 2020

05b0f7e0

test_target_codegen_cuda.py 17.9 KB