- This allows to emit vectorized loads/stores for CUDA math intrinsics. - A few intrinsics should be lowered as CUDAMath not CUDAFastMath ones. - Fixed the code block identation.
Name |
Last commit
|
Last update |
---|---|---|
.. | ||
arith | Loading commit data... | |
autotvm | Loading commit data... | |
contrib/hybrid | Loading commit data... | |
driver | Loading commit data... | |
ir | Loading commit data... | |
node | Loading commit data... | |
printer | Loading commit data... | |
relay | Loading commit data... | |
runtime | Loading commit data... | |
support | Loading commit data... | |
target | Loading commit data... | |
te | Loading commit data... | |
tir | Loading commit data... | |
README.md | Loading commit data... |