codegen_cuda.h
3.9 KB
-
[CodeGen][CUDA] Enhance CUDA codegen for SelectNode (#4983) · afa84171
- This patch allows CUDA backend to emit correct code for selects with vector conditions, which may be produced by floordiv op lowering etc.. - This already works for llvm BE, as llvm select instruction supports vector conditions. Signed-off-by: Wei Pan <weip@nvidia.com>
Wei Pan committed