* support cuda tensorcore subbyte int data type in auto tensorcore * add lisence * pass cpplint * fix code review comments * merge the int4/int1 codegen tutorial into the existing auto tensorcore tutorial * using master's new API * disable tuning when cuda is not enabled * address cr comment * do not run the tuning * fix test failure * fix cpplint error * fix bool type reduction bug * 1. fix a index bug 2. fix returned bytes value of int1/int4/uint4 * fix typo
| Name |
Last commit
|
Last update |
|---|---|---|
| .. | ||
| arg_binder.cc | Loading commit data... | |
| arg_binder.h | Loading commit data... | |
| bound_checker.cc | Loading commit data... | |
| combine_context_call.cc | Loading commit data... | |
| coproc_sync.cc | Loading commit data... | |
| detect_device.cc | Loading commit data... | |
| ffi_api.cc | Loading commit data... | |
| hoist_if_then_else.cc | Loading commit data... | |
| infer_fragment.cc | Loading commit data... | |
| inject_copy_intrin.cc | Loading commit data... | |
| inject_double_buffer.cc | Loading commit data... | |
| inject_prefetch.cc | Loading commit data... | |
| inject_virtual_thread.cc | Loading commit data... | |
| inline.cc | Loading commit data... | |
| ir_deep_compare.cc | Loading commit data... | |
| ir_util.cc | Loading commit data... | |
| ir_util.h | Loading commit data... | |
| lift_attr_scope.cc | Loading commit data... | |
| loop_partition.cc | Loading commit data... | |
| lower_custom_datatypes.cc | Loading commit data... | |
| lower_intrin.cc | Loading commit data... | |
| lower_thread_allreduce.cc | Loading commit data... | |
| lower_tvm_builtin.cc | Loading commit data... | |
| lower_warp_memory.cc | Loading commit data... | |
| make_api.cc | Loading commit data... | |
| remap_thread_axis.cc | Loading commit data... | |
| remove_no_op.cc | Loading commit data... | |
| rewrite_unsafe_select.cc | Loading commit data... | |
| simple_passes.cc | Loading commit data... | |
| skip_assert.cc | Loading commit data... | |
| split_host_device.cc | Loading commit data... | |
| ssa.cc | Loading commit data... | |
| storage_access.cc | Loading commit data... | |
| storage_access.h | Loading commit data... | |
| storage_flatten.cc | Loading commit data... | |
| storage_rewrite.cc | Loading commit data... | |
| storage_sync.cc | Loading commit data... | |
| tensor_core.cc | Loading commit data... | |
| unroll_loop.cc | Loading commit data... | |
| vectorize_loop.cc | Loading commit data... | |
| verify_compact_buffer.cc | Loading commit data... | |
| verify_gpu_code.cc | Loading commit data... | |
| verify_memory.cc | Loading commit data... |