Name | Last commit | Last update |
---|---|---|
.. | ||
datatype | ||
llvm | ||
opt | ||
source | ||
spirv | ||
stackvm | ||
build_common.h | ||
codegen.cc | ||
generic_func.cc | ||
intrin_rule.cc | ||
intrin_rule.h | ||
target.cc | ||
target_info.cc |

* support cuda tensorcore subbyte int data type in auto tensorcore
* add lisence
* pass cpplint
* fix code review comments
* merge the int4/int1 codegen tutorial into the existing auto tensorcore tutorial
* using master's new API
* disable tuning when cuda is not enabled
* address cr comment
* do not run the tuning
* fix test failure
* fix cpplint error
* fix bool type reduction bug
* 1. fix a index bug 2. fix returned bytes value of int1/int4/uint4
* fix typo