* support cuda tensorcore subbyte int data type in auto tensorcore
* add license
* pass cpplint
* fix code review comments
* merge the int4/int1 codegen tutorial into the existing auto tensorcore tutorial
* using master's new API
* disable tuning when cuda is not enabled
* address cr comment
* do not run the tuning
* fix test failure
* fix cpplint error
* fix bool type reduction bug
* 1. fix an index bug 2. fix returned bytes value of int1/int4/uint4
* fix typo

Name | Last commit | Last update |
---|---|---|
contrib | | |
cuda | | |
graph | | |
metal | | |
micro | | |
opencl | | |
opengl | | |
rocm | | |
rpc | | |
sgx | | |
stackvm | | |
vm | | |
vulkan | | |
builtin_fp16.cc | | |
c_runtime_api.cc | | |
container.cc | | |
cpu_device_api.cc | | |
dso_library.cc | | |
file_util.cc | | |
file_util.h | | |
library_module.cc | | |
library_module.h | | |
meta_data.h | | |
module.cc | | |
ndarray.cc | | |
object.cc | | |
object_internal.h | | |
pack_args.h | | |
registry.cc | | |
runtime_base.h | | |
system_library.cc | | |
thread_pool.cc | | |
thread_storage_scope.h | | |
threading_backend.cc | | |
workspace_pool.cc | | |
workspace_pool.h | | |