Name | Last commit | Last update |
---|---|---|
.. | ||
exec | ||
testing | ||
top | ||
__init__.py | ||
bitstream.py | ||
build_module.py | ||
environment.py | ||
graph.py | ||
intrin.py | ||
ir_pass.py | ||
libinfo.py | ||
pkg_config.py | ||
program_bitstream.py | ||
rpc_client.py |
* add tensor core support
* avoid memory bank conflict
* fix thread sync & better performance
* better performance
* add schedule test for conv2d
* extend into BatchMatMul
* support config fragment shape and layout using intrinsic
* add TensorCore tutorial
* add int support and fix lint
* address comment
* add 32*16*8 TensorCore test
* fix wmma include logic