* add tensor core support * avoid memory bank conflict * fix thread sync & better performance * better performance * add schedule test for conv2d * extend into BatchMatMul * support config fragment shape and layout using intrinsic * add TensorCore tutorial * add int support and fix lint * address comment * add 32*16*8 TensorCore test * fix wmma include logic
Name |
Last commit
|
Last update |
---|---|---|
.. | ||
apps | Loading commit data... | |
config | Loading commit data... | |
hardware | Loading commit data... | |
include/vta | Loading commit data... | |
python/vta | Loading commit data... | |
scripts | Loading commit data... | |
src | Loading commit data... | |
tests | Loading commit data... | |
tutorials | Loading commit data... | |
README.md | Loading commit data... |