Name |
Last commit
|
Last update |
---|---|---|
.. | ||
autotvm | ||
dev | ||
frontend | ||
language | ||
optimize | ||
topi | ||
README.txt | ||
cross_compilation_and_rpc.py | ||
relay_quick_start.py | ||
tensor_expr_get_started.py |
* Add Auto TensorCore TensorCore Unit Test * Rebase to tvm master branch & Add auto tensor core * Code Refine * Add tensor core switch by pragma * Add pragma in tensor core example code * Get real tile size to replace hard coded 16 * support more than 2 dimensions (e.g. batchmatmul) for buffer bind scope * support batch matmul * Move cuda env check to tensor_core.cc * Coderefine for tensor_core.cc * Refine comments * Some refinements of code and comment * Update TensorCore UT to pass the CPU test * remove redundant code * matmul's storage align for different layout * Add support for differenct position of type cast * Add formal tutorial for auto tensorcore codegen * move tensorcore check up to tutorial code * code and doc refine * comment out tune_and_evaluate in tutorial * fix cpplint error
Name |
Last commit
|
Last update |
---|---|---|
.. | ||
autotvm | Loading commit data... | |
dev | Loading commit data... | |
frontend | Loading commit data... | |
language | Loading commit data... | |
optimize | Loading commit data... | |
topi | Loading commit data... | |
README.txt | Loading commit data... | |
cross_compilation_and_rpc.py | Loading commit data... | |
relay_quick_start.py | Loading commit data... | |
tensor_expr_get_started.py | Loading commit data... |