| Name | Last commit | Last update |
|---|---|---|
| autotvm | | |
| dev | | |
| frontend | | |
| language | | |
| optimize | | |
| topi | | |
| README.txt | | |
| cross_compilation_and_rpc.py | | |
| relay_quick_start.py | | |
| tensor_expr_get_started.py | | |

* Add Auto TensorCore TensorCore Unit Test
* Rebase to tvm master branch & Add auto tensor core
* Code Refine
* Add tensor core switch by pragma
* Add pragma in tensor core example code
* Get real tile size to replace hard coded 16
* support more than 2 dimensions (e.g. batchmatmul) for buffer bind scope
* support batch matmul
* Move cuda env check to tensor_core.cc
* Coderefine for tensor_core.cc
* Refine comments
* Some refinements of code and comment
* Update TensorCore UT to pass the CPU test
* remove redundant code
* matmul's storage align for different layout
* Add support for different position of type cast
* Add formal tutorial for auto tensorcore codegen
* move tensorcore check up to tutorial code
* code and doc refine
* comment out tune_and_evaluate in tutorial
* fix cpplint error
