* add tensor core support
* avoid memory bank conflict
* fix thread sync & better performance
* better performance
* add schedule test for conv2d
* extend into BatchMatMul
* support config fragment shape and layout using intrinsic
* add TensorCore tutorial
* add int support and fix lint
* address comment
* add 32*16*8 TensorCore test
* fix wmma include logic

Name | Last commit | Last update |
---|---|---|
node | | |
relay | | |
runtime | | |
api_registry.h | | |
arithmetic.h | | |
attrs.h | | |
base.h | | |
buffer.h | | |
build_module.h | | |
channel.h | | |
codegen.h | | |
data_layout.h | | |
dtype.h | | |
expr.h | | |
expr_operator.h | | |
ir.h | | |
ir_functor_ext.h | | |
ir_mutator.h | | |
ir_pass.h | | |
ir_visitor.h | | |
logging.h | | |
lowered_func.h | | |
operation.h | | |
packed_func_ext.h | | |
schedule.h | | |
schedule_pass.h | | |
target_info.h | | |
tensor.h | | |
tensor_intrin.h | | |