- 25 Nov, 2019 1 commit
-
-
* add half and mix precision support to cublas backend * add TensorCore support in CuDNN * enhance CuDNN support * address comments and fix lint * fix * add fp16 test
Siyuan Feng committed
-
- 23 Nov, 2019 1 commit
-
-
Alexander Pivovarov committed
-
- 21 Nov, 2019 1 commit
-
-
Animesh Jain committed
-
- 19 Nov, 2019 1 commit
-
-
* [PERF] parallel reduction in cpu * fix * x * update * lint * fix
Haichen Shen committed
-
- 18 Nov, 2019 2 commits
-
-
Tianqi Chen committed
-
* Add tf FloorMod * Add floor_div/mod into topi and relay * Add to rst * Fix test
Yao Wang committed
-
- 16 Nov, 2019 1 commit
-
-
Philip Hyunsu Cho committed
-
- 15 Nov, 2019 3 commits
-
-
ziyu-guo committed
-
Jian Weng committed
-
* [Contrib] Add MKL DNN * update * update
Haichen Shen committed
-
- 13 Nov, 2019 1 commit
-
-
* Support Proposal operator on CPU. * PyLint space issue * PyLint space issue * Pylint singleton-comparison issue
Zhao Wu committed
-
- 11 Nov, 2019 2 commits
-
-
* Add shape functions * Fix get_const_tuple * Fix cpplint * Fix pylint * Fix pylint * rebase and fix * Check Any for infer type * Fix expand_dim shape func for zero rank input * Fix pooling infer type * Address comment * Register layout transform attr
Yao Wang committed
-
Animesh Jain committed
-
- 08 Nov, 2019 1 commit
-
-
* fix_winograd_cuda_kernel_size * add unit test
Cody Hao Yu committed
-
- 07 Nov, 2019 1 commit
-
-
* Batch matmul tuning running but with errors. * Default x86 schedule as good as before. * Code Cleanup * Remove unused argument. * improved template documentation. * Silly lint fix * Removed leftover comment. * Moved cfg declaration to schedule for batch_matmul * Moved x86 dense cfg declaration to schedule. * lint fix * Removed duplicate cfg declaration in dense. * Reverted changes to dense.
Josh Fromm committed
-
- 06 Nov, 2019 2 commits
-
-
* fix winograd * move get padding after kernel transform
Cody Hao Yu committed
-
Tianqi Chen committed
-
- 30 Oct, 2019 1 commit
-
-
* Add support for Any op * Support ONNX frontend * Add doc * Add to relay docs * Dummy change to retrigger CI
Jon Soifer committed
-
- 28 Oct, 2019 1 commit
-
-
* add scale2 for upsample * update unit test for upsampling * support latest upsample op for multiple frontend * fix lint * fix lint * fix lint * fix lint * update scale description and rebase
Xingyu Zhou committed
-
- 25 Oct, 2019 1 commit
-
-
Animesh Jain committed
-
- 24 Oct, 2019 3 commits
-
-
* add tensor core support * avoid memory bank conflict * fix thread sync & better performance * better performance * add schedule test for conv2d * extend into BatchMatMul * support config fragment shape and layout using intrinsic * add TensorCore tutorial * add int support and fix lint * address comment * add 32*16*8 TensorCore test * fix wmma include logic
Siyuan Feng committed
-
* support conv2d HWCN in AutoTVM and Relay * fix lint * fix comments and unit tests
Cody Hao Yu committed
-
Yao Wang committed
-
- 22 Oct, 2019 1 commit
-
-
mbarrett97 committed
-
- 17 Oct, 2019 1 commit
-
-
* [TOPI][x86] Cascade lake support. * Jenkins test debug 1. * Testing cascade lake alone.
Animesh Jain committed
-
- 15 Oct, 2019 1 commit
-
-
Animesh Jain committed
-
- 10 Oct, 2019 3 commits
-
-
* Add FIFO buffer op to enable explicit computation re-use in convolution * Add a test * Add end-to-end test with 1D convolution * Add a stub in MXNet frontend * Address reviewer comments * Add back stub for MXNet frontend
Philip Hyunsu Cho committed
-
Leyuan Wang committed
-
Aniket Rangrej committed
-
- 09 Oct, 2019 2 commits
-
-
* add valid autotune * fix pylint
Leyuan Wang committed
-
Animesh Jain committed
-
- 08 Oct, 2019 2 commits
-
-
Yizhi Liu committed
-
Animesh Jain committed
-
- 01 Oct, 2019 2 commits
-
-
* Add op argwhere * Move shape func to _algorithm.py * Add lint rule * Raise exception if rank is not supported * move argwhere to transform * Add argwhere example * Fix lint * Add 1-d support * cleanup * Add more dtype support * CR comment * Improve error message * Docs * raise exception
Wei Chen committed
-
* [topi] add ARM v8.2 udot (uint8) support * fix test case * fix common conv2d schedule * add back fp32_time in test * fix lint * fix doc, add support for int32_lanes=4, signed int * fix lint * add ic_bn % 4 checker in schedule
Yizhi Liu committed
-
- 30 Sep, 2019 1 commit
-
-
Tianqi Chen committed
-
- 28 Sep, 2019 1 commit
-
-
Tianqi Chen committed
-
- 27 Sep, 2019 1 commit
-
-
Tianqi Chen committed
-
- 26 Sep, 2019 1 commit
-
-
[TOPI][x86] Introduce schedule_injective_from_existing and unify external schedules for all targets (#3983) * Fix extern schedule for x86 * Register x86::schedule_extern * Fix * Fix * Replace extern.py with extern.h * Introduce new generic function schedule_injective_from_existing * Fix * Fix * Add back to C++ * Fix style * Injective schedule calls local schedule_injective_from_existing * Fix * Remove target arg from schedule_injective_from_existing * Fix docs * Try to fix unit test * Fix test * Fix other tests * Fix bug
Jon Soifer committed
-
- 25 Sep, 2019 1 commit
-
-
Additional schedules are making the conv2d.py file too large, so we'd like to move the spatial pack schedule to a dedicated file before introducing the NHWC schedule. No logic change in this patch.
黎明灰烬 committed
-