- 29 Jan, 2020 1 commit
-
-
- Do not use numpy.prod which ignores integer (64 bits) overflows. This leads to an incorrect number of points in the search space.
wpan11nv committed
-
- 28 Jan, 2020 1 commit
-
-
Cody Yu committed
-
- 15 Jan, 2020 2 commits
-
-
This reverts commit dcf7fbf1.
Haichen Shen committed -
* [REFACTOR][IR] Unify IntImm and UIntImm This PR unifies UIntImm and IntImm to simplify the codebase. Unsigned integer constants will also be stored as IntImm. For uint constant that does not fit into int64(rare case), we introduced an intrinsic tvm_big_uint_imm to construct such intgers by its lower and higher 32bits. * [REFACTOR][IR] Remove UIntImm to use IntImm * rename big->large
Tianqi Chen committed
-
- 11 Jan, 2020 1 commit
-
-
* Add output_padding to generic * Add output_padding to the reference impl * Add output_padding to arm_cpu * Add output_padding to the test * Add output_padding for cuda * Add output_padding for x86 * Make use of the new output_padding argument in Relay * Adjust conv2d_transpose Relay test * Fix lint errors * Fix the VTA declaration of conv2d_transpose * support for output padding in conv2d transpose * some output padding will break IR pass * Fix new conv2d_transpose test * Update tophub * Fix conv1d output_padding too. * Fix the conv1d_transpose reference function. * Fix the cuda impl * fix the topi test for conv1d * Update the versions in tophub.py Co-authored-by: Thierry Moreau <tmoreau@octoml.ai>
abergeron committed
-
- 10 Jan, 2020 1 commit
-
-
Xingyu Zhou committed
-
- 09 Jan, 2020 1 commit
-
-
* [AutoTVM] Use vm compile in extracting task from relay * update * restructure vm compiler to reduce task extraction time * x * fix * update doc * udpate doc * lint
Haichen Shen committed
-
- 27 Dec, 2019 1 commit
-
-
Wang Yucheng committed
-
- 26 Dec, 2019 1 commit
-
-
* [AutoTVM][TOPI] NHWC conv2d templates (spatial pack) for ARM As some frontends (tflite for example) are using NHWC as the default layout, we are enabling NHWC schedule templates in TOPI and AutoTVM. * some comments fix
黎明灰烬 committed
-
- 22 Dec, 2019 1 commit
-
-
* [TEST] Remove nnvm related code in topi and test script * Remove docs dep
Tianqi Chen committed
-
- 18 Dec, 2019 1 commit
-
-
Alex Gladkov committed
-
- 16 Dec, 2019 1 commit
-
-
Cody Yu committed
-
- 26 Nov, 2019 1 commit
-
-
* select model with the most tuned schedules * change detect empty map method * modify model description for load_reference_log
Xingyu Zhou committed
-
- 21 Nov, 2019 1 commit
-
-
Previously, we would rely on the later phases to error out (often for using too much shared memory). This enables the checks on the IR that already exist for CUDA and OpenCL also for ROCm.
Thomas Viehmann committed
-
- 19 Nov, 2019 1 commit
-
-
Yizhi Liu committed
-
- 16 Nov, 2019 1 commit
-
-
* AutoTVM: selecting tuning templates when extracting task Make the procedure of trying new templates easier. Test: tests/python/relay/test_autotvm_task_extraction.py * Use dict to match key for topi ops * fix lint issue * be more pythonic :)
黎明灰烬 committed
-
- 15 Nov, 2019 1 commit
-
-
Alex Gladkov committed
-
- 11 Nov, 2019 1 commit
-
-
* Add shape functions * Fix get_const_tuple * Fix cpplint * Fix pylint * Fix pylint * rebase and fix * Check Any for infer type * Fix expand_dim shape func for zero rank input * Fix pooling infer type * Address comment * Register layout transform attr
Yao Wang committed
-
- 07 Nov, 2019 1 commit
-
-
* Batch matmul tuning running but with errors. * Default x86 schedule as good as before. * Code Cleanup * Remove unused argument. * improved template documentation. * Silly lint fix * Removed leftover comment. * Moved cfg declaration to schedule for batch_matmul * Moved x86 dense cfg declaration to schedule. * lint fix * Removed duplicate cfg declaration in dense. * Reverted changes to dense.
Josh Fromm committed
-
- 29 Oct, 2019 1 commit
-
-
* Optimize task extraction speed * correct pylint errors * Delete unused function * remove unnecessary argument * resolve code review comments * corrent cpp lint errors * remove one more graph_json return value * fix test bugs
LiangHao committed
-
- 24 Oct, 2019 1 commit
-
-
* support conv2d HWCN in AutoTVM and Relay * fix lint * fix comments and unit tests
Cody Hao Yu committed
-
- 22 Oct, 2019 1 commit
-
-
Cody Hao Yu committed
-
- 03 Oct, 2019 1 commit
-
-
Jon Soifer committed
-
- 01 Oct, 2019 1 commit
-
-
Cody Hao Yu committed
-
- 28 Sep, 2019 1 commit
-
-
Tianqi Chen committed
-
- 18 Sep, 2019 1 commit
-
-
Neo Chien committed
-
- 16 Sep, 2019 3 commits
-
-
Animesh Jain committed
-
* improve conv2d_transpose x86 performance by reusing conv2d schedule * parallelize across batches to make large-batch conv2d and conv2d_transpose faster * improve doc for autotvm.task.space.FallbackConfigEntity.fallback_with_reference_log * add fallback schedule for schedule_conv2d_transpose_nchw_cuda * fix pylint * fix pylint * unify conv2d_transpose declaration in topi.nn and topi.x86
Yuwei Hu committed -
* Fix graph tuner benchmarking layout transform * Add test
Yao Wang committed
-
- 15 Sep, 2019 1 commit
-
-
* Refine policies for define_split - Rename policy "all" to "factors" - Add policy "verbose" and "power2" * Refine search space * add doc
Cody Hao Yu committed
-
- 07 Sep, 2019 2 commits
-
-
noituIover committed
-
* update lint * lint fixed * lint updated * lint fixed * lint fixed * lint fixed * updates * add intel graphics as a package * remove print info * depthwise conv2d schedule added for intel graphics * asdf * fix lint * fix lint * fix ci * add channels
Leyuan Wang committed
-
- 05 Sep, 2019 1 commit
-
-
kice committed
-
- 28 Aug, 2019 1 commit
-
-
* [AutoTVM] Fix database APIs * Refactor the byte conversion
Cody Hao Yu committed
-
- 11 Aug, 2019 2 commits
-
-
* Improve graph tuner dealing with Tuple * Add test case * Move some data out of _base.py * Fix lint
Yao Wang committed -
Lianmin Zheng committed
-
- 06 Aug, 2019 1 commit
-
-
* Fix the tile_rx and tile_ry issue. Note that this patch depends on pull request #9 in tvm-distro.
mingwayzhang committed
-
- 02 Aug, 2019 1 commit
-
-
* [AutoTVM] Fix hang/crash issues on feature extraction * Update xgboost_cost_model.py * fix lint
Lianmin Zheng committed
-
- 29 Jul, 2019 1 commit
-
-
* hardware refactor for increased FPGA coverage, small optimizations * fix header * cleaning up parameters that won't be needed for now * streamlining makefile, and simplifying tcl scripts * moving parameter derivation into pkg_config.py, keeping tcl scripts lightweight * refactoring tcl script to avoid global variables * deriving AXI signals in pkg_config.py * unifying address map definition for hardware and software drivers * single channel design for ultra96 to simplify build * enable alu by default, no mul opcode for now * hardware fix * new bitstream; vta version * avoid error when env variable is not set * ultra96 cleanup * further cleaning up tcl script for bitstream generation * preliminary rpc server support on ultra96 * rpc server tracker scripts * ultra96 ldflag * ultra96 support * ultra96 support * cleanup line * cmake support for ultra96 * simplify memory instantiation * cleaning up IP parameter initialization * fix queue instantiation * 2019.1 transition * fix macro def * removing bus width from config * cleanup * fix * turning off testing for now * cleanup ultra96 ps insantiation * minor refactor * adding comments * upgrading to tophub v0.6 * model used in TVM target now refers to a specific version of VTA for better autoTVM scheduling * revert change due to bug * rename driver files to be for zynq-type devices * streamlining address mapping * unifying register map offset values between driver and hardware generator * rely on cma library for cache flush/invalidation * coherence management * not make buffer packing depend on data types that can be wider than 64bits * refactor config derivation to minimize free parameters * fix environment/pkg config interaction * adding cfg dump property to pkgconfig: * fix rpc reconfig * fix spacing * cleanup * fix spacing * long line fix * fix spacing and lint * fix line length * cmake fix * environment fix * renaming after pynq since the driver stack relies on the pynq library - see pynq.io * update doc * adding parameterization to name * space * removing reg width * vta RPC * update doc on how to edit vta_config.json * fix path * fix path
Thierry Moreau committed
-
- 19 Jul, 2019 1 commit
-
-
* Improve boundary nodes in graph tuner * Limit output node number * Fix test * Improve warning. * Fix test
Yao Wang committed
-