- 21 Feb, 2020 1 commit
-
-
* support cuda tensorcore subbyte int data type in auto tensorcore
* add license
* pass cpplint
* fix code review comments
* merge the int4/int1 codegen tutorial into the existing auto tensorcore tutorial
* using master's new API
* disable tuning when cuda is not enabled
* address cr comment
* do not run the tuning
* fix test failure
* fix cpplint error
* fix bool type reduction bug
* 1. fix an index bug 2. fix returned bytes value of int1/int4/uint4
* fix typo
Orion34C committed
-
- 20 Feb, 2020 3 commits
- 19 Feb, 2020 3 commits
-
-
* [REFACTOR] Polish ffi convention.
  - Remove the src/api, keep registration local to the c++ function.
  - Remove the api_internal as it is no longer needed.
* Update the codebase walk through
Tianqi Chen committed -
hcyang committed
-
Andrew committed
-
- 18 Feb, 2020 8 commits
-
-
Tianqi Chen committed
-
* [Relay] Expose FunctionGetAttr to Python * add test Co-authored-by: Jon Soifer <jonso@microsoft.com>
Jon Soifer committed -
* Basic test working * Almost all tests working. * all tests passing. * Fixed lint. * Improved Style.
Josh Fromm committed -
Tianqi Chen committed
-
Tianqi Chen committed
-
Tianqi Chen committed
-
Fixed bugs that occurred when using bitwise operators on floating-point expressions, and a further crash when using the operators <<, >>, and %. Added regression tests for both types of bug. (#4892)
pankratz committed -
- Move the related files to tvm.te - Move build_module.py to tvm.driver
Tianqi Chen committed
-
- 17 Feb, 2020 5 commits
-
-
* Fix bug in re-processing call node
* Add test
* Add to main
* temp changes to work from another machine
* fix rest of tests
* fix test_reuse_call_merge
* fix merge
Co-authored-by: Jon Soifer <jonso@microsoft.com>
Jon Soifer committed -
Tianqi Chen committed
-
Alex Gladkov committed
-
various minor editorial updates - style, grammar, typos.
Baden Hughes committed -
Zhi committed
-
- 16 Feb, 2020 3 commits
-
-
Tianqi Chen committed
-
* add additional switch to handle nested call node * Fix VM compiler for while loop with free var
masahi committed -
- Do not emit __shared__ etc. as part of the type for casting.
- Fix fp16 reduction kernels that fail with the compiler error: "no operator "+" matches these operands, volatile half + volatile half". This patch inserts casts to remove the volatile type qualifier following volatile loads (fp16 only). CUDA fp16 library headers should add volatile member functions.
- Update have_fp16 to include compute 6.1 GPUs, which do support fp16, although their fp16 throughput is low. Updated tests.
Signed-off-by: Wei Pan <weip@nvidia.com>
wpan11nv committed
-
- 15 Feb, 2020 3 commits
- 14 Feb, 2020 3 commits
-
-
* [QNN] Doc fix on quantize and convolution * update test
masahi committed -
- This allows better utilization of the memory bandwidth.
- Note that not all cases are vectorized for the fp16 datatype. For instance, when the size is not a multiple of 1024, the inner loop may be an expression that cannot be vectorized. In this case, a small inner loop is still beneficial for latency hiding.
Signed-off-by: Wei Pan <weip@nvidia.com>
wpan11nv committed -
- Move related files into the corresponding location as in C++ - Keep the top-level TVM API backward compatible to make minimum changes in topi
tqchen committed
-
- 13 Feb, 2020 6 commits
-
-
Co-Authored-By: Wei Chen <ipondering.weic@gmail.com>
Zhi committed -
Co-Authored-By: Wei Chen <ipondering.weic@gmail.com>
Zhi committed -
Zhi Chen committed
-
Add tuneable conv3d_ndhwc schedule
Alex Gladkov committed -
* [FRONTEND][TFLITE] Add support for TFLite_Detection_PostProcess
  This adds support for the custom operator TFLite_Detection_PostProcess, which is commonly used in object detection networks such as SSD Mobilenet. It only adds support for the case use_regular_nms = False.
  Change-Id: I819b253c0eb6f0fa55da65d2634e09359b888828
* Added a test for the tflite custom op
  Change-Id: Ie5baa092deae9a8bcffd2ebd9f6d346b90e58afd
* Removed trailing comma
  Change-Id: Ib08f02b5f1a59a883048bfb36e4321152cd2e7f2
* Added spaces between divide
  Change-Id: If1171fc03d211a809cedeb800804394972af4060
* Formatted comment
  Change-Id: I3ce7e69b8d2c73aec57369c1c64ea1eec07f087b
* Reduced line length in test
  Change-Id: I49eaafc3369070f8f3e85fbb965ad20972096c68
* Set random seed for test
  Change-Id: I542a787d11422ea83c52147b2cb1144fcef0dd77
* Fixes to style
  Change-Id: I2971b8ecebe08c882b2481a99f67cfbe515e0b1f
* Assert for incorrect number of inputs
  Change-Id: I393f3b3b62be73e427498d98456fb1d5a214e0af
* Change comparison to pass linting
  The linter was updated, so I needed to fix a small style issue as a result.
  Change-Id: Ia3c954565a00de92e7fb1912eae9ed9875d60c7c
mbarrett97 committed -
Move the related target modules into tvm.target.
API change:
- tvm.target.current_target -> tvm.target.Target.current
- tvm.datatype -> tvm.target.datatype
tqchen committed
-
- 12 Feb, 2020 4 commits
-
-
tqchen committed
-
tqchen committed
-
tqchen committed
-
* [REFACTOR][PY][API-CHANGE] establish tvm.ir, migrate corresponding relay files.
  This PR establishes tvm.ir and migrates the corresponding relay files into the new folder.
  API Change:
  - relay.Module -> tvm.IRModule
* Update with ADT
* Migrate transform
* address comments
* Migrate module
* Migrate json_compact
* Migrate attrs
* Move LoweredFunc to stmt temporarily
* temp migrate container
* Finish migrate container
Tianqi Chen committed
-
- 11 Feb, 2020 1 commit
-
-
hlu1 committed
-