- 27 Jan, 2020 1 commit
-
-
fixed a spelling mistake.
Kaiyan Chang committed
-
- 11 Jan, 2020 1 commit
-
-
* [Tutorial] Deploy Quantized Model on CUDA * update * update * address comments
Wuwei Lin committed
-
- 28 Dec, 2019 1 commit
-
-
zhuochen committed
-
- 26 Dec, 2019 1 commit
-
-
Wang Yucheng committed
-
- 23 Dec, 2019 1 commit
-
-
* Remove NNVM compiler
Tianqi Chen committed
-
- 15 Dec, 2019 1 commit
-
-
* Use the best tuner possible * Add comment denoting availability of better tuners * Fix typos and wording
miheer vaidya committed
-
- 26 Nov, 2019 1 commit
-
-
* [DOCS] Update main website to tvm.apache.org * Update jvm pom repo loc * Change the org to asf * Update ci addr to new one
Tianqi Chen committed
-
- 22 Nov, 2019 1 commit
-
-
tripley committed
-
- 18 Nov, 2019 1 commit
-
-
When getting cuda schedule passing single tensor seem to work but after changing target to "llvm" causes assert. Sending list on other hand makes both cuda and llvm targets happy. See https://discuss.tvm.ai/t/solved-simple-example-error-attributeerror-tensorslice-object-has-no-attribute-op/2245/3
miheer vaidya committed
-
- 14 Nov, 2019 1 commit
-
-
Zhi committed
-
- 12 Nov, 2019 1 commit
-
-
* WIP Run the TF tutorial on TF2 * Remove debugger statement. * Complete the support for TF2.0's `resize`. TF2.0 adds a `half_pixel_centers` attribute to the `resize` function in the image API. This commit completes the hooks in Relay's TF frontend. At the point of this commit, no new test yet. Also, this commit addresses solely the `resize` change. Other commits address other changes in TF2.0. * Support TF2.0 in the tutorial by using the compat API. This looks cleaner than trying to detect the TF version. * Use the TF compat API, so as to support TF2.0. This is a direct change, relying on the compat API provided by the TF team. This code will last as long as the compat API exists, so a "proper" support for TF1.x and 2.x will require more work in some future. * Partial support for EXPLICIT padding introduced in TF2.0. Explicit padding is a special case in TF2.0 (see reference linked below). Some models are serialized with that mode, and break TF support in TVM. Support is *partial* as EXPLICIT falls back to set padding on the Relay op, which only supports 2 values. At some point, padding may need to be extended to support 4 values, but that is out of scope of this support commit. Reference on EXPLICIT padding: https://github.com/tensorflow/tensorflow/commit/ec81825aaf7e848d9f8ddffdf1e0d20aebe9172c#diff-1d1c0bb0a880f85b6164f71dbb2f446e * Guard on checking for optional TF2.0 attribute. * Do not expect Relay to implement TF-specific attributes. The `half_pixel_centers` attribute is a new feature in TF2.0. Earlier commits of mine mistakenly introduce them in the Relay API. This is probably not what Relay is expected to support, and the semantics of `half_pixel_centers` is unclear (to me, at least) at this point. * Remove unclear comment. CR https://github.com/dmlc/tvm/pull/4104#discussion_r338705742 Addresses #4104 * Changes after review. Complying without understanding the rationale for now. * Fix the arguments set mistakenly. An argument ignored for the wrong operation.
Eric Platon committed
-
- 11 Nov, 2019 1 commit
-
-
* Add pass manager tutorial * fix some examples * retrigger ci * Update tutorials/dev/relay_pass_infra.py Co-Authored-By: 雾雨魔理沙 <lolisa@marisa.moe> * Add ToANormalForm link
Zhi committed
-
- 09 Nov, 2019 1 commit
-
-
* Add Auto TensorCore TensorCore Unit Test * Rebase to tvm master branch & Add auto tensor core * Code Refine * Add tensor core switch by pragma * Add pragma in tensor core example code * Get real tile size to replace hard coded 16 * support more than 2 dimensions (e.g. batchmatmul) for buffer bind scope * support batch matmul * Move cuda env check to tensor_core.cc * Coderefine for tensor_core.cc * Refine comments * Some refinements of code and comment * Update TensorCore UT to pass the CPU test * remove redundant code * matmul's storage align for different layout * Add support for differenct position of type cast * Add formal tutorial for auto tensorcore codegen * move tensorcore check up to tutorial code * code and doc refine * comment out tune_and_evaluate in tutorial * fix cpplint error
Minmin Sun (孙敏敏) committed
-
- 06 Nov, 2019 1 commit
-
-
Tianqi Chen committed
-
- 31 Oct, 2019 1 commit
-
-
* [CI] Move gpu docker binary to cuda10 * Fix the gcn tutorial
Tianqi Chen committed
-
- 24 Oct, 2019 1 commit
-
-
* add tensor core support * avoid memory bank conflict * fix thread sync & better performance * better performance * add schedule test for conv2d * extend into BatchMatMul * support config fragment shape and layout using intrinsic * add TensorCore tutorial * add int support and fix lint * address comment * add 32*16*8 TensorCore test * fix wmma include logic
Siyuan Feng committed
-
- 11 Oct, 2019 1 commit
-
-
* update build_gcn.py tutorial updates * support bias in GCN layer * download pretrained gcn model * verify model accuracy * use time_evaluator to measure runtime * fix adding bias in gcn layer * remove printing output * fix small bug * add DGL-PyTorch comparison into the build_gcn tutorial * add accuracy testing * adjust import order * handle different dgl versions * update number for dgl version checking
Chien-Yu Lin committed
-
- 08 Oct, 2019 1 commit
-
-
if n_trial is larger then config space.
Attila Dusnoki committed
-
- 07 Oct, 2019 1 commit
-
-
fix some typos
Bohan Hou committed
-
- 29 Sep, 2019 1 commit
-
-
* [AUTOTVM][DOCS] Add a link to autoTVM tutorial to direct the details of building NN with relay * [AUTOTVM][DOCS] Add a link to autoTVM tutorial to direct the details of building NN with relay
Neo Chien committed
-
- 28 Sep, 2019 1 commit
-
-
Tianqi Chen committed
-
- 17 Sep, 2019 1 commit
-
-
Junru Shao committed
-
- 07 Sep, 2019 1 commit
-
-
fix lld
Peter Yeh committed
-
- 05 Sep, 2019 1 commit
-
-
* Update from_source.rst * Update deploy_ssd_gluoncv.py
Haibin Lin committed
-
- 19 Aug, 2019 1 commit
-
-
[CoreML] Solve CoreML frontend issue of image scaler and padding so that Mobilenet mlmodel can work correctly. (#3800)
Zhao Wu committed
-
- 13 Aug, 2019 1 commit
-
-
pzq committed
-
- 07 Aug, 2019 1 commit
-
-
* add build gcn tutorial * add dgl to docker file * add dgl to docker file * Apply suggestions from code review Co-Authored-By: 雾雨魔理沙 <lolisa@marisa.moe> * add dgl to docker file * rerun checks * Revert "add build gcn tutorial" This reverts commit dbe8b5f0e02a13fdd586a9faa58fd1326653afb0. * resolve git issue * resolve git issue * resolve git issue * apply marisa's comment
Yulun Yao committed
-
- 31 Jul, 2019 2 commits
-
-
* intel graphics conv2d bugs fixed for inception_v3 * intel conv2d api updated, nn input size 4 condition added * review addressed * move conv_tags to attributes * ssd doc updated * address comment
Leyuan Wang committed -
Balint Cristian committed
-
- 27 Jul, 2019 1 commit
-
-
peterjc123 committed
-
- 06 Jul, 2019 2 commits
-
-
* [relay][frontend] Return Module from get_workload * pass entry_func to autotvm * disable tune * add property to module * mod.entry_func to main * .main -> mod["main"] * fix
Zhi committed -
- Install OpenCL headers - Set ANDROID_HOME environment variable
Ruslan Baratov committed
-
- 05 Jul, 2019 1 commit
-
-
Ruslan Baratov committed
-
- 02 Jul, 2019 1 commit
-
-
Zhi committed
-
- 30 Jun, 2019 1 commit
-
-
Alexander Pivovarov committed
-
- 27 Jun, 2019 1 commit
-
-
Alexander Pivovarov committed
-
- 20 Jun, 2019 1 commit
-
-
Zhi committed
-
- 17 Jun, 2019 1 commit
-
-
Zhi committed
-
- 12 Jun, 2019 1 commit
-
-
Marcus Shawcroft committed
-
- 10 Jun, 2019 1 commit
-
-
* Support x86 dilation conv2d and improve multi-batch conv2d * Fix lint
Yao Wang committed
-