Commits · 6798ba80d288e7c6132b30606b3cb70812579fe8 · wenyuanbo / tic

30 Jan, 2020 1 commit
- Make sure to visit the arguments of inlined functions (#4783) · 6798ba80
  abergeron committed Jan 29, 2020
  
  6798ba80 Browse Files
29 Jan, 2020 2 commits
- [AUTOTVM] Fix a bug in generating the search space (#4779) · 1b8522e4
```
- Do not use numpy.prod which ignores integer (64 bits) overflows.
  This leads to an incorrect number of points in the search space.
```
  wpan11nv committed Jan 28, 2020
  1b8522e4 Browse Files
- [Python] Replace os.path.exists with try...except...else (#4784) · 3827ccb5
  hlu1 committed Jan 29, 2020
  
  3827ccb5 Browse Files
28 Jan, 2020 2 commits
- [PassManager] Implement pass manager tracing API (#4782) · 9c383f64
```
* Implement pass tracing API

* Set is_before correctly

* Add docs for trace function

* Fix lint

* Remove PDB

* Ensure trace_func is set before calling

* Fix conditional
```
  Jared Roesch committed Jan 28, 2020
  9c383f64 Browse Files
- Safe remove tmpdir (#4781) · d54036a9
  Cody Yu committed Jan 27, 2020
  
  d54036a9 Browse Files
27 Jan, 2020 4 commits
- [Relay][Frontend][ONNX] Broadcast condition, x, and y for Where op (#4774) · de919cbd
```
* ONNX frontend broadcast condition

* fix

* fix style

Co-authored-by: Jon Soifer <jonso@microsoft.com>
```
  Jon Soifer committed Jan 27, 2020
  de919cbd Browse Files
- properly extract error type from windows error message (#4780) · f71a10c5
```
Co-authored-by: Jon Soifer <jonso@microsoft.com>
```
  Jon Soifer committed Jan 28, 2020
  f71a10c5 Browse Files
- [Build] Explicitly link to cublasLt if it exists (#4776) · 00ec7f9c
```
* Explicitly link to cublasLt

* Only link cublasLt if it's found

Co-authored-by: Jon Soifer <jonso@microsoft.com>
```
  Jon Soifer committed Jan 28, 2020
  00ec7f9c Browse Files
- Update tune_simple_template.py (#4778) · 9056fc40
```
fixed a spelling mistake.
```
  Kaiyan Chang committed Jan 26, 2020
  9056fc40 Browse Files
25 Jan, 2020 1 commit
- Bump prebuilt-image version in demo dockerfile (#4770) · bef00f7e
  HUAN-PING SU committed Jan 26, 2020
  
  bef00f7e Browse Files
24 Jan, 2020 5 commits
- [Bugfix][Frontend][TF] Fix incorrect calculations in tf SLICE (#4518) · 9bd2c7b4
```
* fix formula for calculating end indices when size[i] == -1
* add a test case for size[i] == -1
* discard expanding dimension of begin_value & end_value since
  it is needed only if you pass them as scalars not as tensors.
* discard 'slice_tensor' variable so that implementation matches
  the tf parser pattern
```
  Ina Dobreva committed Jan 23, 2020
  9bd2c7b4 Browse Files
- add missing nullptr check (#4773) · 26621257
  masahi committed Jan 24, 2020
  
  26621257 Browse Files
- [TOPI] Remove cpp upsampling and resize op (#4769) · 69d2f9bd
```
* remove cpp upsampling

* remove cpp resize
```
  masahi committed Jan 23, 2020
  69d2f9bd Browse Files
- Fix Tensorflow conv3d pad bug, add non-cubic data and kernel tests (#4772) · 1ae44cf0
  Alex Gladkov committed Jan 23, 2020
  
  1ae44cf0 Browse Files
- [Doc] TVM_REGISTER_API -> TVM_REGISTER_GLOBAL (#4768) · 4d4346d1
  hlu1 committed Jan 24, 2020
  
  4d4346d1 Browse Files
23 Jan, 2020 2 commits

[VTA] Support network which have no unique operator as start/stop name for graph pack. (#4703) · b9328d02

* [VTA] Support network which have no unique operator as start/stop name
for graph pack.

[Issue]
  Current vta use 'start' and 'stop' name to define the pack start point
  and end point, but this method not work for these network which have
  no 2 unique operator as  start point and stop point.

[Solution]
  In this solution we give 2 addtional parameters start_name_indx and
  stop_name_indx to make vta pack logic work with the said network,
  for exampl for following networks which have no unique operator,

  %0 = nn.add
  %1 = nn.conv2d
  %2 = nn.batch_norm
  %3 = nn.leaky_relu
  %4 = nn.add
  %5 = nn.conv2d
  %6 = nn.batch_norm
  %7 = nn.leaky_relu
  %8 = nn.add

  with this solution we can use following parameter format to make
  vta work on it.

  relay_prog = graph_pack(
                //....
                start_name="nn.add",
                stop_name="nn.add",
                start_name_idx=0,
                stop_name_idx=4)

  to apply on new network, by printing the network we can get index information like following.

  print(mod.astext(show_meta_data=False))
  relay_prog = graph_pack(mod
                          ...
                          start_name="nn.add",
                          stop_name="nn.add",
                          start_name_idx=0,
                          stop_name_idx=4)

* address review comments and fix index count bug

issue:
when do print(mod), the output not only the Call is also have other type
like Var, need add logic to count all except meta.

solution:
add related logic

* address review comments.

* address review comments

* add more detail comments.

committed Jan 23, 2020

b9328d02 Browse Files

pooling.cc improvements (#4767) · 23ba37d4
Alexander Pivovarov committed Jan 22, 2020

23ba37d4 Browse Files

22 Jan, 2020 4 commits
- Improve CUDA conv2d_transpose_nchw (#4762) · 4f92cfe5
```
- combine pad and dilate;
- fix for the issue https://discuss.tvm.ai/t/compile-error-for-cuda-target/4164
- fix for the issue https://github.com/apache/incubator-tvm/pull/4472
```
  Alex Gladkov committed Jan 22, 2020
  4f92cfe5 Browse Files
- Remove run_infer_type duplicates (#4766) · cf3e7865
  Alexander Pivovarov committed Jan 22, 2020
  
  cf3e7865 Browse Files
- Fix padding in pooling op (#4738) · 4dbe4d98
  Alexander Pivovarov committed Jan 21, 2020
  
  4dbe4d98 Browse Files
- [REFACTOR] driver.h -> driver_api.h (#4760) · fc1a1d83
```
"driver" normally refers to the "main" function.
Rationale: the header exposes set of APIs to drive compilation
and should be named as driver api to best reflect its usage.
```
  Tianqi Chen committed Jan 21, 2020
  fc1a1d83 Browse Files
21 Jan, 2020 4 commits

[Docs] Bring Your Own Codegen Guide -- Part 2 (#4718) · dcb556da
```
* BYOC Tutorial -- part 2

* Fix comments

* Address comments
```
Cody Yu committed Jan 21, 2020
dcb556da Browse Files
[INFO] Add .asf.yaml for github info (#4761) · 1d40dc0f
Tianqi Chen committed Jan 21, 2020

1d40dc0f Browse Files
[REFACTOR] top->te (#4759) · 55d81925
```
Bring up namespace te -- Tensor expression language DSL.
```
Tianqi Chen committed Jan 21, 2020
55d81925 Browse Files

[REFACTOR] Establish printer in the source folder (#4752) · e4d817d4

* [REFACTOR] Establish printer in the source folder.

As we move towards the unified IR, we will eventually want to build a unified
printers for both relay and TIR.

This PR isolate the printer component into a separate folder in src as a first step.

- Refactored the Doc DSL using Object, clean up APIs.
- Isolate out the meta data into a header.
- move printer into relay_text_printer, add comments about further TODos.

* Rename NodePrinter -> ReprPrinter to distinguish it from other printers

committed Jan 20, 2020

e4d817d4 Browse Files

20 Jan, 2020 3 commits
- Expose relay BindParamsByName to Python (#4751) · f8f75ca2
```
* expose BindParamByName to python

* fixed alpha equal test
```
  masahi committed Jan 21, 2020
  f8f75ca2 Browse Files
- [REFACTOR][TYPE] Finish move all types to IR. (#4746) · 2c0c1849
```
* [REFACTOR][TYPE] Finish move all types to IR.

- Move definition of Ref and TensorType to ir
- Move type_functor.h to public header.
- Rename RefType -> RelayRefType for clarity.

* Add atol
```
  Tianqi Chen committed Jan 20, 2020
  2c0c1849 Browse Files
- Add CUDA conv2d for NHWC layout (#4737) · ee0af843
  Alex Gladkov committed Jan 19, 2020
  
  ee0af843 Browse Files
19 Jan, 2020 3 commits

[REFACTOR][CODEGEN] codegen->target, build_module->driver (#4742) · 33b0831c

This PR moves the codegen related code into the target folder,
as they are target specific functionalities.

We also adopt the term "compiler driver" in common compiler infra
such as rust, GHC and clang.
As a result, build_module is moved into the driver folder.

committed Jan 19, 2020

33b0831c Browse Files

Fix demo dockerfile build failed (#4744) · 992b5b54
HUAN-PING SU committed Jan 18, 2020

992b5b54 Browse Files

[REFACTOR] Establish tir (#4740) · cf59b206

TIR is the new namespace for low-level IR
for tensor-level optimizations and loop transformations.

This PR establishes the namespace and files.

- lowered_func.h,buffer.h,data_layout.h -> tir/buffer.h,tir/data_layout.h,tir/lowered_func.h
- ir.h -> tir/expr.h, tir/stmt.h
- ir_functor_ext.h -> tir/expr_functor.h, tir/stmt_functor.h

committed Jan 18, 2020

cf59b206 Browse Files

18 Jan, 2020 3 commits

Fix dense (#4728) · 7e392019
Haichen Shen committed Jan 18, 2020

7e392019 Browse Files
[runtime][refactor] Unify vm and interpreter objects (#4693) · acbf8851
```
* unify vm and interpreter objects

* move closure back vm

* adt/closure back to vm.adt/vm.closure

* closure base
```
Zhi committed Jan 18, 2020
acbf8851 Browse Files

[CodeGen][CUDA] Improve CUDA vectorizer (#4736) · 2630ffcb

- Fixes issues to enable fp16 vectorizer. Now correct packing and
  unpacking CUDA code will be emitted. Enabled more unit tests.

- Do not emit code to read the first lane from an undef variable

  int _3;
  _3 = _3 & ~(0x000000ff << 0) | ...

  and emit the following code instead:

  _3 = (((0x000000ff & (_1 >> 0))+(0x000000ff & (_2 >> 0))) << 0);

  Note that nvcc 10.2 is forgiving and emits the same code for both cases.
  A warning appears in test_codegen_cuda.py.

Signed-off-by: Wei Pan <weip@nvidia.com>

committed Jan 17, 2020

2630ffcb Browse Files

17 Jan, 2020 6 commits

[VTA][TSIM] Enable TSIM CI Testing (#4407) · 2738eddf

* Update task_python_vta.sh

* install sbt=1.1.1 with apt-get

* update verilator_opt

* install verilator with major version 4.0

* disable multi-threading for now

* bug fix for correcting uop fetch address in LoadUop module

* bug fix for correcting uop fetch address in LoadUop module

* adjustment to read from dram_offset

* enable USE_THREADS with verilator 4.x

* DEBUG: try avoid core dump with verilator 4.x

* bug fix in LoadUop module

* log mega cycles in tsim

* download cat.png to avoid fetching in each run

* bug fix in LoadUop module

* solve dram_even/sram_even issue

* bug fix

* introduce scalalint in ci

* speedup tsim in ci

* bug fix

* lint scala code before building

* disable multi-threading

* split fsim/tsim script

* update Jenkins settings

* duplicate task_python_vta_fsim.sh as task_python_vta.sh for now

Co-authored-by: Thierry Moreau <tmoreau@octoml.ai>

committed Jan 17, 2020

2738eddf Browse Files

[REFACTOR] Get rid of packed_func_ext. (#4735) · 2f8a01f7

Move the conversion extensions to the specific class definitions
so that we longer need to include packed_func_ext.

committed Jan 17, 2020

2f8a01f7 Browse Files

[x86 schedule] Fallback schedule for Int8 depthwise. (#4733) · 703ed9b7
Animesh Jain committed Jan 17, 2020

703ed9b7 Browse Files

[TOOLS] JSON upgrader to upgrade serialized json. (#4730) · 67b97e5a

During Unified IR refactor we will change the structure of IRs.
This will cause certain historical modules stored via json no longer
able to be loaded by the current version.

This PR introduces a backward compatible layer to try its best effort
to upgrade json from previous version(this case 0.6) to the current version.
We mainly aim to support update of high-level ir(relay).

committed Jan 17, 2020

67b97e5a Browse Files

[QNN] Conv2D type checking for kernel per-channel scales. (#4732) · a5bb789a

* [QNN] Conv2D type checking for kernel per-channel scales.

* Address commments.

* Address comments.

* - Adding safety checks for downcasts.

Co-authored-by: shoubhik <shoubhikbhatti@gmail.com>

committed Jan 17, 2020

a5bb789a Browse Files

[VTA] Update Jenkinsfile for VTA test with TSIM (#4734) · 03ffb01c
```
* [VTA] Update Jenkinsfile for VTA test with TSIM

* duplicate task_python_vta.sh multiple copies for now
```
Liangfu Chen committed Jan 17, 2020
03ffb01c Browse Files