Commits · bef00f7ecc4d04828612f3f18837294da47f2841 · wenyuanbo / tic

25 Jan, 2020 1 commit
- Bump prebuilt-image version in demo dockerfile (#4770) · bef00f7e
  HUAN-PING SU committed Jan 26, 2020
  
  bef00f7e Browse Files
24 Jan, 2020 5 commits
- [Bugfix][Frontend][TF] Fix incorrect calculations in tf SLICE (#4518) · 9bd2c7b4
```
* fix formula for calculating end indices when size[i] == -1
* add a test case for size[i] == -1
* discard expanding dimension of begin_value & end_value since
  it is needed only if you pass them as scalars not as tensors.
* discard 'slice_tensor' variable so that implementation matches
  the tf parser pattern
```
  Ina Dobreva committed Jan 23, 2020
  9bd2c7b4 Browse Files
- add missing nullptr check (#4773) · 26621257
  masahi committed Jan 24, 2020
  
  26621257 Browse Files
- [TOPI] Remove cpp upsampling and resize op (#4769) · 69d2f9bd
```
* remove cpp upsampling

* remove cpp resize
```
  masahi committed Jan 23, 2020
  69d2f9bd Browse Files
- Fix Tensorflow conv3d pad bug, add non-cubic data and kernel tests (#4772) · 1ae44cf0
  Alex Gladkov committed Jan 23, 2020
  
  1ae44cf0 Browse Files
- [Doc] TVM_REGISTER_API -> TVM_REGISTER_GLOBAL (#4768) · 4d4346d1
  hlu1 committed Jan 24, 2020
  
  4d4346d1 Browse Files
23 Jan, 2020 2 commits

[VTA] Support network which have no unique operator as start/stop name for graph pack. (#4703) · b9328d02

* [VTA] Support network which have no unique operator as start/stop name
for graph pack.

[Issue]
  Current vta use 'start' and 'stop' name to define the pack start point
  and end point, but this method not work for these network which have
  no 2 unique operator as  start point and stop point.

[Solution]
  In this solution we give 2 addtional parameters start_name_indx and
  stop_name_indx to make vta pack logic work with the said network,
  for exampl for following networks which have no unique operator,

  %0 = nn.add
  %1 = nn.conv2d
  %2 = nn.batch_norm
  %3 = nn.leaky_relu
  %4 = nn.add
  %5 = nn.conv2d
  %6 = nn.batch_norm
  %7 = nn.leaky_relu
  %8 = nn.add

  with this solution we can use following parameter format to make
  vta work on it.

  relay_prog = graph_pack(
                //....
                start_name="nn.add",
                stop_name="nn.add",
                start_name_idx=0,
                stop_name_idx=4)

  to apply on new network, by printing the network we can get index information like following.

  print(mod.astext(show_meta_data=False))
  relay_prog = graph_pack(mod
                          ...
                          start_name="nn.add",
                          stop_name="nn.add",
                          start_name_idx=0,
                          stop_name_idx=4)

* address review comments and fix index count bug

issue:
when do print(mod), the output not only the Call is also have other type
like Var, need add logic to count all except meta.

solution:
add related logic

* address review comments.

* address review comments

* add more detail comments.

committed Jan 23, 2020

b9328d02 Browse Files

pooling.cc improvements (#4767) · 23ba37d4
Alexander Pivovarov committed Jan 22, 2020

23ba37d4 Browse Files

22 Jan, 2020 4 commits
- Improve CUDA conv2d_transpose_nchw (#4762) · 4f92cfe5
```
- combine pad and dilate;
- fix for the issue https://discuss.tvm.ai/t/compile-error-for-cuda-target/4164
- fix for the issue https://github.com/apache/incubator-tvm/pull/4472
```
  Alex Gladkov committed Jan 22, 2020
  4f92cfe5 Browse Files
- Remove run_infer_type duplicates (#4766) · cf3e7865
  Alexander Pivovarov committed Jan 22, 2020
  
  cf3e7865 Browse Files
- Fix padding in pooling op (#4738) · 4dbe4d98
  Alexander Pivovarov committed Jan 21, 2020
  
  4dbe4d98 Browse Files
- [REFACTOR] driver.h -> driver_api.h (#4760) · fc1a1d83
```
"driver" normally refers to the "main" function.
Rationale: the header exposes set of APIs to drive compilation
and should be named as driver api to best reflect its usage.
```
  Tianqi Chen committed Jan 21, 2020
  fc1a1d83 Browse Files
21 Jan, 2020 4 commits

[Docs] Bring Your Own Codegen Guide -- Part 2 (#4718) · dcb556da
```
* BYOC Tutorial -- part 2

* Fix comments

* Address comments
```
Cody Yu committed Jan 21, 2020
dcb556da Browse Files
[INFO] Add .asf.yaml for github info (#4761) · 1d40dc0f
Tianqi Chen committed Jan 21, 2020

1d40dc0f Browse Files
[REFACTOR] top->te (#4759) · 55d81925
```
Bring up namespace te -- Tensor expression language DSL.
```
Tianqi Chen committed Jan 21, 2020
55d81925 Browse Files

[REFACTOR] Establish printer in the source folder (#4752) · e4d817d4

* [REFACTOR] Establish printer in the source folder.

As we move towards the unified IR, we will eventually want to build a unified
printers for both relay and TIR.

This PR isolate the printer component into a separate folder in src as a first step.

- Refactored the Doc DSL using Object, clean up APIs.
- Isolate out the meta data into a header.
- move printer into relay_text_printer, add comments about further TODos.

* Rename NodePrinter -> ReprPrinter to distinguish it from other printers

committed Jan 20, 2020

e4d817d4 Browse Files

20 Jan, 2020 3 commits
- Expose relay BindParamsByName to Python (#4751) · f8f75ca2
```
* expose BindParamByName to python

* fixed alpha equal test
```
  masahi committed Jan 21, 2020
  f8f75ca2 Browse Files
- [REFACTOR][TYPE] Finish move all types to IR. (#4746) · 2c0c1849
```
* [REFACTOR][TYPE] Finish move all types to IR.

- Move definition of Ref and TensorType to ir
- Move type_functor.h to public header.
- Rename RefType -> RelayRefType for clarity.

* Add atol
```
  Tianqi Chen committed Jan 20, 2020
  2c0c1849 Browse Files
- Add CUDA conv2d for NHWC layout (#4737) · ee0af843
  Alex Gladkov committed Jan 19, 2020
  
  ee0af843 Browse Files
19 Jan, 2020 3 commits

[REFACTOR][CODEGEN] codegen->target, build_module->driver (#4742) · 33b0831c

This PR moves the codegen related code into the target folder,
as they are target specific functionalities.

We also adopt the term "compiler driver" in common compiler infra
such as rust, GHC and clang.
As a result, build_module is moved into the driver folder.

committed Jan 19, 2020

33b0831c Browse Files

Fix demo dockerfile build failed (#4744) · 992b5b54
HUAN-PING SU committed Jan 18, 2020

992b5b54 Browse Files

[REFACTOR] Establish tir (#4740) · cf59b206

TIR is the new namespace for low-level IR
for tensor-level optimizations and loop transformations.

This PR establishes the namespace and files.

- lowered_func.h,buffer.h,data_layout.h -> tir/buffer.h,tir/data_layout.h,tir/lowered_func.h
- ir.h -> tir/expr.h, tir/stmt.h
- ir_functor_ext.h -> tir/expr_functor.h, tir/stmt_functor.h

committed Jan 18, 2020

cf59b206 Browse Files

18 Jan, 2020 3 commits

Fix dense (#4728) · 7e392019
Haichen Shen committed Jan 18, 2020

7e392019 Browse Files
[runtime][refactor] Unify vm and interpreter objects (#4693) · acbf8851
```
* unify vm and interpreter objects

* move closure back vm

* adt/closure back to vm.adt/vm.closure

* closure base
```
Zhi committed Jan 18, 2020
acbf8851 Browse Files

[CodeGen][CUDA] Improve CUDA vectorizer (#4736) · 2630ffcb

- Fixes issues to enable fp16 vectorizer. Now correct packing and
  unpacking CUDA code will be emitted. Enabled more unit tests.

- Do not emit code to read the first lane from an undef variable

  int _3;
  _3 = _3 & ~(0x000000ff << 0) | ...

  and emit the following code instead:

  _3 = (((0x000000ff & (_1 >> 0))+(0x000000ff & (_2 >> 0))) << 0);

  Note that nvcc 10.2 is forgiving and emits the same code for both cases.
  A warning appears in test_codegen_cuda.py.

Signed-off-by: Wei Pan <weip@nvidia.com>

committed Jan 17, 2020

2630ffcb Browse Files

17 Jan, 2020 9 commits

[VTA][TSIM] Enable TSIM CI Testing (#4407) · 2738eddf

* Update task_python_vta.sh

* install sbt=1.1.1 with apt-get

* update verilator_opt

* install verilator with major version 4.0

* disable multi-threading for now

* bug fix for correcting uop fetch address in LoadUop module

* bug fix for correcting uop fetch address in LoadUop module

* adjustment to read from dram_offset

* enable USE_THREADS with verilator 4.x

* DEBUG: try avoid core dump with verilator 4.x

* bug fix in LoadUop module

* log mega cycles in tsim

* download cat.png to avoid fetching in each run

* bug fix in LoadUop module

* solve dram_even/sram_even issue

* bug fix

* introduce scalalint in ci

* speedup tsim in ci

* bug fix

* lint scala code before building

* disable multi-threading

* split fsim/tsim script

* update Jenkins settings

* duplicate task_python_vta_fsim.sh as task_python_vta.sh for now

Co-authored-by: Thierry Moreau <tmoreau@octoml.ai>

committed Jan 17, 2020

2738eddf Browse Files

[REFACTOR] Get rid of packed_func_ext. (#4735) · 2f8a01f7

Move the conversion extensions to the specific class definitions
so that we longer need to include packed_func_ext.

committed Jan 17, 2020

2f8a01f7 Browse Files

[x86 schedule] Fallback schedule for Int8 depthwise. (#4733) · 703ed9b7
Animesh Jain committed Jan 17, 2020

703ed9b7 Browse Files

[TOOLS] JSON upgrader to upgrade serialized json. (#4730) · 67b97e5a

During Unified IR refactor we will change the structure of IRs.
This will cause certain historical modules stored via json no longer
able to be loaded by the current version.

This PR introduces a backward compatible layer to try its best effort
to upgrade json from previous version(this case 0.6) to the current version.
We mainly aim to support update of high-level ir(relay).

committed Jan 17, 2020

67b97e5a Browse Files

[QNN] Conv2D type checking for kernel per-channel scales. (#4732) · a5bb789a

* [QNN] Conv2D type checking for kernel per-channel scales.

* Address commments.

* Address comments.

* - Adding safety checks for downcasts.

Co-authored-by: shoubhik <shoubhikbhatti@gmail.com>

committed Jan 17, 2020

a5bb789a Browse Files

[VTA] Update Jenkinsfile for VTA test with TSIM (#4734) · 03ffb01c
```
* [VTA] Update Jenkinsfile for VTA test with TSIM

* duplicate task_python_vta.sh multiple copies for now
```
Liangfu Chen committed Jan 17, 2020
03ffb01c Browse Files
export builtin_fp16 on Windows (#4731) · 4e1ca857
vexilligera committed Jan 17, 2020

4e1ca857 Browse Files
[Relay] Invoke tvm::build from relay compile_engine and interpreter (#4723) · 3279957f
hlu1 committed Jan 17, 2020

3279957f Browse Files

[REFACTOR] Polish runtime (#4729) · b171cf1d

- Remove operator bool from base object ref macro
  - Raitionale: operator bool can be dangerous for sub-classes
    that also overloads other operators(e.g. ==).
  - If bool is still needed, use explicit operator bool.
- Use absolute include when necessary
- Move type related util to data_type
- Isolate stackvm code from compiler

committed Jan 16, 2020

b171cf1d Browse Files

16 Jan, 2020 6 commits

[Docs] Convert Layout pass. (#4664) · eaa23800
```
* [Docs] Convert Layout pass.

* Address comments. Section 3 massaging.

* Address comments.
```
Animesh Jain committed Jan 16, 2020
eaa23800 Browse Files

[REFACTOR] top - namespace for Tensor Operation DSL (#4727) · b8261426

* [REFACTOR] introduce top - Tensor Operation DSL.

Historically we put Tensor, Schedule and compute under the root tvm namespace.
This is no longer a good idea as the project's scope grows larger
than the tensor operation DSL.

This PR introduces top -- a namespace for tensor operational
DSL concepts such as schedule, tensor, compute.
We moved the related files to the new top subfolder.

* Move relevant files into include/tvm/top and src/top

committed Jan 16, 2020

b8261426 Browse Files

[Docs] Bring Your Own Codegen Guide -- Part 1 (#4602) · ddef9403

* BYOC tutorial: codegen C

* Address comments

* Address comments

* Add build option

* Address comments

* Use TVM_DLL_EXPORT_TYPED_FUNC

committed Jan 17, 2020

ddef9403 Browse Files

[Runtime] EdgeTPU runtime for Coral Boards (#4698) · 31021d2b
Thierry Moreau committed Jan 16, 2020

31021d2b Browse Files

[REFACTOR][ARITH] Unified IR, introduce arith subfolder. (#4722) · c7a83199

Spread the arithmetic.h into several components and move
into arith subfolder.

The arith namespace will be used for arithmetic expression
pattern detections and simplifications.

committed Jan 16, 2020

c7a83199 Browse Files

[Relay][Op] Add type check to dense (#4724) · dd13c2c2
Wei Chen committed Jan 16, 2020

dd13c2c2 Browse Files