Commits · 502ab605e399f0cdfeec5611b48af94efe436b60 · wenyuanbo / tic

25 Feb, 2020 1 commit
- [LLVM] Fix build breaks from StringRef changes (#4923) · 588523dd
```
- llvm::StringRef to std::string conversion is explicit now.

Signed-off-by: Wei Pan <wpan11nv@nvidia.com>
```
  wpan11nv committed 4 years ago
  588523dd Browse File
19 Jan, 2020 2 commits

[REFACTOR][CODEGEN] codegen->target, build_module->driver (#4742) · 33b0831c

This PR moves the codegen related code into the target folder,
as they are target specific functionalities.

We also adopt the term "compiler driver" in common compiler infra
such as rust, GHC and clang.
As a result, build_module is moved into the driver folder.

committed 5 years ago

33b0831c Browse File

[REFACTOR] Establish tir (#4740) · cf59b206

TIR is the new namespace for low-level IR
for tensor-level optimizations and loop transformations.

This PR establishes the namespace and files.

- lowered_func.h,buffer.h,data_layout.h -> tir/buffer.h,tir/data_layout.h,tir/lowered_func.h
- ir.h -> tir/expr.h, tir/stmt.h
- ir_functor_ext.h -> tir/expr_functor.h, tir/stmt_functor.h

committed 5 years ago

cf59b206 Browse Directory

09 Jan, 2020 1 commit

[REFACTOR][IR] tvm::Expr -> PrimExpr(Primitive Expr) (#4669) · d6a23cf5

* [REFACTOR][IR] tvm::Expr -> PrimExpr(Primitive Expr)

As part of unified IR, we will need to unify relay::Expr
and the current tvm::Expr under the same base type.

From the techinical point of view. tvm::Expr is a "primitive"
expression that only contains POD types and handles and does
not do life-cycle management.

This PR renames Expr->PrimExpr to clarify that.
We will send a subsequent PR to introduce the base expr class.

* Remove legacy VarExpr and ExprHash/Equal

committed 5 years ago

d6a23cf5 Browse Directory

08 Jan, 2020 1 commit

[REFACTOR][IR] Add Node suffix to low-level IR nodes (#4649) · f4c5f93b

* [REFACTOR][IR] Variable -> VarNode

* [REFACTOR][IR] Add/Sub/Mul/Div -> AddNode/SubNode etc.

* [REFACTOR][IR] Min/Max/FloorDiv/FloorMod -> MinNode/MaxNode etc.

* [REFACTOR][IR] EQ/NE/LT/LE/GT/GE/Select -> EQNode/NENode etc.

* [REFACTOR][IR] Add Node suffix to Select/Call/Load/Ramp/Shuffle/Let

* [REFACTOR][IR] Add node suffix to IntImm/UIntImm/FloatImm/StringImm

* [REFACTOR][IR] Add Node suffix to Any, AttrStmt, AssertStmt

* [REFACTOR][IR] Add Node suffix to Store/Provide/Allocate/Free

* [REFACTOR][IR] Add Node suffix to ProducerConsumer

* Fix lint

* style updates, test fixes

committed 5 years ago

f4c5f93b Browse Directory

04 Jan, 2020 1 commit

[REFACTOR] TVM_REGISTER_API -> TVM_REGISTER_GLOBAL (#4621) · 81523604

TVM_REGSISTER_API is an alias of TVM_REGISTER_GLOBAL.
In the spirit of simplify redirections, this PR removes
the original TVM_REGISTER_API macro and directly use TVM_REGISTER_GLOBAL.

This type of refactor will also simplify the IDE navigation tools
such as FFI navigator to provide better code reading experiences.

Move EnvFunc's definition to node.

committed 5 years ago

81523604 Browse Directory

22 Dec, 2019 1 commit

[REFACTOR][DTYPE] Isolate dtype to runtime (#4560) · 7fa8aab5

dtype.h -> runtime/data_type.h

Changes:
- Rename all old reference of tvm::Type to DataType
- ExprNode.type -> ExprNode.dtype
- Expr.type() -> Expr.dtype()
- Change Expr related functions to expr_operator.
  - DataType::min() -> min_value(DataType)
  - DataType::max() -> max_value(DataType)
- Move type constructor Int, UInt, Float, Handle, Bool into DataType.
  - Int(bits) -> DataType::Int(bits)
  - UInt(bits) -> DataType::UInt(bits)

committed 5 years ago

7fa8aab5 Browse Directory

12 Dec, 2019 1 commit
- Fix build for llvm newer than 9.0 (#4515) · fb12f356
  Dmitri Makarov committed 5 years ago
  
  fb12f356 Browse Directory
08 Dec, 2019 1 commit
- [Codegen] fix bug on LLVM 10.0 (#4480) · 03a59bc9
  Yuanqiang Liu committed 5 years ago
  
  03a59bc9 Browse Directory
24 Nov, 2019 1 commit

[LINT] Remove unnecessary copyright message for files with ASF header (#4409) · c8772288

* [LINT] Improve the check tool to handle ASF copyright message.

* [LINT] Remove unnecessary copyright message as per ASF requirement.

* Fix codegen hybrid

* [LINT] Broaden license checks to include html, xml

* [LINT] Fix rest of the files

* Fix notice

* [LINT] Improve check file type error message

committed 5 years ago

c8772288 Browse Directory

20 Nov, 2019 1 commit
- fix build with llvm trunk (#4386) · da9a0330
  masahi committed 5 years ago
  
  da9a0330 Browse Directory
15 Nov, 2019 2 commits

Add workgroup size attribute to AMDGPU functions in codegen (#4342) · 0a9f7e9a

When we did not set the workgroup size, LLVM will use too many registers
for kernel launches with many threads. This resulted in "invalid ISA"
errors. Here we set the maximum workgroup size to the maximum threads
per block from the device API.

Of course, one might look into allowing configurations with fewer
threads at runtime to use more registers.

committed 5 years ago

0a9f7e9a Browse Directory

[RUNTIME] Add device query for AMD GcnArch (#4341) · 0235d283
```
* add gcnArch query

* kGcnArch query for cuda is a no-op
```
Peter Yeh committed 5 years ago
0235d283 Browse Directory

05 Nov, 2019 1 commit

Require LLVM >= 9 for AMDGPU backend (#4253) · 635831c7

LLVM 8 will crash when loading the bitcodes

This is a runtime check as the file will be compiled in even when
USE_ROCM OFF is used in the configuration if ROCM is installed
in the default location.

Fixes: #4087

committed 5 years ago

635831c7 Browse Directory

11 Oct, 2019 1 commit
- force code object v2 for amd gpu backend (#4099) · 15ae9780
  Peter Yeh committed 5 years ago
  
  15ae9780 Browse Directory
04 Oct, 2019 1 commit
- [llvm] switch to use Align for llvm trunk (#4051) · 59d8d400
  Yizhi Liu committed 5 years ago
  
  59d8d400 Browse Directory
07 Sep, 2019 1 commit
- Add .hsaco save/load for ROCm target (#3852) · e8c6adc6
```
fix lld
```
  Peter Yeh committed 5 years ago
  e8c6adc6 Browse Directory
10 Apr, 2019 1 commit

[REFACTOR] Use more TypedPackedFuncs (#2981) · 51785062

* Add `set_body_simple` to Registry, refactor a lot of code to use it

* Add more types to Relay PackedFuncs

* Add Registry::set_body_method to easily make Node methods into
PackedFuncs

* Add set_body_method, set_body_node_method; start typing api_lang

* Add some docs, remove unused script

* Fix mysterious linter problem

* Touch up api_ir.cc

* Fix some issues with TOPI argument counts

* Revert changes to topi.cc to avoid problems with optional arguments

* A little more cleanup

* Type more of the api _ functions

* Whitespace

* Finalize names and docs for new registry helpers

* Update docs

committed 5 years ago

51785062 Browse Directory

08 Apr, 2019 1 commit

[HEADER] Add Header to Comply with ASF Release Policy (#2982) · cffb4fba

* [HEADER] ASF header dir=include

* [HEADER] ASF Header dir=src

* [HEADER] ASF Header -dir=python

* [HEADER] ASF header dir=topi

* [HEADER] ASF Header dir=nnvm

* [HEADER] ASF Header -dir=tutorials

* [HEADER] ASF Header dir=tests

* [HEADER] ASF Header -dir=docker

* fix whitespace

* [HEADER] ASF Header -dir=jvm

* [HEADER] ASF Header -dir=web

* [HEADER] ASF Header --dir=apps

* [HEADER] ASF Header --dir=vta

* [HEADER] ASF Header -dir=go

* temp

* [HEADER] ASF Header --dir=rust

* [HEADER] Add ASF Header --dir=cmake

* [HEADER] ASF Header --dir=docs

* [HEADER] Header for Jenkinsfile

* [HEADER] ASF Header to toml and md

* [HEADER] ASF Header to gradle

* Finalize rat cleanup

* Fix permission

* Fix java test

* temporary remove nnvm onnx test

committed 5 years ago

cffb4fba Browse Directory

27 Feb, 2019 1 commit
- [CODEGEN LLVM GPU] Initialize llvm before lookup for the target (#2683) · 2bf34581
  Denis Khalikov committed 5 years ago
  
  2bf34581 Browse Directory
06 Nov, 2018 1 commit
- [CODEGEN][LLVM] Cache packed func ptr, lift alloca (#2070) · fdf035e8
  Tianqi Chen committed 6 years ago
  
  fdf035e8 Browse Directory
01 Nov, 2018 1 commit
- [Cleanliness] [Easy] Make TVM leak-sanitizer and Wnon-virtual-dtor clean. (#2046) · 0319f99d
  Andrew Tulloch committed 6 years ago
  
  0319f99d Browse Directory
23 Aug, 2018 1 commit
- Remove leading "./" from include paths (#1640) · b95b5958
  MORITA Kazutaka committed 6 years ago
  
  b95b5958 Browse Directory
13 Jun, 2018 1 commit
- [BUILD] Enable path option for ROCM, CUDA, Vulkan, simplify optional build (#1270) · 7afeab07
  Tianqi Chen committed 6 years ago
  
  7afeab07 Browse Directory
11 Jun, 2018 1 commit
- [BUILD] Switch to CMake only Infra (#1254) · 2d3031ee
  Tianqi Chen committed 6 years ago
  
  2d3031ee Browse Directory
31 May, 2018 1 commit
- Fix compilation failure with latest LLVM (#1208) · 8157e5b1
```
* fix problem with the latest LLVM

* add if-defs to support older LLVMs
```
  Hiroshi Inoue committed 6 years ago
  8157e5b1 Browse Directory
18 May, 2018 1 commit
- [llvm] fixed issue with llvm 5 vs 6 (#1167) · 21bf9839
  wfu committed 6 years ago
  
  21bf9839 Browse Directory
11 May, 2018 1 commit
- [CODEGEN] Enable cross compile of AMDGPU without rocm, update rpc (#1154) · 51c40b4f
  Tianqi Chen committed 6 years ago
  
  51c40b4f Browse Directory
17 Mar, 2018 1 commit
- [RUNTIME] More reliable thread enumeration (#1017) · 6588662f
  Tianqi Chen committed 6 years ago
  
  6588662f Browse Directory
09 Nov, 2017 1 commit

inline AMD GPU functions (#625) · 8fea0879

* Support vector operations for AMD (llvm IR)

* fix whitespace

* update comments, docstring

* inline AMD GPU functions

committed 7 years ago

8fea0879 Browse Directory

03 Nov, 2017 1 commit
- [DLPack] Upgrade dlpack to 0.2 (#609) · 8214d6ca
  Tianqi Chen committed 7 years ago
  
  8214d6ca Browse Directory
26 Oct, 2017 1 commit
- [ROCM] View llvm ir and gcn asm with module.get_source(...) (#590) · 6a5d6165
```
* view llvm ir and gcn asm with module.get_source(...)

* fix lint
```
  masahi committed 7 years ago
  6a5d6165 Browse Directory
20 Oct, 2017 1 commit

[ROCM] Working math function support for ROCm backend, a bug fix in LLVM based codegen (#570) · 326edd76

* added math function support

* bug fix extern func call in llvm based codegen

lint fix

fix build

bug fix extern func call in llvm based codegen

* moved rocm bitcodes detection to python

committed 7 years ago

326edd76 Browse Directory

15 Oct, 2017 1 commit
- [CODEGEN] Bugfix multiple condition generation (#558) · 163c4795
  Tianqi Chen committed 7 years ago
  
  163c4795 Browse Directory
13 Oct, 2017 1 commit

added support for rocm gpu autodetect (#549) · ed783689

* added support for rocm gpu autodetect

* changed type casting from old style to static_cast

* fixed code to generate gfx specific code object

* fixed namespaces

committed 7 years ago

ed783689 Browse Directory

12 Oct, 2017 1 commit
- fixed rocm runtime. set default gcn arch to be gfx803 (#544) · 624c37df
  masahi committed 7 years ago
  
  624c37df Browse Directory
13 Sep, 2017 1 commit

[BACKEND] initial llvm codegen for amdgpu (#402) · 891e226b

* added initial llvm codegen for amdgpu

* fixed whitespace

* fixed hsaco gen from ir

* fixed targetmachine for rocm and added GetSource for rocm

* fixed whitespace issues

* changed statement to use less than 100 lines

* added intrinsics for workgroup - rocm

* whitespace - newline error fix

* fixed error msg for workitem-workgroup intrinsics

* added llvm ir dump for rocm codegen

* [ROCM] changed codegen to emit proper amdgpu kernel header

* fixed whitespace error

* fixed whitespace error- 2

* fixed AddFunction to not to use extra arg

1. Changed AddFunctionInternal to not to take extra arg for target type
2. Use Target from CodeGenLLVM to check for AMDGPU target

* fixed whitespaces

* fixed whitespaces 2

* fixed codegen for AMDGPU - now generating valid IR

* fixed codegen depending on code review

* reviewed alignment for amd devices

* added code to dump code object to file

* fixed cpplint errors

* print out IR after pass manager

* added code to dump asm, obj to file and std string

* fixed whitespaces

* Update codegen_amdgpu.cc

* used registry for amdgpu llvm

* Fixed whitespaces

* added code for calling linker

* fixed formatting errors

* added rocm link python interface

* fixed pylint issues and added more body to the function

* added doc string

* added doc string for module

* fixed python code after review, fixed llvm object codegen

* fixed linker to generate code object

* removed dumping to output file and debugging log out

* fixed lint for python code

* added fault check after running linker

* removed print statement in rocm.py

* changed rocm lld linker to raise runtimeerror than emitting error log to stderr

* changed the way linker command line is pass to subprocess.popen

* removed redundant code and reuse tvm utils

* removed commented out code

* removed cloning of unused modules, and put IR into string

committed 7 years ago

891e226b Browse Directory

09 Sep, 2017 1 commit
- [LLVM] Protect ll when emit pass (#436) · 0c9adc5b
  Tianqi Chen committed 7 years ago
  
  0c9adc5b Browse Directory
31 Aug, 2017 1 commit
- [BACKEND] Allow nvptx to pass ll ir to CUDAModule (#404) · b8c8aadf
  Tianqi Chen committed 7 years ago
  
  b8c8aadf Browse Directory
28 Aug, 2017 1 commit
- [CODEGEN] NVPTX backend. (#392) · 0560e156
```
* [CODEGEN] NVPTX backend.

* Fix pylint

* use fix
```
  Tianqi Chen committed 7 years ago
  0560e156 Browse Directory