- 23 Aug, 2018 1 commit
-
-
MORITA Kazutaka committed
-
- 13 Jun, 2018 1 commit
-
-
Tianqi Chen committed
-
- 11 Jun, 2018 1 commit
-
-
Tianqi Chen committed
-
- 31 May, 2018 1 commit
-
-
* fix problem with the latest LLVM * add if-defs to support older LLVMs
Hiroshi Inoue committed
-
- 18 May, 2018 1 commit
-
-
wfu committed
-
- 11 May, 2018 1 commit
-
-
Tianqi Chen committed
-
- 17 Mar, 2018 1 commit
-
-
Tianqi Chen committed
-
- 09 Nov, 2017 1 commit
-
-
* Support vector operations for AMD (llvm IR) * fix whitespace * update comments, docstring * inline AMD GPU functions
eqy committed
-
- 03 Nov, 2017 1 commit
-
-
Tianqi Chen committed
-
- 26 Oct, 2017 1 commit
-
-
* view llvm ir and gcn asm with module.get_source(...) * fix lint
masahi committed
-
- 20 Oct, 2017 1 commit
-
-
* added math function support * bug fix extern func call in llvm based codegen lint fix fix build bug fix extern func call in llvm based codegen * moved rocm bitcodes detection to python
masahi committed
-
- 15 Oct, 2017 1 commit
-
-
Tianqi Chen committed
-
- 13 Oct, 2017 1 commit
-
-
* added support for rocm gpu autodetect * changed type casting from old style to static_cast * fixed code to generate gfx specific code object * fixed namespaces
Aditya Atluri committed
-
- 12 Oct, 2017 1 commit
-
-
masahi committed
-
- 13 Sep, 2017 1 commit
-
-
* added initial llvm codegen for amdgpu * fixed whitespace * fixed hsaco gen from ir * fixed targetmachine for rocm and added GetSource for rocm * fixed whitespace issues * changed statement to use less than 100 lines * added intrinsics for workgroup - rocm * whitespace - newline error fix * fixed error msg for workitem-workgroup intrinsics * added llvm ir dump for rocm codegen * [ROCM] changed codegen to emit proper amdgpu kernel header * fixed whitespace error * fixed whitespace error- 2 * fixed AddFunction to not to use extra arg 1. Changed AddFunctionInternal to not to take extra arg for target type 2. Use Target from CodeGenLLVM to check for AMDGPU target * fixed whitespaces * fixed whitespaces 2 * fixed codegen for AMDGPU - now generating valid IR * fixed codegen depending on code review * reviewed alignment for amd devices * added code to dump code object to file * fixed cpplint errors * print out IR after pass manager * added code to dump asm, obj to file and std string * fixed whitespaces * Update codegen_amdgpu.cc * used registry for amdgpu llvm * Fixed whitespaces * added code for calling linker * fixed formatting errors * added rocm link python interface * fixed pylint issues and added more body to the function * added doc string * added doc string for module * fixed python code after review, fixed llvm object codegen * fixed linker to generate code object * removed dumping to output file and debugging log out * fixed lint for python code * added fault check after running linker * removed print statement in rocm.py * changed rocm lld linker to raise runtimeerror than emitting error log to stderr * changed the way linker command line is pass to subprocess.popen * removed redundant code and reuse tvm utils * removed commented out code * removed cloning of unused modules, and put IR into string
Aditya Atluri committed
-
- 09 Sep, 2017 1 commit
-
-
Tianqi Chen committed
-
- 31 Aug, 2017 1 commit
-
-
Tianqi Chen committed
-
- 28 Aug, 2017 1 commit
-
-
* [CODEGEN] NVPTX backend. * Fix pylint * use fix
Tianqi Chen committed
-