- 27 Oct, 2019 1 commit
-
-
* app init push * fix on readme * change name, add bit serial explanantion * rm serialLoadMM, change doc * syntax change for readme * add parallel test functionality * fix readme * add python doc * syntax * init commit * fix empty line * fix typo
Benjamin Tu committed
-
- 24 Oct, 2019 1 commit
-
-
* add tensor core support * avoid memory bank conflict * fix thread sync & better performance * better performance * add schedule test for conv2d * extend into BatchMatMul * support config fragment shape and layout using intrinsic * add TensorCore tutorial * add int support and fix lint * address comment * add 32*16*8 TensorCore test * fix wmma include logic
Siyuan Feng committed
-
- 10 Oct, 2019 1 commit
-
-
* app init push * fix on readme * change name, add bit serial explanantion * rm serialLoadMM, change doc * syntax change for readme * add parallel test functionality * fix readme * add python doc * syntax
Benjamin Tu committed
-
- 08 Oct, 2019 1 commit
-
-
if n_trial is larger then config space.
Attila Dusnoki committed
-
- 28 Sep, 2019 1 commit
-
-
Tianqi Chen committed
-
- 13 Sep, 2019 1 commit
-
-
Issue: RPC path get changed into "vta_rpc" from "pynq_rpc", but related document still use old informaiton. Solution: Update RPC path information.
Hua Jiang committed
-
- 09 Sep, 2019 1 commit
-
-
Luis Vega committed
-
- 07 Sep, 2019 1 commit
-
-
* [VTA] Support TLPP in function simulator. Issue: currently vta function simulator just doing serialized instruction execution, the dependency logic of runtime ISA which use for task level pipe line parallelism can not get verified by function simulator. Solution: make the simulator driver to be multiple thread and support TLPP. Benefit: TLPP support VTA function simulator would make VTA logic testing/debug /change more easy. replace boost lockfree queue add configure control for simulator tlpp enable or disable. change code tyle into google style. Wrap queue read/write and sync logic to make function call more simple. Add some comments. Remove MT logic, change into Single thread mode. address review comments. code style change to match google code style and add comments. add cmake macro to enable/disable simulator tlpp logic. submodule update. correct file name mentioned in comments. * remove USE_VTA_FSIM_TLPP.
Hua Jiang committed
-
- 05 Sep, 2019 3 commits
-
-
* initial conv2d_transpose * correct select operator * cleanup * fix * fix correcness check * conv2d transpose declaration fix * autotvm conv2d_transpose tuning script * ir pass fix * fix tuning script * deriving params from env, adding bias * removing bias comp from deconvolution * lint * fix * lint * lint * turning off cpu * lint, ops * lint * import fix * removing hard coded values * lint
Thierry Moreau committed -
* adding support for graphpack over multiply op * increasing resnet model coverage * fix indentation * lint * moving recursion limit fix into graphpack pass * moving recursionlimit to relay init * pooling on NCHWnc format * adding more models * deploy_resnet_on_vta.py * trailing line * generalizing to vision models * merge conflicts * fix, apply quantization to VTA only * improving comments * trimming models that have runtime issues for the moment * lint * lint * lint
Thierry Moreau committed -
* rework; * `de10-nano` -> `de10nano`; * fix compilation error; * bug fix; * Update install.md * Update install.md * Update install.md * update with current runtime; * add debug messages; * bug fix in cma kernel module;
Liangfu Chen committed
-
- 04 Sep, 2019 2 commits
- 03 Sep, 2019 1 commit
-
-
* [VTA] Fix TSIM compile error in Linux (add missing -fPIC flag); * [VTA] Fix TSIM compile error in Linux (add missing -fPIC flag); * fix indentation problem;
Liangfu Chen committed
-
- 02 Sep, 2019 1 commit
-
-
Luis Vega committed
-
- 01 Sep, 2019 1 commit
-
-
* [VTA][TSIM] add virtual memory support to tsim example * fix identation * remove USE_TSIM macro and use 32-bit addr instead
Luis Vega committed
-
- 29 Aug, 2019 1 commit
-
-
Issue: RewriteForceSerial is a debug function to force instructions to be serialize instead of parrallel running, by doing so we can isolate some parallel problem or do performance compare between parallel and serialize. But this function have some problem, once get enabled by set debug flag, vta would stuck when running on pynq board. Analysis: once enable RewriteForceSerial, the dependency logic is different with default one, but we still use same logic to generate FINISH and other logic, this would cause dead lock. Solution: give a different dependency settings when enable RewriteForceSerial.
Hua Jiang committed
-
- 27 Aug, 2019 1 commit
-
-
Liangfu Chen committed
-
- 26 Aug, 2019 1 commit
-
-
* initial virtual memory; * initial integration; * include the header file in cmake; * implement allocation with virtual to logical address mapping; * virtual memory for tsim_driver; * implement the missing memory release function; * readability improvement; * readability improvement; * address review comments; * improved robustness in virtual memory allocation; * remove VTA_TSIM_USE_VIRTUAL_MEMORY macro and use virtual memory for tsim by default; * link tvm against vta library; * merge with master * build virtual memory system without linking tvm against vta; * minor change; * reuse VTA_PAGE_BYTES; * using DRAM class from sim_driver as VirtualMemoryManager; * satisfy linter; * add comments in code; * undo changes to Makefile * undo changes to Makefile * retrigger ci; * retrigger ci; * directly call into VirtualMemoryManager::Global()
Liangfu Chen committed
-
- 18 Aug, 2019 1 commit
-
-
* [VTA][TSIM] parallel hardware compilation with macOS and debug support * simplify
Liangfu Chen committed
-
- 14 Aug, 2019 2 commits
-
-
* [VTA][Chisel] scale dram base address in hardware instead of runtime * remove trailing spaces
Luis Vega committed -
Thierry Moreau committed
-
- 13 Aug, 2019 4 commits
-
-
Benjamin Tu committed
-
* [VTA][Chisel] run all unittests by default * better naming * add generated unittest folder to clean rule
Luis Vega committed -
* building TSIM specific library along with fast simulator to quickly switch between dlls * cmake controlled TSIM libraries * always build tsim driver in either simulation modes * build DLLs based on CMAKE flags * updating the jenkinsfile * small restructuring * reducing the cmake flags * update instructions * reverting to 3 flags * update Jenkinsfile * adding new line * enabling TSIM unit and integration tests * fix description * temporarily disabling task_python_vta tests in CPU Build stage * move CPU tests in unit test stage * stage reorg * better make * disabling TSIM tests for now * reverting some restructuring * fix
Thierry Moreau committed -
* added alutest * fix indent * name change for cycle * improved data gen and infra * added alutest * fix indent * name change for cycle * improved data gen and infra * fix space * fix indent * fixes * aluRef * fix randomarary * add * Revert "add" This reverts commit 87077daebbe055dee11f80e37da3a6291138e0f0. * Revert "fix randomarary" This reverts commit df386c1e660eb6ebcff1a1f905610573676f1589. * Revert "aluRef" This reverts commit 8665f0d4a7b12b796b2cb1ca6bf9cfe5613ee389. * should fix dlmc-core
Benjamin Tu committed
-
- 08 Aug, 2019 1 commit
-
-
* fix * fixes
Benjamin Tu committed
-
- 07 Aug, 2019 1 commit
-
-
Thierry Moreau committed
-
- 06 Aug, 2019 1 commit
-
-
Liangfu Chen committed
-
- 03 Aug, 2019 1 commit
-
-
* added wholething * changed build and makefile
Benjamin Tu committed
-
- 31 Jul, 2019 1 commit
-
-
* initial compilation script for chisel-vta; * replace tabs with spaces; * compile script for de10-nano; * remove generated verilog source code; * remove `altsource_probe`, `debounce`, `edge_detect` ip; * replace quartus project files with a single tcl script; * Update install.md * improved makefile-based compilation script; * complete makefile-based compilation of chisel-vta for de10-nano; * install quartus; * conversion to .rbf file; * document chisel-vta compilation process for de10-nano; * rename generated bitstream file; * download and extract custom ip for de10-nano; * minor change * minor change * fix indentation; * bug fix; * improved robustness in makefile; * clean up; * add `.sdc .ipx .qsys` allowance in jenkins; * add ASF header; * add ASF header; * remove IntelShell.scala, update vta_hw.tcl, clean up Makefile & soc_system.qsys; * add ASF header; * keep sources compact; * keep sources compact; * it's not necessary now * AXI4LiteClient -> AXI3Client for IntelShell * remove connection to fpga_only_master; * a few important bug fix: wire reset pin, and set host_r_last to high * remove intel specific interface definition; * add NO_DSP option in Makefile; * AXI4Lite is not used in IntelShell; * minor fix: disable dsp and use logic instead; * quartus version change: 18.0 -> 18.1 * remove altera related statement; * compose compile_design.tcl * initial tcl script for soc_system generation; * remove .qsys file; * remove unused; * .qsys can be generated by tcl script; * remove hps_io and shrink size of soc_system; * integrate into makefile; * version change: 18.0 -> 18.1 * add sample config file for de10-nano; * parameterize DEVICE and PROJECT_NAME * remove extra lines; * brief description on flashing sd card image for de10-nano * docs on building additional components * parameterize DEVICE and DEVICE_FAMILY * parameterize DEVICE and DEVICE_FAMILY * parameterize DEVICE and DEVICE_FAMILY * de10-nano -> de10nano * minor change * add comment in code and document in order to address review comments;
Liangfu Chen committed
-
- 30 Jul, 2019 2 commits
-
-
* fix in IR pass to support padding on 6-d tensors * support for both N>1 and N==1 for padding * batch size > 1 tuning and base config * output formatting * batch conv2d * print all category results * revert to single-batch config * pick record best * fix conv test * improving reporting * address batching bug in fast simulator * fix
Thierry Moreau committed -
Thierry Moreau committed
-
- 29 Jul, 2019 2 commits
-
-
Luis Vega committed
-
* hardware refactor for increased FPGA coverage, small optimizations * fix header * cleaning up parameters that won't be needed for now * streamlining makefile, and simplifying tcl scripts * moving parameter derivation into pkg_config.py, keeping tcl scripts lightweight * refactoring tcl script to avoid global variables * deriving AXI signals in pkg_config.py * unifying address map definition for hardware and software drivers * single channel design for ultra96 to simplify build * enable alu by default, no mul opcode for now * hardware fix * new bitstream; vta version * avoid error when env variable is not set * ultra96 cleanup * further cleaning up tcl script for bitstream generation * preliminary rpc server support on ultra96 * rpc server tracker scripts * ultra96 ldflag * ultra96 support * ultra96 support * cleanup line * cmake support for ultra96 * simplify memory instantiation * cleaning up IP parameter initialization * fix queue instantiation * 2019.1 transition * fix macro def * removing bus width from config * cleanup * fix * turning off testing for now * cleanup ultra96 ps insantiation * minor refactor * adding comments * upgrading to tophub v0.6 * model used in TVM target now refers to a specific version of VTA for better autoTVM scheduling * revert change due to bug * rename driver files to be for zynq-type devices * streamlining address mapping * unifying register map offset values between driver and hardware generator * rely on cma library for cache flush/invalidation * coherence management * not make buffer packing depend on data types that can be wider than 64bits * refactor config derivation to minimize free parameters * fix environment/pkg config interaction * adding cfg dump property to pkgconfig: * fix rpc reconfig * fix spacing * cleanup * fix spacing * long line fix * fix spacing and lint * fix line length * cmake fix * environment fix * renaming after pynq since the driver stack relies on the pynq library - see pynq.io * update doc * adding parameterization to name * space * removing reg width * vta RPC * update doc on how to edit vta_config.json * fix path * fix path
Thierry Moreau committed
-
- 28 Jul, 2019 2 commits
- 27 Jul, 2019 1 commit
-
-
* fix tensor issue/commit in gemm * remove trailing spaces
Luis Vega committed
-
- 26 Jul, 2019 1 commit
-
-
* support for different inp/wgt bits, rewrote dot for clarity * [VTA] [Chisel] support for different inp/wgt bits, rewrote DotProduct for clarity * [VTA] [Chisel] support for different inp/wgt bits, rewrote DotProduct for clarity * change back to sim * fix index * fix index * fix indent * fix indent * fix indent * fix trailing spaces * fix trailing spaces * change to more descriptive name * matric->matrix * fix spacing * fix spacing & added generic name for dot * better parameter flow * spacing * spacing * spacing * update requirement (tested) for dot, spacing * function call convention * small edit
Benjamin Tu committed
-
- 23 Jul, 2019 1 commit
-
-
Luis Vega committed
-