1. 27 Oct, 2019 1 commit
  2. 24 Oct, 2019 1 commit
    • TensorCore Support using Intrinsic (#4136) · 324a9607
      * add tensor core support
      
      * avoid memory bank conflict
      
      * fix thread sync & better performance
      
      * better performance
      
      * add schedule test for conv2d
      
      * extend into BatchMatMul
      
      * support config fragment shape and layout using intrinsic
      
      * add TensorCore tutorial
      
      * add int support and fix lint
      
      * address comment
      
      * add 32*16*8 TensorCore test
      
      * fix wmma include logic
      Siyuan Feng committed
  3. 10 Oct, 2019 1 commit
  4. 08 Oct, 2019 1 commit
  5. 28 Sep, 2019 1 commit
  6. 13 Sep, 2019 1 commit
  7. 09 Sep, 2019 1 commit
  8. 07 Sep, 2019 1 commit
    • [VTA] Support TLPP in function simulator. (#3555) · 50c4546f
      * [VTA] Support TLPP in function simulator.
      Issue:
      currently vta function simulator just doing serialized instruction
      execution, the dependency logic of runtime ISA which use for task
      level pipe line parallelism can not get verified by function simulator.
      
      Solution:
      make the simulator driver to be multiple thread and support TLPP.
      
      Benefit:
      TLPP support VTA function simulator would make VTA logic testing/debug
      /change more easy.
      
      replace boost lockfree queue
      
      add configure control for simulator tlpp enable or disable.
      
      change code tyle into google style.
      
      Wrap queue read/write and sync logic to make function call more simple.
      
      Add some comments.
      
      Remove MT logic, change into Single thread mode.
      
      address review comments.
      
      code style change to match google code style and add comments.
      
      add cmake macro to enable/disable simulator tlpp logic.
      
      submodule update.
      
      correct file name mentioned in comments.
      
      * remove USE_VTA_FSIM_TLPP.
      Hua Jiang committed
  9. 05 Sep, 2019 3 commits
    • [VTA][TOPI] Conv2d transpose (deconvolution) operator support (#3777) · 23c22812
      * initial conv2d_transpose
      
      * correct select operator
      
      * cleanup
      
      * fix
      
      * fix correcness check
      
      * conv2d transpose declaration fix
      
      * autotvm conv2d_transpose tuning script
      
      * ir pass fix
      
      * fix tuning script
      
      * deriving params from env, adding bias
      
      * removing bias comp from deconvolution
      
      * lint
      
      * fix
      
      * lint
      
      * lint
      
      * turning off cpu
      
      * lint, ops
      
      * lint
      
      * import fix
      
      * removing hard coded values
      
      * lint
      Thierry Moreau committed
    • [VTA][Relay] Extending Vision model coverage compilation for VTA (#3740) · 028f47ce
      * adding support for graphpack over multiply op
      
      * increasing resnet model coverage
      
      * fix indentation
      
      * lint
      
      * moving recursion limit fix into graphpack pass
      
      * moving recursionlimit to relay init
      
      * pooling on NCHWnc format
      
      * adding more models
      
      * deploy_resnet_on_vta.py
      
      * trailing line
      
      * generalizing to vision models
      
      * merge conflicts
      
      * fix, apply quantization to VTA only
      
      * improving comments
      
      * trimming models that have runtime issues for the moment
      
      * lint
      
      * lint
      
      * lint
      Thierry Moreau committed
    • [VTA] de10-nano driver (#3394) · 734df8d5
      * rework;
      
      * `de10-nano` -> `de10nano`;
      
      * fix compilation error;
      
      * bug fix;
      
      * Update install.md
      
      * Update install.md
      
      * Update install.md
      
      * update with current runtime;
      
      * add debug messages;
      
      * bug fix in cma kernel module;
      Liangfu Chen committed
  10. 04 Sep, 2019 2 commits
  11. 03 Sep, 2019 1 commit
  12. 02 Sep, 2019 1 commit
  13. 01 Sep, 2019 1 commit
  14. 29 Aug, 2019 1 commit
    • [VTA] Fix RewriteForceSerial Function logic issue. (#3854) · 187600da
      Issue:
      RewriteForceSerial is a debug function to force instructions
      to be serialize instead of parrallel running, by doing so we
      can isolate some parallel problem or do performance compare
      between parallel and serialize. But this function have some
      problem, once get enabled by set debug flag, vta would stuck
      when running on pynq board.
      
      Analysis:
      once enable RewriteForceSerial, the dependency logic is different
      with default one, but we still use same logic to generate FINISH
      and other logic, this would cause dead lock.
      
      Solution:
      give a different dependency settings when enable RewriteForceSerial.
      Hua Jiang committed
  15. 27 Aug, 2019 1 commit
  16. 26 Aug, 2019 1 commit
    • [VTA][TSIM] Introduce Virtual Memory for TSIM Driver (#3686) · 92b6ca71
      * initial virtual memory;
      
      * initial integration;
      
      * include the header file in cmake;
      
      * implement allocation with virtual to logical address mapping;
      
      * virtual memory for tsim_driver;
      
      * implement the missing memory release function;
      
      * readability improvement;
      
      * readability improvement;
      
      * address review comments;
      
      * improved robustness in virtual memory allocation;
      
      * remove VTA_TSIM_USE_VIRTUAL_MEMORY macro and use virtual memory for tsim by default;
      
      * link tvm against vta library;
      
      * merge with master
      
      * build virtual memory system without linking tvm against vta;
      
      * minor change;
      
      * reuse VTA_PAGE_BYTES;
      
      * using DRAM class from sim_driver as VirtualMemoryManager;
      
      * satisfy linter;
      
      * add comments in code;
      
      * undo changes to Makefile
      
      * undo changes to Makefile
      
      * retrigger ci;
      
      * retrigger ci;
      
      * directly call into VirtualMemoryManager::Global()
      Liangfu Chen committed
  17. 18 Aug, 2019 1 commit
  18. 14 Aug, 2019 2 commits
  19. 13 Aug, 2019 4 commits
    • syntax fix (#3765) · 4c01e8ee
      Benjamin Tu committed
    • [VTA][Chisel] run all unittests by default (#3766) · e9782030
      * [VTA][Chisel] run all unittests by default
      
      * better naming
      
      * add generated unittest folder to clean rule
      Luis Vega committed
    • [VTA][TSIM][Build] Towards TSIM CI testing (#3704) · e518fe1c
      * building TSIM specific library along with fast simulator to quickly switch between dlls
      
      * cmake controlled TSIM libraries
      
      * always build tsim driver in either simulation modes
      
      * build DLLs based on CMAKE flags
      
      * updating the jenkinsfile
      
      * small restructuring
      
      * reducing the cmake flags
      
      * update instructions
      
      * reverting to 3 flags
      
      * update Jenkinsfile
      
      * adding new line
      
      * enabling TSIM unit and integration tests
      
      * fix description
      
      * temporarily disabling task_python_vta tests in CPU Build stage
      
      * move CPU tests in unit test stage
      
      * stage  reorg
      
      * better make
      
      * disabling TSIM tests for now
      
      * reverting some restructuring
      
      * fix
      Thierry Moreau committed
    • [VTA] [Chisel] Improved Data Gen, Added ALU Test (#3743) · 5f9c5e43
      * added alutest
      
      * fix indent
      
      * name change for cycle
      
      * improved data gen and infra
      
      * added alutest
      
      * fix indent
      
      * name change for cycle
      
      * improved data gen and infra
      
      * fix space
      
      * fix indent
      
      * fixes
      
      * aluRef
      
      * fix randomarary
      
      * add
      
      * Revert "add"
      
      This reverts commit 87077daebbe055dee11f80e37da3a6291138e0f0.
      
      * Revert "fix randomarary"
      
      This reverts commit df386c1e660eb6ebcff1a1f905610573676f1589.
      
      * Revert "aluRef"
      
      This reverts commit 8665f0d4a7b12b796b2cb1ca6bf9cfe5613ee389.
      
      * should fix dlmc-core
      Benjamin Tu committed
  20. 08 Aug, 2019 1 commit
  21. 07 Aug, 2019 1 commit
  22. 06 Aug, 2019 1 commit
  23. 03 Aug, 2019 1 commit
  24. 31 Jul, 2019 1 commit
    • [VTA] VTA Compilation Script for Intel FPGA (#3494) · 83591aa5
      * initial compilation script for chisel-vta;
      
      * replace tabs with spaces;
      
      * compile script for de10-nano;
      
      * remove generated verilog source code;
      
      * remove `altsource_probe`, `debounce`, `edge_detect` ip;
      
      * replace quartus project files with a single tcl script;
      
      * Update install.md
      
      * improved makefile-based compilation script;
      
      * complete makefile-based compilation of chisel-vta for de10-nano;
      
      * install quartus;
      
      * conversion to .rbf file;
      
      * document chisel-vta compilation process for de10-nano;
      
      * rename generated bitstream file;
      
      * download and extract custom ip for de10-nano;
      
      * minor change
      
      * minor change
      
      * fix indentation;
      
      * bug fix;
      
      * improved robustness in makefile;
      
      * clean up;
      
      * add `.sdc .ipx .qsys` allowance in jenkins;
      
      * add ASF header;
      
      * add ASF header;
      
      * remove IntelShell.scala, update vta_hw.tcl, clean up Makefile & soc_system.qsys;
      
      * add ASF header;
      
      * keep sources compact;
      
      * keep sources compact;
      
      * it's not necessary now
      
      * AXI4LiteClient -> AXI3Client for IntelShell
      
      * remove connection to fpga_only_master;
      
      * a few important bug fix: wire reset pin, and set host_r_last to high
      
      * remove intel specific interface definition;
      
      * add NO_DSP option in Makefile;
      
      * AXI4Lite is not used in IntelShell;
      
      * minor fix: disable dsp and use logic instead;
      
      * quartus version change: 18.0 -> 18.1
      
      * remove altera related statement;
      
      * compose compile_design.tcl
      
      * initial tcl script for soc_system generation;
      
      * remove .qsys file;
      
      * remove unused;
      
      * .qsys can be generated by tcl script;
      
      * remove hps_io and shrink size of soc_system;
      
      * integrate into makefile;
      
      * version change: 18.0 -> 18.1
      
      * add sample config file for de10-nano;
      
      * parameterize DEVICE and PROJECT_NAME
      
      * remove extra lines;
      
      * brief description on flashing sd card image for de10-nano
      
      * docs on building additional components
      
      * parameterize DEVICE and DEVICE_FAMILY
      
      * parameterize DEVICE and DEVICE_FAMILY
      
      * parameterize DEVICE and DEVICE_FAMILY
      
      * de10-nano -> de10nano
      
      * minor change
      
      * add comment in code and document in order to address review comments;
      Liangfu Chen committed
  25. 30 Jul, 2019 2 commits
  26. 29 Jul, 2019 2 commits
    • [VTA] Refactor to increase platform coverage (Ultra96 etc.) (#3496) · f55609b4
      * hardware refactor for increased FPGA coverage, small optimizations
      
      * fix header
      
      * cleaning up parameters that won't be needed for now
      
      * streamlining makefile, and simplifying tcl scripts
      
      * moving parameter derivation into pkg_config.py, keeping tcl scripts lightweight
      
      * refactoring tcl script to avoid global variables
      
      * deriving AXI signals in pkg_config.py
      
      * unifying address map definition for hardware and software drivers
      
      * single channel design for ultra96 to simplify build
      
      * enable alu by default, no mul opcode for now
      
      * hardware fix
      
      * new bitstream; vta version
      
      * avoid error when env variable is not set
      
      * ultra96 cleanup
      
      * further cleaning up tcl script for bitstream generation
      
      * preliminary rpc server support on ultra96
      
      * rpc server tracker scripts
      
      * ultra96 ldflag
      
      * ultra96 support
      
      * ultra96 support
      
      * cleanup line
      
      * cmake support for ultra96
      
      * simplify memory instantiation
      
      * cleaning up IP parameter initialization
      
      * fix queue instantiation
      
      * 2019.1 transition
      
      * fix macro def
      
      * removing bus width from config
      
      * cleanup
      
      * fix
      
      * turning off testing for now
      
      * cleanup ultra96 ps insantiation
      
      * minor refactor
      
      * adding comments
      
      * upgrading to tophub v0.6
      
      * model used in TVM target now refers to a specific version of VTA for better autoTVM scheduling
      
      * revert change due to bug
      
      * rename driver files to be for zynq-type devices
      
      * streamlining address mapping
      
      * unifying register map offset values between driver and hardware generator
      
      * rely on cma library for cache flush/invalidation
      
      * coherence management
      
      * not make buffer packing depend on data types that can be wider than 64bits
      
      * refactor config derivation to minimize free parameters
      
      * fix environment/pkg config interaction
      
      * adding cfg dump property to pkgconfig:
      
      * fix rpc reconfig
      
      * fix spacing
      
      * cleanup
      
      * fix spacing
      
      * long line fix
      
      * fix spacing and lint
      
      * fix line length
      
      * cmake fix
      
      * environment fix
      
      * renaming after pynq since the driver stack relies on the pynq library - see pynq.io
      
      * update doc
      
      * adding parameterization to  name
      
      * space
      
      * removing reg width
      
      * vta RPC
      
      * update doc on how to edit vta_config.json
      
      * fix path
      
      * fix path
      Thierry Moreau committed
  27. 28 Jul, 2019 2 commits
  28. 27 Jul, 2019 1 commit
  29. 26 Jul, 2019 1 commit
    • [VTA] [Chisel] support for different inp/wgt bits, rewrote DotProduct for clarity (#3605) · 87e18a44
      * support for different inp/wgt bits, rewrote dot for clarity
      
      * [VTA] [Chisel] support for different inp/wgt bits, rewrote DotProduct for clarity
      
      * [VTA] [Chisel] support for different inp/wgt bits, rewrote DotProduct for clarity
      
      * change back to sim
      
      * fix index
      
      * fix index
      
      * fix indent
      
      * fix indent
      
      * fix indent
      
      * fix trailing spaces
      
      * fix trailing spaces
      
      * change to more descriptive name
      
      * matric->matrix
      
      * fix spacing
      
      * fix spacing & added generic name for dot
      
      * better parameter flow
      
      * spacing
      
      * spacing
      
      * spacing
      
      * update requirement (tested) for dot, spacing
      
      * function call convention
      
      * small edit
      Benjamin Tu committed
  30. 23 Jul, 2019 1 commit