* [VTA] Performance optimize, remove unnecessary contigious memory use. Issue: Uop maintain a cache vector to copy uop data into contigious DRAM memory for FPGA/Simulator use, but this cache vector not get clear after FPGA/Simulator core run, in Resnet18 case, if we printf the cache size in UopQueue::ReadBarrier function, we can saw such cache size keep increase, this would cause no use data copy and unnecessary contigous DRAM memory malloc. Analysis: This issue caused by not clear cache_ vector when do uop_queue_.Reset(). Solution: Override BaseQueue Reset function in UopQueue and add cache_ clear logic. * address review comments, remove spacing.
Name |
Last commit
|
Last update |
---|---|---|
.. | ||
apps | Loading commit data... | |
config | Loading commit data... | |
hardware | Loading commit data... | |
include/vta | Loading commit data... | |
python/vta | Loading commit data... | |
scripts | Loading commit data... | |
src | Loading commit data... | |
tests | Loading commit data... | |
tutorials | Loading commit data... | |
README.md | Loading commit data... |