Comments (11)
Acutally, beta_offset and weight_offset are the address offset for different layers. INTER_WIDTH is a constant value that similar with InputQ or OutputQ to indicate the fraction bit.
from yolov2_xilinx_fpga.
@dhm2013724 Thank u for the reply.
how do u find the beta and weight offset values ? Also do u think its possible to change this to use INT8 weights and biases and int8 operations?
Also what are *Memory_bottom, *Memory_top and *Memory_buf ? whats the difference between these?
It will be a great help!
Thnaks in advanve!
from yolov2_xilinx_fpga.
There are nearly 30 layers in YOLOv2 model. For convolutional layer, it needs input/output feature maps and kernel weights. In software version, I just used two continued buffers to store feature maps and weights for simulating memory access operations. Like stack and heap in software, I manually set the push/pop address in two buffers for different layers.
Memory_buf means the memory buffer for feature map, and Memory_bottom & Memory_top are the upper and lower limitation for stack/heap.
If you want to use INT8 percison, you should manually concat 8bit data to 32/64 or other datawidth for data transmission between ip and memory.
from yolov2_xilinx_fpga.
@dhm2013724 thank u sooo much for the reply.
If you want to use INT8 percison, you should manually concat 8bit data to 32/64 or other datawidth for data transmission between ip and memory.
is that the method you use now for 16bit weight ? where do I have to change ? cnn.cpp and yolov2.h right ? Do I have to change the vivado design ?
Thanks in advance!
from yolov2_xilinx_fpga.
@dhm2013724
cnn/cnn/solution1/sim/verilog/AESL_axi_master_DATA_BUS.v: Read request address 1040 exceed AXI master DATA_BUS array depth: 1024
$finish called at time : 5265 ns : File "/home/thilanka/Documents/Documents/github/fpga/vivado_tests/cnn/cnn/solution1/sim/verilog/AESL_axi_master_DATA_BUS.v" Line 700
quit
I am getting this error in the rtl co simulation. what can be wrong here ?
Also this is the output I am getting in the c simulation. I do not understand why I am getting like this.
please guide me.
Thanks in advance!
from yolov2_xilinx_fpga.
It depends on the actual design. For example, if you want to use INT8 percision and the datawidth of the data transmission interface is 64bit, you have to wait for 8 continued datas and concat these 8 datas to one 64-bit data for transmission. In accelerator design, you should modify the input\output module. If further optimizing currecnt design, I suggest to use AXIStream interface with SG DMA.
from yolov2_xilinx_fpga.
@dhm2013724
cnn/cnn/solution1/sim/verilog/AESL_axi_master_DATA_BUS.v: Read request address 1040 exceed AXI master DATA_BUS array depth: 1024 $finish called at time : 5265 ns : File "/home/thilanka/Documents/Documents/github/fpga/vivado_tests/cnn/cnn/solution1/sim/verilog/AESL_axi_master_DATA_BUS.v" Line 700 quitI am getting this error in the rtl co simulation. what can be wrong here ?
Also this is the output I am getting in the c simulation. I do not understand why I am getting like this.
please guide me.Thanks in advance!
HLS has told you, the depth of AXI master is too small, you should use bigger depth. For RTL-cosimulation, I suggest simulation single layer for sparing time. If C-simulation is ok and set the big enough depth, maybe the accelerator structure has somethings wrong, you should look at the wave to find the bug module.
from yolov2_xilinx_fpga.
@dhm2013724 C cimulation doesnt give any errors but gives wrong results as shown in the image I posted. It doesnt predict correctly. Why is that ? what can be wrong ?
HLS has told you, the depth of AXI master is too small, you should use bigger depth
This is the AXI depth right ? This is set to 512 by you in the directive. Why do you thinkit does not work with mine. I didnt change anything. ( I am using the old version of your code here)
Thanks in advance!
from yolov2_xilinx_fpga.
In software version, I just used two continued buffers to store feature maps and weights for simulating memory access operations
@dhm2013724 @clancylea
are main.cpp and yolov2.h files testbench files for the cnn.cpp file for vivado_hls simulation. because without testbench files I can not run cnn.cpp file in xilinx vivado_hls. so I set cnn.cpp as the main file and main.cpp, yolov2.h and weight files as test bench files in the vivado_hls project. Am i doing it correctly ? Am I doing anything wrong? please guide me.
Thanks in advance!
from yolov2_xilinx_fpga.
@dhm2013724 what is the difference between cnn.cpp(hardware code) and yolov2.h, main.cpp(software code)? because these files contain the same functions and codes. Please guide me. Also is there a way to contact you other than github. maybe email? my email is - [email protected]
Thanks in advance!
Thank you.
from yolov2_xilinx_fpga.
@dhm2013724 what are the improvements you have made from your first version(n=2, m=32) to the current version.
I see that you have added some parts with burst lengths, added a eval part and have changed the weight and input load parts. Whats the difference between anti-reorg and reorg version. What is weight reorg means?
Also what do u mean by change the hardware and generate new weights? Does that mean that the weights change with the hardware ?
Also if you want to use this for tiny yolov2 how to use the weights ? can you provide the files for tiny yolov2 ?
Thanks in advance!
from yolov2_xilinx_fpga.
Related Issues (20)
- 还想请问一下,有没有大佬实现了摄像头的连接与数据传输处理,想要交流一下 HOT 2
- .pth文件转.bin
- zcu104复现
- 关于硬件的reorg模块 HOT 3
- 关于HLS输出的IP的端口问题,端口数与图片不匹配 HOT 7
- 关于int16的Vivado Block Design连线
- 导入IP并完成连线后,implementation耗时过长的问题(int16版) HOT 1
- 关于量化为int16后生成的bin文件问题 HOT 5
- About "SDK for Zedboard" is operating on which OS, Windows or Ubuntu? HOT 1
- makefile生成的Yolov2复制到SD卡,在板子上./Yolov2运行之后,运行到Layer[ 0]就停止了,请问是什么原因呢? HOT 8
- 更换权重后图像识别紊乱
- 使用Zedboard,精度为int16,怎样将预测图片直接输出在屏幕上 HOT 3
- 请问:xdc约束文件是否必要?
- My ip block not suitable zcu102 HOT 1
- Custom Model
- pynq推理卡死及推理出固定的错误结果或无识别结果。Python is stuck in inference .Get a consistently wrong result or even nothing detected. HOT 6
- zcu104 board design
- Access to the papers, it is behind a paywall.
- Petalinux problem
- Yolov8
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from yolov2_xilinx_fpga.