Visible to Intel only — GUID: zgg1678993610638
Ixiasoft
1. FPGA AI Suite SoC Design Example User Guide
2. About the SoC Design Example
3. FPGA AI Suite SoC Design Example Quick Start Tutorial
4. FPGA AI Suite SoC Design Example Run Process
5. FPGA AI Suite SoC Design Example Build Process
6. FPGA AI Suite SoC Design Example Quartus® Prime System Architecture
7. FPGA AI Suite Soc Design Example Software Components
8. Streaming-to-Memory (S2M) Streaming Demonstration
A. FPGA AI Suite SoC Design Example User Guide Archives
B. FPGA AI Suite SoC Design Example User Guide Document Revision History
3.1. Initial Setup
3.2. Initializing a Work Directory
3.3. (Optional) Create an SD Card Image (.wic)
3.4. Writing the SD Card Image (.wic) to an SD Card
3.5. Preparing SoC FPGA Development Kits for the FPGA AI Suite SoC Design Example
3.6. Adding Compiled Graphs (AOT files) to the SD Card
3.7. Verifying FPGA Device Drivers
3.8. Running the Demonstration Applications
7.1.1. Yocto Recipe: recipes-core/images/coredla-image.bb
7.1.2. Yocto Recipe: recipes-bsp/u-boot/u-boot-socfpga_%.bbappend
7.1.3. Yocto Recipe: recipes-drivers/msgdma-userio/msgdma-userio.bb
7.1.4. Yocto Recipe: recipes-drivers/uio-devices/uio-devices.bb
7.1.5. Yocto Recipe: recipes-kernel/linux/linux-socfpga-lts_%.bbappend
7.1.6. Yocto Recipe: recipes-support/devmem2/devmem2_2.0.bb
7.1.7. Yocto Recipe: wic
Visible to Intel only — GUID: zgg1678993610638
Ixiasoft
3.8.1. Running the M2M Mode Demonstration Application
The M2M dataflow model uses the dla_benchmark demonstration application. The S2M bitstream supports both the M2M dataflow model and the S2M dataflow model.
You must know the host name of the SoC FPGA development kit. If you do not know the development kit host name, go back to Determining the SoC FPGA Development Kit IP Address before continuing here.
To run inference on the SoC FPGA development kit:
- Open an SSH connection to the SoC FPGA development kit:
- Start a new terminal session
- Run the following command:
build-host:$ ssh <devkit_hostname>
Where <devkit_hostname> is the host name you determined in Determining the SoC FPGA Development Kit IP Address.
Continuing the example from Determining the SoC FPGA Development Kit IP Address, the following command would open an SSH connection:build-host:$ ssh arria10-62747948036a.local
- In the SSH terminal, run the following commands:
export compiled_model=~/resnet-50-tf/RN50_Performance_b1.bin export imgdir=~/resnet-50-tf/sample_images export archfile=~/resnet-50-tf/<architecture file> cd ~/app export COREDLA_ROOT=/home/root/app ./dla_benchmark \ -b=1 \ -cm $compiled_model \ -d=HETERO:FPGA,CPU \ -i $imgdir \ -niter=8 \ -plugin ./plugins.xml \ -arch_file $archfile \ -api=async \ -groundtruth_loc $imgdir/TF_ground_truth.txt \ -perf_est \ -nireq=4 \ -bgr
where <architecture file> is one of the following files, depending on your development kit:- Agilex™ 7 FPGA I-Series Transceiver-SoC Development Kit
AGX7_Performance.arch
- Arria® 10 SX SoC FPGA Development Kit
A10_Performance.arch
- Agilex™ 7 FPGA I-Series Transceiver-SoC Development Kit
The dla_benchmark command generates output similar to the following example output for each step. This example output was generated using an Agilex™ 7 FPGA I-Series Transceiver-SoC Development Kit.
[Step 11/12] Dumping statistics report count: 8 iterations system duration: 288.8584 ms IP duration: 121.6040 ms latency: 136.0344 ms system throughput: 27.6952 FPS number of hardware instances: 1 number of network instances: 1 IP throughput per instance: 65.7873 FPS IP throughput per fmax per instance: 0.3289 FPS/MHz IP clock frequency: 200.0000 MHz estimated IP throughput per instance: 149.5047 FPS (500 MHz assumed) estimated IP throughput per fmax per instance: 0.2990 FPS/MHz [Step 12/12] Dumping the output values [ INFO ] Dumping result of Graph_0 to result.txt, result.bin, result_meta.json, and result_tensor_boundaries.txt [ INFO ] Comparing ground truth file /home/root/quartus_uplift/resnet-50-tf/sample_images/TF_ground_truth.txt with network Graph_0 top1 accuracy: 100 % top5 accuracy: 100 % [ INFO ] Get top results for "Graph_0" graph passed