Intel® FPGA AI Suite: Compiler Reference Manual

ID 768972
Date 7/03/2023
Public

A newer version of this document is available. Customers should click here to go to the newest version.

Document Table of Contents

3.2. Estimating the Performance of a Graph

To estimate the performance of a graph on an architecture, use the --fanalyze-performance dla_compiler command option.

The dla_compiler command compiles the graph for the specified architecture to estimate its performance.

The list of required and optional arguments includes all the required and optional arguments from Compiling a Graph . In addition, the following options are specific to estimating graph performance:

Option

Description

--fanalyze-performance [Required] Enables the performance estimator.
--fassumed-fmax-core= <assumed fMAX> [Optional] Specifies the assumed fMAX of the compiled Intel® FPGA AI Suite IP.

The performance estimator does not have the ability to estimate fMAX of a given IP parameterization, nor does it know which speed grade the IP targets. Typically, the IP achieves 300 MHz or higher on a C2 Intel® Arria® 10 device.

The default fMAX depends on the device family:
  • Intel® Arria® 10 - 265 MHz
  • Intel Agilex® 7 - 400 MHz
  • Intel® Cyclone® 10 GX - 265 MHz
  • Intel® Stratix® 10 - 265 MHz
--fdump-performance-report [Optional] An optional output file for the performance estimate, otherwise the performance summary is displayed on the terminal.

The simplest dla_compiler command format for estimating the performance of a graph is as follows:

dla_compiler \
   --network-file <path to graph.xml> \
   --march <path to .arch file> \
   --fanalyze-peformance

Example Command

dla_compiler \
   --network-file ResNet50.xml ResNet101.xml \
   --march $COREDLA_ARCH/example_architectures/A10_Generic.arch \
   --fanalyze-performance \
   --fassumed-fmax-core=300