AN 1003: Multi Memory IP System Resource Planning: for Intel Agilex® 7 M-Series FPGAs

ID 788295
Date 11/22/2023
Public
Document Table of Contents

4.11. Planning NoC and Memory Solution Clocks

The Intel Agilex® 7 M-Series FPGA device architecture offers a variety of clocking schemes. Understanding these clocking schemes for your application allows you to plan and optimize clocking usage for the best performance.

Well informed planning and placement of each clock domain can improve system level performance.

The hard memory NoC, memory Intel FPGA IP, and associated features provide:

  • Solutions for efficient clock resource utilization
  • Support for an increased number of design clocks
  • Improved clock characteristics for performance and power

This section covers only the clocking alternatives for data flow to and from the FPGA core. This section does not cover reference and PLL clock requirements for the hard memory NoC and memories. You can find more information on these subjects in External Memory Interfaces Intel Agilex 7 M-Series FPGA IP User Guide, High Bandwidth Memory (HBM2E) Interface Intel Agilex 7 FPGA IP User Guide, and Intel Agilex 7 M-Series FPGA Network-on-Chip (NoC) User Guide.

Summary of Clocking Schemes provides a summary of the available clocking schemes, including the supported frequencies and considerations. The frequency is for a single initiator. For complex designs, the supported frequency depends on your design timing closure.

Table 12.  Summary of Clocking Schemes
Clocking Configuration Read/Write Data Width Write Data Path User Clock (MHz) Read Data Path User Clock (MHz) Considerations
Clocking scheme without fabric NoC 256b - 256b <=660 (-1 speed grade) <=660 (-1 speed grade)
  • Limited read bandwidth.
  • Timing closure difficulties close to maximum allowed frequency for complex design.
  • An area efficient option.
<=630 (-2 speed grade) <=630 (-2 speed grade)
<=450 (-3 speed grade) <=450 (-3 speed grade)
Symmetric data path for both read and write with same clock 512b – 512b >=350 (-1 speed grade)
  • Can saturate HBM read bandwidth for -2 and -3 device speed grade.
  • Uses the Fabric NoC which requires added area utilization in core.
>=350 (-2 speed grade)
>=250 (-3 speed grade)
Asymmetric width read path and write path driven by common clock 512b – 256b >=350 to <= 660 (-1 speed grade)
  • Can saturate read bandwidth.
  • Uses the Fabric NoC which requires additional area utilization in core. You save some area relative to the symmetric clocking scheme because there is no requirement for clock crossing on 512 data paths.
  • Difficult timing closure.
  • Does not require clock crossing in core logic.
>=350 to <= 630 (-2 speed grade)
>=250 to <= 450 (-3 speed grade)
Asymmetric width read path and write path driven by separate clocks 512b – 256b <=660 (-1 speed grade) <=350 (-1 speed grade)
  • Can saturate read bandwidth.
  • Fabric NoC requires additional area utilization in core. You save some area relative to the above clocking scheme because there is no requirement for clock crossing on 512 data paths.
  • Depending on system, additional PLL is required.
  • Easier to close timing due to low frequency on read clock.
  • Write bandwidth is not saturated.
<= 630 (-2 speed grade) >=350 (-2 speed grade)
<= 450 (-3 speed grade) >=250 (-3 speed grade)
Note: In the ‘Symmetric data path for both read and write with the same clock’ clocking scheme, you must supply noc_bridge_fabric_clk through the NoC Initiator IP. This clock supports frequencies up to 660 MHz for -1 device speed grade, up to 630 MHz for -2 device speed grade, and up to 450 MHz for -3 device speed grade.