Application Performance Snapshot User Guide for Linux* OS

ID 772048
Date 10/31/2024
Public

Common Analyses

This section describes report types that are available for all applications, whether they use MPI or not.

Counters Report

Key:-o [--counters]

This report displays the minimum, maximum, and average values of all collected MPI and hardware metrics. This report also shows total values for metrics, where applicable. This report also shows the node or rank where the minimum or maximum value was observed next to the value.

Example:

| Counters, Memory and Disk usage statistics for all Ranks |--------------------------------------------------------- Computation: 40.11% MPI : 59.89% | Virtual Memory Usage per Rank: Total: 134246164.66 MB (all ranks) MIN: 33560779.87 MB (rank 3) MAX: 33562304.46 MB (rank 0) AVG: 33561541.16 MB | Resident Memory Usage per Rank: Total: 1450.39 MB (all ranks) MIN: 129.72 MB (rank 1) MAX: 593.68 MB (rank 0) AVG: 362.60 MB | MPI Time per Rank: Total: 13.48 sec (all ranks) MIN: 1.12 sec (rank 2) MAX: 6.01 sec (rank 1) AVG: 3.37 sec [...] | SP GFLOPS Total: 0.00 MIN: 0.00 (node s011-n005) MAX: 0.00 (node s011-n005) AVG: 0.00 [...]

Node Topology Report

Key:--node-topology

This report shows the association between ranks, nodes, and PCIe devices in the collected data. When the rank filter --rank is applied, the report displays node information for nodes where the specified rank was executed.

Example:

| Node topology |--------------------------------------------------------- Node s011-n004: ranks: 0-1 devices: Bridge: Intel Corporation Device 0x2020 0000:00:00.0 Display controller: Intel Corporation Device 0x4905 0000:1c:00.0 Display controller: Intel Corporation Device 0x4905 0000:6a:00.0 Mass storage controller: Intel Corporation Device 0xfaf0 0000:b3:00.0 Node s011-n005: ranks: 2-3 devices: Bridge: Intel Corporation Device 0x2020 0000:00:00.0 Display controller: Intel Corporation Device 0x4905 0000:1c:00.0 Display controller: Intel Corporation Device 0x4905 0000:6a:00.0 Mass storage controller: Intel Corporation Device 0xfaf0 0000:b3:00.0

Metrics Report

Key:--metrics arg

This report shows a table with performance metrics for each rank, node, and PCIe device.

You can specify the exact metrics to be displayed in the table.

Use --metrics=? to get the list of metrics available in the collected data.

Use --metrics=all to display all available metrics.

Use --metrics="Metric 1, Metric 2, Metric 3" to display a specific set of metrics.

Example:

| Metric Table |--------------------------------------------------------- Rank Node Name Device Type Device Name Metric Name Metric Value 0 s011-n004 N/A N/A MPI Time, usec 1.37737e+06 N/A s011-n004 N/A N/A Elapsed Time, sec 6.72061 N/A s011-n004 GPU Display controller: Intel Corporation Device 0x4905 0000:1c:00.0 Inbound PCIe Write, MB/s 103.57 N/A s011-n004 Other Mass storage controller: Intel Corporation Device 0xfaf0 0000:b3:00.0 Inbound PCIe Write, MB/s 29.9996 N/A s011-n004 Other Bridge: Intel Corporation Device 0x2020 0000:00:00.0 Inbound PCIe Write, MB/s 3.55524 N/A s011-n004 GPU Display controller: Intel Corporation Device 0x4905 0000:6a:00.0 Inbound PCIe Write, MB/s 0