Understand the Workflow
Follow this workflow to use Intel® VTune™ Profiler to identify and analyze performance bottlenecks in your serial or parallel application. This tutorial guides you through the workflow using a sample application named matrix.
Prerequisites
Download these Intel software tools to your Linux system:
Intel® VTune™ Profiler version 2025.0 or newer
You can get both of these tools in the Intel® oneAPI Base Toolkit.
This tutorial uses the Intel® oneAPI DPC++/C++ Compiler to establish a common baseline for analysis and track performance gain. Your choice of a different compiler may change your results in this workflow
Workflow
To find and fix performance issues in the matrix sample application,
Establish a baseline for application performance.
Identify a bottleneck in the matrix application.
Eliminate memory access bottlenecks, if any.
Assess the performance improvement.
Address vectorization problems, if any.
Identify next steps
See the performance gain