Intel® oneAPI HPC Toolkit
Deliver Fast Applications That Scale across Clusters
Optimized Tools for High-Performance Computing
High-performance computing (HPC) is at the core of AI, machine learning, and deep learning applications. Intel® oneAPI HPC Toolkit delivers what developers need to build, analyze, optimize, and scale HPC applications with the latest techniques in vectorization, multithreading, multi-node parallelization, and memory optimization.
This toolkit includes powerful data-centric libraries and advanced analysis tools.
Download the Toolkit
Deliver fast C++, Fortran, OpenMP*, and MPI applications that scale.
Purchase with Priority Support
Submit questions, problems, and other technical support issues through direct and private interactions with Intel’s engineers.
Features
Build
Simplify implementation of HPC applications on CPUs and accelerators with Intel’s industry-leading compiler technology and libraries.
Analyze
Quickly gauge how your application is performing, how resource use impacts your code, and where it can be optimized to ensure faster cross-architecture performance.
Scale
Deploy applications and solutions across shared memory and distributed memory (such as clusters) computing systems using the included standards-driven Intel® MPI Library and benchmarks, MPI analyzer, and cluster tuning tools.
What's Included
Intel® Fortran Compiler
Compile and optimize standard Fortran code for CPU and GPU architectures.
Intel® MPI Library
Deliver flexible, efficient, scalable cluster messaging on CPU and GPU architectures and a variety of network fabrics.
Intel® oneAPI DPC++/C++ Compiler
Compile and optimize standard C++ and SYCL* code for CPU, GPU, and FPGA target architectures.
Intel® DPC++ Compatibility Tool
Migrate legacy CUDA* code to open multiarchitecture SYCL code with this assistant.
Intel® VTune™ Profiler
Find and optimize performance bottlenecks across CPU, GPU, and FPGA systems.
Intel® Advisor
Design code for efficient vectorization, threading, and offloading to accelerators.
Intel® Distribution for GDB*
Enable deep, system-wide debugging of SYCL, C, C++, and Fortran code.
Intel® oneAPI DPC++ Library (oneDPL)
Speed up data parallel workloads with these key productivity algorithms and functions.
Intel® oneAPI Threading Building Blocks (oneTBB)
Simplify parallelism with this advanced threading and memory-management template library.
Intel® oneAPI Math Kernel Library (oneMKL)
Accelerate math processing routines that include matrix algebra, fast Fourier transforms (FFT), and vector math.
Intel® Integrated Performance Primitives
Speed up performance of imaging, signal processing, data compression, cryptography, and more.
Intel® Cryptography Primitives Library
These secure, fast, lightweight building blocks for cryptography are optimized for Intel CPUs.
Intel® oneAPI Data Analytics Library (oneDAL)
Boost machine learning and data analytics performance.
Intel® oneAPI Deep Neural Network Library (oneDNN)
Develop fast neural networks on Intel CPUs and GPUs with performance-optimized building blocks.
Intel® oneAPI Collective Communications Library (oneCCL)
Implement optimized communication patterns to distribute deep learning model training across multiple nodes.
FPGA Support Package for the Intel® oneAPI DPC++/C++ Compiler (separate download required)
Accelerate your register transfer level (RTL) development with SYCL high-level synthesis (HLS) or program FPGA accelerators to speed up specialized, data-centric workloads. This requires installing the Intel® oneAPI Base Toolkit (Base Kit).
Bundles
In addition to the full toolkit, download smaller bundles for specific development use cases.
This bundle is for C++ and SYCL developers who want to compile, debug, and use the most popular performance libraries in the Base Kit for Intel CPUs and GPUs. It includes:
- Intel® Distribution for GDB*
- Intel® DPC++ Compatibility Tool
- Intel® oneAPI DPC++/C++ Compiler
- oneDPL
- oneMKL
- oneTBB
Fortran developers with numeric workloads can use this bundle to compile, debug, and use the most popular performance libraries in the Intel oneAPI HPC Toolkit for Intel CPUs and GPUs. It includes:
- Intel® Fortran Compiler
- Intel Distribution for GDB
- Intel MPI Library
- oneMKL
Get Started
Get Your System Ready
Check that you have the supported hardware and that your system meets the minimum requirements to install the HPC Kit.
Download the HPC Kit
After downloading, follow the Get Started Guide to configure your system and run your first sample.
Get Started Guide: Linux* | Windows* | Containers
Next Steps
Access samples or run your own workloads with the help of tutorials and training.
Documentation & Code Samples
Code Samples
Learn how to access oneAPI code samples in a tool command line.
- Particle Diffusion
- Matrix Multiplication with CPUs and GPUs
- ISO3DFD Finite Difference Stencil Kernel - SYCL
- Mandelbrot
- ISO2DFD Finite Difference Stencil Kernel - SYCL
- 1d HeatTransfer Finite Difference Stencil Kernel - SYCL
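The stencil samples listed above are SYCL programs. As a rough illustration of the numerical pattern behind a 1D heat-transfer stencil, here is a minimal serial sketch in standard C++. It is not the toolkit's sample code: the names `heat_step` and `heat_run`, the fixed-boundary treatment, and the coefficient `r` (standing for alpha * dt / dx^2) are assumptions chosen for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper: one explicit finite-difference time step of the
// 1D heat equation u_t = alpha * u_xx.
// r stands for alpha * dt / dx^2; the explicit scheme is stable for r <= 0.5.
std::vector<double> heat_step(const std::vector<double>& u, double r) {
    assert(u.size() >= 2);
    // Start from a copy so the two boundary cells keep their values
    // (fixed-temperature boundaries, an assumption of this sketch).
    std::vector<double> next(u);
    for (std::size_t i = 1; i + 1 < u.size(); ++i) {
        // Three-point stencil: left neighbor, center, right neighbor.
        next[i] = u[i] + r * (u[i - 1] - 2.0 * u[i] + u[i + 1]);
    }
    return next;
}

// Hypothetical helper: advance the field by a number of time steps.
std::vector<double> heat_run(std::vector<double> u, double r, int steps) {
    for (int s = 0; s < steps; ++s) {
        u = heat_step(u, r);
    }
    return u;
}
```

Calling `heat_step({0, 0, 1, 0, 0}, 0.25)` spreads the central spike into its neighbors and returns `{0, 0.25, 0.5, 0.25, 0}`. Each interior cell of the inner loop is independent of the others, which is why this kind of loop is a common candidate for SYCL or OpenMP parallelization.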
Training
OpenMP* Offload Basics [self-paced]
Learn the fundamentals of using OpenMP offload directives to target GPUs, as well as using Intel® C, C++, and Fortran Compilers through hands-on practice in this guided learning path.
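To give a sense of what an OpenMP offload directive looks like, here is a minimal sketch in C++. It is not taken from the training material: the function name `saxpy_offload` and the choice of a scaled vector addition are assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical example: computes y = a * x + y element by element.
// The OpenMP target directive asks the runtime to run the loop on a device
// (such as a GPU) when one is available. If the code is compiled without
// OpenMP support, the pragma is ignored and the loop runs serially on the host.
void saxpy_offload(float a, const std::vector<float>& x, std::vector<float>& y) {
    assert(x.size() == y.size());
    const std::size_t n = x.size();
    const float* xp = x.data();
    float* yp = y.data();

    // map(to: ...) copies x to the device; map(tofrom: ...) copies y to the
    // device before the loop and back to the host afterwards.
    #pragma omp target teams distribute parallel for map(to: xp[0:n]) map(tofrom: yp[0:n])
    for (std::size_t i = 0; i < n; ++i) {
        yp[i] = a * xp[i] + yp[i];
    }
}
```

With the Intel oneAPI DPC++/C++ Compiler, offload builds are typically enabled with options such as `icpx -fiopenmp -fopenmp-targets=spir64`; check the compiler documentation for the options that apply to your version and target hardware.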
Essentials of SYCL [self-paced]
Learn the basics of SYCL for heterogeneous computing (CPU, GPU, FPGA, and AI accelerators) using live sample code on the Intel® Developer Cloud.
Intel® oneAPI Math Kernel Library (oneMKL) Essentials [self-paced]
Learn how to use oneMKL and its functions to create performant applications and speed up computations with low-level math routines.
Specifications
CPUs:
- Intel® Xeon® processor family
- Intel® Core™ processor family
- Other processors that are compatible with Intel® 64 architecture
GPUs:
- Intel® UHD Graphics for 11th generation Intel processors or newer
- Intel® Iris® Xe graphics
- Intel® Arc™ graphics
- Intel® Server GPU
- Intel® Data Center GPU Flex Series
- Intel® Data Center GPU Max Series
- NVIDIA* or AMD* GPUs using plug-ins from Codeplay*
Languages:
- C++ with SYCL
- Fortran
Note: Requires Microsoft Visual Studio* on Windows.
Operating systems:
- Windows
- Linux
Development environments:
- Compatible with compilers from Microsoft*, GCC*, Intel, and others that follow established language standards
- Windows: Microsoft Visual Studio
- Linux: Eclipse*
Distributed environments:
- MPI
OpenFabrics Interface* (OFI) framework implementation supporting the following:
- InfiniBand*
- iWARP, RDMA over Converged Ethernet (RoCE)
- Amazon Web Services Elastic Fabric Adapter (AWS EFA)*
- Cornelis Networks*
- Ethernet, IP over InfiniBand (IPoIB), IP over Intel OPA
For more information, see the system requirements.
Get Help
Your success is our success. Access these support resources when you need assistance.
For additional help, see our general oneAPI Support.