Intel® oneAPI Base Toolkit Release Notes

ID 835744
Updated 10/11/2024
Version 2024.2.0
Public

author-image

By

System Requirements

Please see Intel oneAPI Base Toolkit System Requirements

Intel® oneAPI Base Toolkit Major Component Versions

Please visit Intel® oneAPI Toolkit and Component Versioning Schema for semantic versioning schema detail. 

The following table contains major versions of components in the latest oneAPI Base Toolkit 2024.2.0

Component Name Version
Intel® oneAPI DPC++ Compiler 2024.2.0
Intel® oneAPI DPC++ Library 2022.6.0
Intel® DPC++ Compatibility Tool 2024.2.0
Intel® oneAPI Math Kernel Library 2024.2.0
Intel® Distribution for GDB* 2024.2.0
Intel® VTune™ Profiler 2024.2.0
Intel® Advisor 2024.2.0
Intel® oneAPI Threading Building Blocks 2021.13.0
Intel® Integrated Performance Primitives 2021.12.0
Intel® Integrated Performance Primitives Cryptography 2021.11.0
Intel® oneAPI Collective Communications Library 2021.13.0
Intel® oneAPI Data Analytics Library 2024.3.0
Intel® oneAPI Deep Neural Networks Library 2024.2.0

 

Note:  For Intel® AI Tools, please ensure you install the compatible  version of the Intel® oneAPI Base Toolkit.  Example:  2024.1 Base Kit with 2024.1 AI Tools.

New in Intel® oneAPI Base Toolkit 2024.2

Toolkit Level Updates

  • Take your application's efficiency to the next level with the Intel® oneAPI DPC++/C++ Compiler's enhanced SYCL* Graph capabilities, now featuring pause/resume support for better control and graph profiling to tune for more performance. Additionally the Intel® oneAPI DPC++/C++ Compiler delivers more SYCL performance on Windows with default context enabled. With the latest release the kernel compiler introduces SPIR-V support, and OpenCL* query support, allowing for greater flexibility and optimization in your compute kernels. 
  • Our latest OpenMP* enhancements include support for omp_target_memset() and omp_target_memset_async(), enabling developers to efficiently initialize large data on target devices, reducing overhead and accelerating parallel computing tasks. Additionally the compiler emits detailed remarks about OpenMP loop collapsing under the -qopt-report option. Gain valuable insights into your loop transformations and make informed decisions to fine-tune your application's performance.
  • Enhance your debugging experience in Microsoft* Visual Studio* and VS Code* with the Intel® Distribution for GDB*'s new Lane Variable Watch Window, allowing you to monitor and analyze variables more efficiently, leading to quicker problem resolution and enhanced application stability.
  • Strengthen the security of your applications with expanded Control-flow Enforcement Technology (CET) in Intel® Distribution for GDB* which now includes Shadow Stack capabilities to efficiently debug applications and enhance the reliability of your software.
  • Use Intel® VTune™ Profiler to gain insights into sub-optimal Intel® oneAPI Collective Communications Library (oneCCL) communication in your applications by finding out the time spent in oneCCL calls and identifying most active oneCCL communication tasks in your application. 
  • Intel® oneAPI DPC++ Library (oneDPL) adds new C++ Standard Template Library (STL) copy_if and inclusive_scan algorithm extensions for developers to write parallel programs for multiarchitecture devices.  The performance of many existing algorithms* are also improved on Intel and other vendors' GPUs.
  • Apps run faster on 5th Gen Intel® Xeon® Processors with Intel® oneAPI Threading Building Blocks (oneTBB) optimized thread synchronization to reduce startup latency
  • Apps run faster using oneTBB parallel_reduce improved data movement to avoid extra copying
  • Intel® oneAPI Math Kernel Library (oneMKL) 2024.2 introduces enhanced performance of 2D and 3D real and complex FFT targeted for Intel® Data Center GPU Max Series.
  • To extend sparsity functions across Intel® oneAPI Data Analytics Library (oneDAL) algorithms, this release adds DPC++ sparse gemm and gemv primitives and includes sparsity support for the logloss  function primitive.
  • Intel® oneAPI Collective Communications Library (oneCCL) introduces multiple enhancements that improve the utilization of system resources such as memory and I/O, unlocking even better performance.
  • Intel® Distribution for Python* added the following features:
    • The Data Parallel Control Library offers improved productivity with new sorting and summing functions along with updated documentation and bug fixes. 
    • The Data Parallel Extension for NumPy increases productivity with the addition of a new family of cumulative functions and improved linear algebra functions.
  • Intel® oneAPI Deep Neural Network Library (oneDNN) 2024.2 introduces:
    • Enhanced Performance for next generation client platforms: Experience faster and more efficient processing with broad production quality optimizations, maximizing the performance potential of upcoming AI enhanced Intel client processors.
    • Optimized Performance for next generation server platforms: Future-proof your systems with enhanced production quality optimizations, ensuring top-tier performance for upcoming Intel Xeon Scalable processors.
    • Improved Large Language Model Performance: Boost the efficiency of your AI workloads with support for int8 and int4 weight decompression in matmul, accelerating large language models with compressed weights for faster insights and results.
  • Intel® Integrated Performance Primitives added the following features:
    • Improved compression ratio and throughput with new optimization patch for zlib 1.3.1
    • Accelerated image processing capabilities on select color conversion functions using Intel® AVX-512
  • Intel® Integrated Performance Primitives Cryptography added the following features:
    • Enhanced data protection in post-quantum era, with new Intel-optimized LMS post-quantum crypto algorithm
    • Advanced AES-GCM performance on 5th Gen Intel® Xeon® Scalable Processors and Intel® Core™ Ultra processors, with simplified integration with new code sample
  • Visual AI and imaging apps using bindless textures can accelerate on multi-vendor GPUs, with Intel® DPC++ Compatibility Tool option enabled migration to SYCL* image API extension
  • Save time validating migrated SYCL is equivalent to original code using Intel® DPC++ Compatibility Tool to auto compare kernel run logs and report differences
  • Easily migrate to SYCL with Intel® DPC++ Compatibility Tool migrating 126 more commonly used CUDA APIs

Intel® oneAPI DPC++ Compiler 2024.2.0

  • The Intel® oneAPI DPC++/C++ Compiler added enhanced SYCL Graph capabilities, now featuring pause/resume support for better control and graph profiling to tune for more performance. 
  • The Intel® oneAPI DPC++/C++ Compiler delivers more SYCL performance on Windows with default context enabled. With the latest release the kernel compiler introduces SPIR-V support, and OpenCL* query support, allowing for greater flexibility and optimization in your compute kernels. 
  • Our latest OpenMP enhancements include support for omp_target_memset() and omp_target_memset_async(), enabling developers to efficiently initialize large data on target devices, reducing overhead and accelerating parallel computing tasks. Additionally the compiler emits detailed remarks about OpenMP loop collapsing under the -qopt-report option. Gain valuable insights into your loop transformations and make informed decisions to fine-tune your application's performance.
  • Ensure greater reliability, stability, and security in C++ applications that offload computational tasks to the GPU with the newly added device-side LLVM Address Sanitizer support in Intel® oneAPI DPC++/C++ Compiler to swiftly detect and diagnose memory-related bugs.

Intel® oneAPI DPC++ Library 2022.6.0

  • Intel® oneAPI DPC++ Library adds new C++ Standard Template Library (STL) inclusive_scan algorithm extension for developers to write parallel programs for multiarchitecture devices. 
  • The performance of many existing algorithms like reduce, min_element, max_element, minmax_elelment, is_partitioned, lexicograpical_compare, binary_search, lower_bound and upper_bound are also improved on Intel and other vendors' GPUs.

Intel® DPC++ Compatibility Tool 2024.2.0

  • Visual AI and imaging apps using bindless textures can accelerate on multi-vendor GPUs, with Intel® DPC++ Compatibility Tool option enabled migration to SYCL image API extension
  • Save time validating migrated SYCL is equivalent to original code using Intel® DPC++ Compatibility Tool to auto compare kernel run logs and report differences
  • Easily migrate to SYCL with Intel® DPC++ Compatibility Tool migrates 126 more commonly used CUDA APIs

Intel® oneAPI Math Kernel Library 2024.2.0

  • Intel®oneAPI Math Kernel Library (oneMKL) 2024.2 introduces enhanced performance of 2D and 3D real and complex FFT targeted for Intel® Data Center GPU Max Series.
  • Several other optimizations for various domains.
  • Various bug fixes

Intel® Distribution for GDB* 2024.2.0

  • Intel® Distribution for GDB* now supports Intel® Core™ Ultra processors on Windows*.
  • Intel® Distribution for GDB* has enhanced debugging experience in Microsoft* Visual Studio* and VS Code* with the new Lane Variable Watch Window, allowing you to monitor and analyze variables more efficiently, leading to quicker problem resolution and enhanced application stability.
  • Strengthen the security of your applications with expanded Control-flow Enforcement Technology (CET) support, now including Shadow Stack capabilities to efficiently debug applications and enhancing the reliability of your software.
  • Various improvements and bug fixes in GPU handling to efficiently debug applications that utilize GPU offload.

Intel® VTune™ Profiler 2024.2.0

  • Get insights into sub-optimal oneCCL communication in your applications by finding out the time spent in oneCCL calls and identifying most active oneCCL communication tasks in your application. 
  • Added support for .NET 8 and new Intel architectures code named  Sierra Forest and Grand Ridge.
  • Technical preview feature: Get a high-level view of potential bottlenecks in software performance analysis before exploring top-down microarchitecture metrics for deeper analysis. Currently supports 5th Gen Intel® Xeon® processors (code-named Emerald Rapids), 4th Gen Intel® Xeon® Scalable processors (code-named Sapphire Rapids), and the Intel® Xeon® CPU Max Series (code-named Sapphire Rapids HBM).
  • Faster performance profiling of GPU workloads running on a specific tile.  

Intel® Advisor 2024.2.0

 

  • Intel® Advisor added AMX profiling support on Sapphire Rapids. 
  • Added significant improvements in understanding compute kernels and multi-GPU setups.
  • Added HTML improvements and fixed all legacy Coverity issues.

Intel® oneAPI Threading Building Blocks 2021.13.0

  • Apps run faster on 5th Gen Xeon with Intel® oneAPI Threading Building Blocks (oneTBB) optimized thread synchronization to reduce startup latency
  • OpenVINO run up to 4X faster on ARM CPU (including Apple Mac) using oneTBB improved multi-thread synchronization strategies
  • Apps run faster using oneTBB parallel_reduce improved data movement to avoid extra copying

Intel® Integrated Performance Primitives 2021.12.0

What’s new in Intel® IPP:

  • Experience better compression ratio and throughput in your data compression tasks with new optimization patch for zlib 1.3.1
  • Accelerated image processing capabilities on select color conversion functions using Intel® AVX-512 VNNI
  • Enhanced stability and several bug fixes

What’s new in Intel® IPP Cryptography:

    • Enhanced data protection in post-quantum era, using Intel-optimized LMS post-quantum crypto algorithm for single buffer implementation.
    • Optimized and advanced AES-GCM performance on 5th Gen Intel® Xeon® Scalable Processors and Intel® Core™ Ultra processors, with simplified implementation with new code sample.
    • Maximize adaptability and streamline development with Clang 16.0 compiler support for Linux

Intel® oneAPI Collective Communications Library 2021.13.0

  • In this release, oneCCL introduces multiple enhancements that improves the utilization of system resources such as memory and I/O, unlocking even better performance.

Intel® oneAPI Data Analytics Library 2024.3.0

  • To extend sparsity functions across oneDAL algorithms, this release adds DPC++ sparse gemm and gemv primitives and includes sparsity support for the logloss  function primitive.

Intel® oneAPI Deep Neural Networks Library 2024.2.0

Intel® oneAPI Deep Neural Network Library (oneDNN) 2024.2 introduces:

  • Enhanced Performance for next generation client platforms: Experience faster and more efficient processing with broad production quality optimizations, maximizing the performance potential of upcoming AI enhanced Intel client processors.
  • Optimized Performance for next generation server platforms: Future-proof your systems with enhanced production quality optimizations, ensuring top-tier performance for upcoming Intel Xeon Scalable processors.
  • Improved Large Language Model Performance: Boost the efficiency of your AI workloads with support for int8 and int4 weight decompression in matmul, accelerating large language models with compressed weights for faster insights and results.

Deprecation Notices

  • SLES 15SPR3 and Ubuntu 20.04 support on CPU are deprecated with 2024.2 release and will be removed in a future release.
  • The Diagnostics Utility for Intel® oneAPI Toolkits is now deprecated and will be discontinued with the 2025.0 release.

Toolkit Level Updates

  • Intel oneAPI Base Toolkit 2024.0.1 now includes recent component patch releases. 
  • Patches are built on top of previous patch releases as needed.

Intel® oneAPI DPC++ Compiler 2024.0.2

  • Minor bug fixes

Intel® Integrated Performance Primitives Cryptography 2021.9.1 

  • This patch release fixes an algorithmic issue in the AES-XTS Intel® Advanced Vector Extensions 512 (Intel® AVX-512) code path.

Intel® oneAPI Collective Communications Library 2021.11.2

  • This update provides bug fixes to maintain driver compatibility for Intel® Data Center GPU Max Series.

Intel® oneAPI Data Analytics Library 2024.0.1

  • New features and bug fixes. See the Release Notes for more information.

Installation Instructions

Please visit Installation Guide for Intel oneAPI Toolkits

How to Start Using the Tools

Please reference:

Known Issues, Limitations and Workarounds

  1. Known Issue: The modulefiles included with Intel® VTune™ Profiler and Intel® Advisor incorrectly derive the component root path as "/". Workarounds for this issue include:
    • For Intel VTune Profiler:
      1. Click here to download the fixed tcl file and replace the existing 2024.0 file located in <install-dir>/vtune/2024.0/etc/modulefiles/vtune/
      2. Instead of using "module load" to set up the environment variables, run:

        $ source <install-dir>/vtune/latest/vtune-vars.sh

    • For Intel Advisor:

      1. Click here to download the fixed tcl file and replace the existing 2024.0 file located in <install-dir>/advisor/2024.0/etc/modulefiles/advisor/

      2. Instead of using "module load" to set up the environment variables, run:

        $ source <install-dir>/advisor/latest/advisor-vars.sh

  2. Known Issue: There is a known issue integrating Intel software developer tools (Intel® oneAPI Base Toolkit, Intel® HPC Toolkit, or their component products) into Microsoft Visual Studio* 2022 (17.7 or higher) on offline systems with the Windows Performance Toolkit (Win11SDK_WindowsPerformanceToolkit) installed. This results in an incomplete integration. To work around the issue, either enable an Internet connection during Intel developer tools installation or uninstall the Windows Performance Toolkit before installing Intel developer tools, after Intel developer tools are installed, reinstall the Windows Performance Toolkit.
  3. Known Issue: When using Intel® oneAPI DPC++/C++ Compiler on a Linux machine, users may run into an issue if the highest version of GNU gcc detected doesn't have the equivalent g++ package installed. More details on the error and workarounds can be found here.
  4. Please read the whitepaper on Challenges, tips, and known issues when debugging heterogeneous programs using DPC++ or OpenMP offload 
  5. Limitations
    1. Running any GPU code on a Virtual Machine is not supported at this time.
    2. If you have chosen to download the Get Started Guide to use offline, viewing it in Chrome may cause the text to disappear when the browser window is resized. To fix this problem, resize your browser window again, or use a different browser.
    3. Eclipse* 4.12: the code sample project created by IDE plugin from Makefile will not build. It is a known issue with Eclipse 4.12. Please use Eclipse 4.9, 4.10 or 4.11.


Release Notes for All Tools included in Intel® oneAPI Base Toolkit

Previous oneAPI Releases

Notices and Disclaimers

Intel technologies may require enabled hardware, software or service activation.

No product or component can be absolutely secure.

Your costs and results may vary.

© Intel Corporation. Intel, the Intel logo, and other Intel marks are trademarks of Intel Corporation or its subsidiaries. Other names and brands may be claimed as the property of others.

No license (express or implied, by estoppel or otherwise) to any intellectual property rights is granted by this document.

The products described may contain design defects or errors known as errata which may cause the product to deviate from published specifications. Current characterized errata are available on request.

Intel disclaims all express and implied warranties, including without limitation, the implied warranties of merchantability, fitness for a particular purpose, and non-infringement, as well as any warranty arising from course of performance, course of dealing, or usage in trade.