Visible to Intel only — GUID: GUID-12E85A28-93D9-46CE-B27D-D085288EA556
Introduction
Getting Started
Parallelization
Intel® Iris® Xe GPU Architecture
GPU Execution Model Overview
SYCL* Thread Mapping and GPU Occupancy
Kernels
Using Libraries for GPU Offload
Host/Device Memory, Buffer and USM
Host/Device Coordination
Using Multiple Heterogeneous Devices
Compilation
Optimizing Media Pipelines
OpenMP Offloading Tuning Guide
Debugging and Profiling
GPU Analysis with Intel® Graphics Performance Analyzers (Intel® GPA)
Reference
Terms and Conditions
Sub-groups and SIMD Vectorization
Removing Conditional Checks
Registerization and Avoid Register Spills
Shared Local Memory
Pointer Aliasing and the Restrict Directive
Synchronization among Threads in a Kernel
Considerations for Selecting Work-group Size
Reduction
Kernel Launch
Executing Multiple Kernels on the Device at the Same Time
Submitting Kernels to Multiple Queues
Avoid Redundant Queue Construction
Visible to Intel only — GUID: GUID-12E85A28-93D9-46CE-B27D-D085288EA556
Reference
For more information, see:
Intel® oneAPI DPC++/C++ Compiler Developer Guide and Reference
Intel® Fortran Compiler Classic and Intel® Fortran Compiler Developer Guide and Reference
OpenMP Features and Extensions Supported in Intel® oneAPI DPC++/C++ Compiler
Fortran Language and OpenMP Features Implemented in Intel® Fortran Compiler (Beta)
Developer Reference for Intel® oneAPI Math Kernel Library - C