Visible to Intel only — GUID: GUID-033677EC-B97E-4AC5-A8F1-4452F2641CC6
Introduction
Getting Started
Parallelization
Intel® Iris® Xe GPU Architecture
GPU Execution Model Overview
SYCL* Thread Mapping and GPU Occupancy
Kernels
Using Libraries for GPU Offload
Host/Device Memory, Buffer and USM
Host/Device Coordination
Using Multiple Heterogeneous Devices
Compilation
Optimizing Media Pipelines
OpenMP Offloading Tuning Guide
Debugging and Profiling
GPU Analysis with Intel® Graphics Performance Analyzers (Intel® GPA)
Reference
Terms and Conditions
Sub-groups and SIMD Vectorization
Removing Conditional Checks
Registerization and Avoid Register Spills
Shared Local Memory
Pointer Aliasing and the Restrict Directive
Synchronization among Threads in a Kernel
Considerations for Selecting Work-group Size
Reduction
Kernel Launch
Executing Multiple Kernels on the Device at the Same Time
Submitting Kernels to Multiple Queues
Avoid Redundant Queue Construction
Visible to Intel only — GUID: GUID-033677EC-B97E-4AC5-A8F1-4452F2641CC6
oneAPI GPU Optimization Guide
Welcome to the oneAPI GPU Optimization Guide. This document gives tips for getting the best GPU performance for oneAPI programs.
- Introduction
- Getting Started
- Parallelization
- Intel® Iris® Xe GPU Architecture
- GPU Execution Model Overview
- SYCL* Thread Mapping and GPU Occupancy
- Kernels
- Using Libraries for GPU Offload
- Host/Device Memory, Buffer and USM
- Host/Device Coordination
- Using Multiple Heterogeneous Devices
- Compilation
- Optimizing Media Pipelines
- OpenMP Offloading Tuning Guide
- Debugging and Profiling
- GPU Analysis with Intel® Graphics Performance Analyzers (Intel® GPA)
- Reference
- Terms and Conditions