High-Performance GPU Acceleration—Part 1: Code Design
High-Performance GPU Acceleration—Part 1: Code Design
Subscribe Now
Stay in the know on all things CODE. Updates are delivered to your inbox.
Overview
Heterogeneous computing comes with the challenge of designing code that can work in multiprocessor and multiaccelerator environments. Developers need to be equipped with the right set of metrics to make informed design and optimization decisions that take advantage of target hardware.
In Part 1 of this two-part webinar series, technical consulting engineer Cory Levels focuses on designing software for efficient offload from CPUs to GPUS—even before final hardware is available—using Intel® Advisor. Using a walkthrough of an ISO 3D Isotropic Finite Difference example, learn how to:
- Optimize your CPU application for memory and compute.
- Identify efficient GPU offload opportunities and quantify the potential performance speedup.
- See performance headroom of your GPU offloaded code against hardware limitations, and get insights for an effective optimization roadmap.
Featured Software
- Get Intel Advisor as part of the Intel® oneAPI Base Toolkit—a foundational set of tools and libraries for developing high-performance, data-centric applications across diverse architectures.
- Get the stand-alone version of Intel Advisor.
Other Resources
- Sign up for an Intel® Developer Cloud for oneAPI account—a free development sandbox with access to the latest Intel hardware and oneAPI software.
- Explore oneAPI including developer opportunities and benefits.
- Subscribe to Code Together— an interview series that explores the challenges at the forefront of cross-architecture development. Each biweekly episode features industry VIPs who are blazing new trails through today’s data-centric world. Available wherever you get your podcasts.
Design code for efficient vectorization, threading, memory use, and accelerator offload. Supports C, C++, Fortran, SYCL*, OpenMP*, OpenCL™ programs, and Python*.
You May Also Like
Related Videos