Find out how to build and optimize accelerated heterogeneous applications on multiple GPU clusters using a combination of OpenACC, CUDA- aware MPI, and NVIDIA profiling tools.
Course Prerequisites
Basic experience with C/C++
What you will learn
In this course, you’ll learn:
- How to profile and optimize your CPU-only applications to identify hot spots for acceleration
- How to use OpenACC directives to GPU accelerate your codebase
- How to optimize data movement between the CPU and GPU accelerator Upon completion, you'll be ready to use OpenACC to GPU accelerate CPU-only applications.
Additional information
The minimum order quantity for NVIDIA self-paced courses is 10.
When adding a course to the shopping cart, a quantity of 10 is automatically added. It may also take 2-3 working days for your course access to be activated. You will receive an email from us with all the necessary information.
When adding a course to the shopping cart, a quantity of 10 is automatically added. It may also take 2-3 working days for your course access to be activated. You will receive an email from us with all the necessary information.