Accelerating CUDA C++ Applications With Concurrent Streams

Accelerating CUDA C++ Applications With Concurrent Streams

NVIDIA NIC-NVI-ACAW

EUR 26,90
exkl. MwSt.

The concurrent overlap of GPU computation and the transfer of memory to and from the GPU can drastically improve the performance of CUDA applications. In this workshop you will learn to utilize CUDA Streams to perform copy/compute overlap in CUDA C++ applications by:

• Learning the rules and syntax governing the use of concurrent CUDA Streams
• Refactoring and optimizing an existing CUDA C++ application to perform copy/compute overlap

Upon completion, you will be able to build robust and efficient CUDA C++ applications that can leverage copy/compute overlap for significant performance gains.

Kursvoraussetzungen

Professional experience programming CUDA C/C++ applications, including the use of the nvcc compiler, kernel launches, gridstride loops,host-to-device and device-to-host memory transfers, and CUDA error handling; Experience using Makefiles to compile C/C++ code.

Was du lernen wirst

The following topics are covered in this course:

  • CUDA C++
  • Concurrent CUDA Streams
  • Copy/Compute Overlap
  • Nsight Systems

Zusätzliche Informationen

The minimum order quantity for NVIDIA self-paced courses is 10.
When adding a course to the shopping cart, a quantity of 10 is automatically added. It may also take 2-3 working days for your course access to be activated. You will receive an email from us with all the necessary information.
Schreiben Sie Ihre eigene Bewertung

Nur registrierte Nutzer können Bewertungen abgeben. Bitte melden Sie sich an oder Erstellen Sie ein Benutzerkonto