Optimizing CUDA Machine Learning Codes With NVIDIA Nsight™ Profiling Tools

Optimizing CUDA Machine Learning Codes With NVIDIA Nsight™ Profiling Tools

NVIDIA NIC-NVI-CMLC

USD 30.00
excl. VAT

NVIDIA Developer Tools are a collection of applications, spanning desktop and mobile targets, that enable developers to build, debug, profile, and develop class-leading and cutting-edge software using the latest visual computing hardware from NVIDIA.

Nsight Systems provide developers with a system-wide visualization of an application’s performance.Developers can optimize bottlenecks to scale efficiently across any number or size of CPU and GPU—from large servers to the smallest systems on chip. Nsight Compute is an interactive kernel profiler for CUDA applications. It provides detailed performance metrics and API debugging via a user interface and command-line tool.

By the time you complete this course, you’ll be able to use Nsight Systems and Nsight Compute to analyze and optimize CUDA applications. Following best practices, you’ll begin by using Nsight Systems to analyze overall application structure and explore
parallelization opportunities before turning to Nsight Compute to analyze and optimize individual CUDA kernels.

Course Prerequisites

Familiarity with machine learning applications using CUDA. We suggest Fundamentals of Accelerated Computing
With CUDA C/C++

What you will learn

In this course, you’ll learn the effective use of two powerful NVIDIA developer tools: Nsight Systems and Nsight Compute.

What’s included

PLEASE NOTE: It may take 2-3 business days for your course access to be activated. You will receive an email from us with all necessary details

Additional information

The minimum order quantity for NVIDIA self-paced courses is 10.
When adding a course to the shopping cart, a quantity of 10 is automatically added. It may also take 2-3 working days for your course access to be activated. You will receive an email from us with all the necessary information.