This site requires Javascript in order to view all its content. Please enable Javascript in order to access all the functionality of this web site. Here are the instructions how to enable JavaScript in your web browser.

Instructor-Led Workshop
Fundamentals of Accelerated Computing with OpenACC

Request a workshop for your organization

Notify me when public workshops are available

This workshop covers the fundamental tools and techniques needed for accelerating C/C++ or Fortran applications to run on massively parallel GPUs with OpenACC. You will learn how to write code, configure code parallelization with OpenACC, optimize memory movements between the CPU and GPU accelerator, and implement the workflow that you have learned on a new task—accelerating a fully functional, but CPU-only, Laplace Heat Equation code for massive performance gains. At the end of the workshop, you will have access to additional resources for creating new GPU-accelerated applications on your own.

Learning Objectives

By participating in this workshop, you’ll have an understanding of the fundamental tools and techniques for GPU-accelerating C/C++ or Fortran applications with OpenACC and be able to:

Profile and optimize your CPU-only applications to identify hot spots for acceleration.
Use OpenACC directives to GPU-accelerate your codebase.
Optimize data movement between the CPU and GPU accelerator.

Download Workshop Datasheet (PDF 80.3 KB)

Workshop Outline

Introduction (15 mins)	Meet the instructor. Create an account at courses.nvidia.com/join
Introduction to Parallel Programming (50 mins)	Learn about parallelism in a conceptual way, as well as how to express it with OpenACC. Topics that will be covered are as follows: Introduction to parallelism The goals of OpenACC Basic parallelization of code using OpenACC
Break (10 mins)
Profiling with OpenACC (50 mins)	Learn how to build and compile an OpenACC code, the importance of profiling, and how to use the NVIDIA Nsight^™ Systems profiler. Topics that will be covered are as follows: Compiling sequential and OpenACC code The importance of code profiling Profiling sequential and OpenACC multicore code Technical introduction to the code used in introductory modules
Break (45 mins)
Introduction to OpenACC Directives (50 mins)	Learn how to parallelize your code with OpenACC directives and understand the differences between parallel, kernel, and loop directives. Topics that will be covered are as follows: The Parallel directive The Kernels directive The Loop directive
Break (10 mins)
GPU Programming with OpenACC (50 mins)	Learn about the differences between GPUs and multicore CPUs, and manage memory with CUDA Unified Memory. Topics that will be covered are as follows: Definition of a GPU Basic OpenACC data management CUDA Unified Memory Profiling GPU applications
Break (15 mins)
Data Management with OpenACC (50 mins)	Learn how to explicitly manage data movement with OpenACC data directives to reduce data transfers. Topics that will be covered are as follows: OpenACC data directive/clauses OpenACC structured data region OpenACC unstructured data region OpenACC update directive Data management with C/C++ Structs/Classes
Break (10 mins)
Loop Optimizations with OpenACC (50 mins)	Understand the various levels of parallelism on a GPU and learn ways to extract more parallelism with OpenACC by optimizing loops in your code. Topics that will be covered are as follows: Seq/Auto clause Independent clause Reduction clause Collapse clause Tile clause Gang, Worker, Vector
Break (10 mins)
Final Review (60 mins)	Review key learnings and answer questions. Complete the assessment and earn a certificate. Complete the workshop survey.
Next Steps	Continue learning with these DLI trainings: Fundamentals of Accelerated Computing with CUDA C/C++ Scaling Workloads Across Multiple GPUs with CUDA C++ High Performance Computing with Containers

Workshop Details

Duration: 8 hours

Price: Contact us for pricing.

Prerequisites:

Basic C/C++ or Fortran competency, including familiarity with variable types, loops, conditional statements, functions, and array manipulations.
No previous knowledge of GPU programming is assumed.

Tools, libraries, and frameworks: NVIDIA^® Nsight^™, OpenACC

Assessment Type:

Code-based

Certificate: Upon successful completion of the assessment, participants will receive an NVIDIA DLI certificate to recognize their subject matter competency and support professional career growth.

Hardware Requirements: Desktop or laptop computer capable of running the latest version of Chrome or Firefox. Each participant will be provided with dedicated access to a fully configured, GPU-accelerated workstation in the cloud.

Language: English

If your organization is interested in boosting and developing key skills in AI, accelerated data science, or accelerated computing, you can request instructor-led training from the NVIDIA DLI.

Request a Workshop

Instructor-Led Workshop
Fundamentals of Accelerated Computing with OpenACC

Learning Objectives

Workshop Outline

Workshop Details

Questions?

Read our FAQs.

Inquire about NVIDIA Deep Learning Institute services.

For technical questions, check out the NVIDIA Developer Forums.

Instructor-Led Workshop Fundamentals of Accelerated Computing with OpenACC

Learning Objectives

Workshop Outline

Workshop Details

Questions?

Read our FAQs.

Inquire about NVIDIA Deep Learning Institute services.

For technical questions, check out the NVIDIA Developer Forums.

Instructor-Led Workshop
Fundamentals of Accelerated Computing with OpenACC