Ultimate Performance for AI/ML, HPC, and Visualization Workloads in the Cloud

Diagnosing cancer. Predicting hurricanes. Automating business operations for improved efficiencies. These are some of the breakthroughs possible when you use accelerated compute to uncover the insights hiding in vast volumes of data. Amazon Web Services (AWS) and NVIDIA have partnered to deliver the most powerful and advanced GPU-accelerated cloud to help clients build a more intelligent future.

GPU-Accelerated Cloud Services

Amazon Elastic Compute Cloud (EC2)

With its new P4d instance, available now, AWS is paving the way for another bold decade of accelerated computing powered by the latest NVIDIA A100 Tensor Core GPU.


Unprecedented Acceleration at Scale for AI, Data Analytics, and HPC

Amazon EC2 P4d (NVIDIA A100)


Universal Accelerator for All Workloads, Including Cloud Gaming

Amazon EC2 G4 (NVIDIA T4)


World’s First Tensor Core GPU to Accelerate HPC and AI

Amazon EC2 P3 (NVIDIA V100)

NGC Containers

Run a broad range of deep learning, machine learning, high-performance computing (HPC), and visualization applications on Amazon EC2 P3, G4, and P4d instances with quick, easy access to performance-engineered containers from the NGC software hub. NGC also offers pre-trained models and scripts to build optimized models for common use cases like image classification, object detection, text-to-speech, and more. To run NGC containers and take full advantage of NVIDIA A100, V100, and T4 GPUs on AWS, NVIDIA developed the NVIDIA Deep Learning AMI, available in AWS Marketplace.
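As a rough illustration of what running an NGC container on a GPU instance can look like, the Python sketch below only assembles a `docker run` command line and does not execute it. It assumes a host with Docker 19.03 or later and the NVIDIA container runtime installed (which is what enables the `--gpus` flag). The helper name `ngc_docker_command`, the image path `nvidia/mxnet`, the `<tag>` placeholder, and the `/mnt/host` mount point are illustrative assumptions, not values taken from this page; look up the actual image name and tag in the NGC catalog.

```python
import shlex

# Host name of the NGC container registry.
NGC_REGISTRY = "nvcr.io"


def ngc_docker_command(image, tag, host_dir=None):
    """Build (but do not run) a `docker run` argument list for an NGC image.

    `image` and `tag` are caller-supplied; this sketch does not check
    that they exist in the NGC catalog.
    """
    if not image or not tag:
        raise ValueError("image and tag are required")
    # --gpus all exposes every GPU on the instance to the container.
    cmd = ["docker", "run", "--gpus", "all", "--rm", "-it"]
    if host_dir is not None:
        # Hypothetical mount point inside the container for host data.
        cmd += ["-v", f"{host_dir}:/mnt/host"]
    cmd.append(f"{NGC_REGISTRY}/{image}:{tag}")
    return cmd


if __name__ == "__main__":
    # "<tag>" is a placeholder; substitute a real tag from the NGC catalog.
    print(shlex.join(ngc_docker_command("nvidia/mxnet", "<tag>")))
```

Returning the command as a list rather than a single string keeps it safe to hand to `subprocess.run` without shell quoting problems.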


Apache MXNet

Apache MXNet is a fast, scalable deep learning framework for training and inference. It includes the Gluon interface to let developers of all skill levels get started with deep learning in the cloud, on edge devices, and in mobile apps. Working with NVIDIA, AWS engineers and researchers have pre-optimized neural machine translation (NMT) algorithms on Apache MXNet. This approach allows developers and data scientists to train faster than ever before on NVIDIA AI platforms.

GPU-Accelerated Virtualized Graphics

The NVIDIA Quadro Virtual Workstation (Quadro vWS) AMI for GPU-accelerated graphics helps creative and technical professionals maximize their productivity from anywhere by accessing the most demanding professional design and engineering applications from the cloud. Amazon EC2 G4 (NVIDIA T4) instances, combined with the Quadro vWS AMI, enable the industry’s most advanced 3D graphics platform, including the latest real-time ray tracing with NVIDIA RTX technology in a virtual machine. NVIDIA M60 (G3) and V100 (P3) GPUs are also supported. All the instances are available on Windows Server 2016, Windows Server 2019, and Ubuntu 18.04.

Quadro Virtual Workstations on AWS Marketplace


The NVIDIA Gaming AMI driver enables cloud gaming on NVIDIA T4 server GPUs. Games can be streamed at resolutions up to 4K and 60 FPS. The new G4 instance type features the NVIDIA T4 GPU and supports this driver on Windows Server 2016, Windows Server 2019, and 64-bit Ubuntu (18.04).

