Visit your regional NVIDIA website for local content, pricing, and where to buy partners specific to your country.
Purpose-built to solve the world’s largest computing problems.
As models explode in complexity, accelerated computing and energy efficiency are becoming critical to meet the demands of AI. The NVIDIA Grace™ CPU is a groundbreaking Arm® CPU with uncompromising performance and efficiency. It can be tightly coupled with a GPU to supercharge accelerated computing or deployed as a powerful, efficient standalone CPU. The NVIDIA Grace CPU is the foundation of next-generation data centers and can be used in diverse configurations for different data center needs.
The NVIDIA GB200 Grace Blackwell Superchip combines two NVIDIA Blackwell Tensor Core GPUs and a Grace CPU and can scale up to the GB200 NVL72, a massive 72-GPU system connected by NVIDIA® NVLink®, to deliver 30X faster real-time inference for large language models.
The NVIDIA Grace Hopper™ Superchip combines the Grace and Hopper architectures using NVIDIA® NVLink®-C2C to deliver a CPU+GPU coherent memory model for accelerated AI and high-performance computing (HPC) applications.
The NVIDIA Grace CPU Superchip uses the NVLink-C2C technology to deliver 144 Arm Neoverse V2 cores and 1 terabyte per second (TB/s) of memory bandwidth.
Learn how NVIDIA Grace CPUs are powering the latest large-memory supercomputers.
The NVIDIA GB200 NVL72 Grace Blackwell Superchip is the architecture for the next generation of AI, supercharging real-time trillion-parameter language models.
World’s first HBM3e processor offers groundbreaking memory and bandwidth for the era of accelerated computing and generative AI.
Arm-based NVIDIA Grace Hopper™ Superchip, BlueField®-3 DPU and Aerial™ SDK power revolutionary architecture for generative AI and 5G/6G communications.
Take a look at the latest energy-efficient Arm supercomputers for climate science, medical research and more, powered by NVIDIA Grace CPU.
GH200-powered systems join 400+ system configurations that global systems makers are rolling out to meet the surging demand for generative AI.
Learn how QCT and Supermicro are adopting modular designs to quickly and cost-effectively build multiple data center configurations for a wide range of AI, high-performance computing (HPC) and 5G applications.
Solving the largest AI and HPC problems requires high-capacity and high-bandwidth memory (HBM). The fourth-generation NVIDIA NVLink-C2C delivers 900 gigabytes per second (GB/s) of bidirectional bandwidth between the NVIDIA Grace CPU and NVIDIA GPUs. The connection provides a unified, cache-coherent memory address space that combines system and HBM GPU memory for simplified programmability. This coherent, high-bandwidth connection between CPU and GPUs is key to accelerating tomorrow’s most complex problems.
NVIDIA Grace is the first server CPU to harness LPDDR5X memory with server-class reliability through mechanisms like error-correcting code (ECC) to meet the demands of the data center, while delivering 2X the memory bandwidth and up to 10X better energy efficiency compared to today’s server memory. The LPDDR5X solution coupled with NVIDIA Grace’s large high-performance last-level cache delivers the bandwidth necessary for large models while reducing system power to maximize performance for next-generation workloads.
As the parallel compute capabilities of GPUs continue to advance, workloads can still be gated by serial tasks run on the CPU. A fast and efficient CPU is a critical component of system design to enable maximum workload acceleration. The NVIDIA Grace CPU integrates Arm Neoverse V2 cores with the NVIDIA Scalable Coherency Fabric to deliver high performance in a power-efficient design, making it easier for scientists and researchers to do their life’s work.
Generative AI is memory and compute intensive. The NVIDIA GB200 Superchip uses 380GB of HBM memory, delivering over 4.5X the GPU memory bandwidth of the NVIDIA H100 Tensor Core GPU. The high-bandwidth memory in Grace Blackwell is connected to CPU memory over NVLink-C2C to provide almost 860GB of fast-access memory to the GPU, delivering the memory capacity and bandwidth needed to handle the world’s most complex generative AI and accelerated computing workloads.
NVIDIA provides in-depth support for NVIDIA Grace with performance-tuning guides, developer tools, and libraries.