NVIDIA NVLink Fusion

Semi-custom AI infrastructure with industry-proven AI scale-up performance and rack-scale architecture.

Overview

World Class Scale-Up Performance

NVIDIA NVLink™ Fusion is a rack-scale AI infrastructure platform that enables hyperscalers and custom ASIC designers to integrate custom CPUs and XPUs with the world-leading NVLink scale-up interconnect and OCP MGX rack-scale server architecture. Leverage NVIDIA’s battle-tested AI technology stack and proven rack-scale design and ecosystem to reach the market faster, reduce development costs and deployment risks, and achieve better performance for a higher return on investment.

The NVLink Fusion modular technology portfolio includes NVIDIA GPUs, NVIDIA Vera™ CPUs, NVLink scale-up networking, co-packaged optics (CPO) switches, NVIDIA ConnectX® SuperNICs, BlueField® DPUs, and Mission Control software. The comprehensive ecosystem includes ASIC designers, CPU and IP providers, OEMs/ODMs, and component suppliers, all designed to enable rapid deployment of custom AI silicon.

AWS Integrates AI Infrastructure with NVIDIA NVLink Fusion for Trainium4 Deployment

Learn how AWS is using NVLink Fusion to accelerate Trainium4 deployment.

Integrating Semi-Custom Compute into Rack-Scale Architecture with NVIDIA NVLink Fusion

Learn how NVIDIA NVLink Fusion allows hyperscalers to build semi-custom AI infrastructure, integrating their ASICs or CPUs with NVIDIA GPUs, while standardizing on a single scalable hardware infrastructure.

Using NVLink Fusion, high-performance AI factories can scale quickly, benefiting from all solution components that make the NVIDIA rack-scale architecture.

Benefits

NVLink Fusion Benefits

World-Class Scale-Up Performance

Unlocking the full potential of AI factories requires swift, seamless communication among all accelerators. NVIDIA NVLink 6 can connect 72 XPUs all-to-all at 3.6 TB/s per XPU to boost AI performance and return on investment.

Lower Development Costs

The established NVLink Fusion supplier ecosystem provides all of the components required for full rack-scale deployment based on the OCP MGX architecture, from rack and chassis to power delivery and cooling systems, eliminating the development costs and deployment risks associated with a new rack design.

Accelerated Time to Market

By leveraging NVIDIA’s battle-tested technology stack and ecosystem of ASIC designers, CPU and IP providers, and OEM/ODMs, hyperscalers can achieve faster time to market and faster time to revenue.

Single Unified Architecture

As hyperscalers already deploy full NVIDIA rack solutions, NVLink Fusion allows heterogeneous silicon offerings while standardizing around a common rack design, accelerating AI factory deployment, and simplifying management.

Platform

NVIDIA NVLink Fusion Platform

NVIDIA NVLink

NVIDIA NVLink 6 and NVLink Switch Chip enable 260 TB/s of bandwidth in a single 72-accelerator NVLink domain (NVL72) and deliver 4x bandwidth efficiency with NVIDIA Scalable Hierarchical Aggregation and Reduction Protocol (SHARP)™ FP8 support.

NVIDIA NVLink-C2C

NVIDIA NVLink-C2C extends the industry-leading NVLink technology to a chip-to-chip interconnect. This enables the creation of a new class of integrated products with NVIDIA partners, built via chiplets, allowing NVIDIA GPUs or CPUs to have a high-bandwidth coherent connection with custom silicon.

Adopters

NVLink Fusion Ecosystem

Scaling AI Inference Performance with NVLink Fusion

Learn how NVIDIA NVLink Fusion addresses the growing demands of complex AI models.