Semi-custom AI infrastructure with industry-proven AI scale-up performance and rack-scale architecture.
NVIDIA NVLink™ Fusion is a rack-scale AI infrastructure platform that enables hyperscalers and custom ASIC designers to integrate custom CPUs and XPUs with the world-leading NVLink scale-up interconnect and OCP MGX rack-scale server architecture. Leverage NVIDIA’s battle-tested AI technology stack and proven rack-scale design and ecosystem to reach the market faster, reduce development costs and deployment risks, and achieve better performance for a higher return on investment.
The NVLink Fusion modular technology portfolio includes NVIDIA GPUs, NVIDIA Vera™ CPUs, NVLink scale-up networking, co-packaged optics (CPO) switches, NVIDIA ConnectX® SuperNICs, BlueField® DPUs, and Mission Control software. The comprehensive ecosystem includes ASIC designers, CPU and IP providers, OEMs/ODMs, and component suppliers, all designed to enable rapid deployment of custom AI silicon.
Benefits
Unlocking the full potential of AI factories requires swift, seamless communication among all accelerators. NVIDIA NVLink 6 can connect 72 XPUs all-to-all at 3.6 TB/s per XPU to boost AI performance and return on investment.
The established NVLink Fusion supplier ecosystem provides all of the components required for full rack-scale deployment based on the OCP MGX architecture, from rack and chassis to power delivery and cooling systems, eliminating the development costs and deployment risks associated with a new rack design.
By leveraging NVIDIA’s battle-tested technology stack and ecosystem of ASIC designers, CPU and IP providers, and OEM/ODMs, hyperscalers can achieve faster time to market and faster time to revenue.
As hyperscalers already deploy full NVIDIA rack solutions, NVLink Fusion allows heterogeneous silicon offerings while standardizing around a common rack design, accelerating AI factory deployment, and simplifying management.
Platform
NVIDIA NVLink 6 and NVLink Switch Chip enable 260 TB/s of bandwidth in a single 72-accelerator NVLink domain (NVL72) and deliver 4x bandwidth efficiency with NVIDIA Scalable Hierarchical Aggregation and Reduction Protocol (SHARP)™ FP8 support.
NVIDIA NVLink-C2C extends the industry-leading NVLink technology to a chip-to-chip interconnect. This enables the creation of a new class of integrated products with NVIDIA partners, built via chiplets, allowing NVIDIA GPUs or CPUs to have a high-bandwidth coherent connection with custom silicon.
Adopters
Learn how NVIDIA NVLink Fusion addresses the growing demands of complex AI models.