Complete portfolio of switches and fabric management appliances for AI and supercomputing at scale.
Overview
NVIDIA Quantum InfiniBand is the world’s only fully offloadable, NVIDIA In-Network Computing platform. With unparalleled data throughput and density, it provides the dramatic leap in performance needed to achieve unmatched data center performance with less cost and complexity. NVIDIA Quantum InfiniBand also provides self-healing network capabilities, enhanced quality of service (QoS), congestion control, and adaptive routing to provide the highest overall application throughput.
NVIDIA Scalable Hierarchical Aggregation and Reduction Protocol (SHARP)™ v4 for 9x higher performance
Over 10,000 nodes in two-level fat tree
SHARP accelerated, for up to 2.5x higher performance
Fastest recovery for 1,000x higher resiliency
InfiniBand is a high-performance interconnect standard built for low-latency, high-bandwidth communication between compute nodes. For AI training and HPC workloads that distribute computation across hundreds or thousands of GPUs, network latency directly affects job completion time. NVIDIA Quantum InfiniBand is also the only interconnect platform with In-Network Computing support, which offloads collective operations like AllReduce onto the switch fabric itself, freeing GPU cycles for actual computation.
Conventional switch designs use pluggable optical transceivers to connect the switch ASIC to fiber. Each transceiver is a discrete component that consumes power, generates heat, and can fail. At cluster scale, this means thousands of individual transceivers to manage and replace over the deployment's lifetime.
The NVIDIA Quantum-X InfiniBand Photonics switch integrates optical engines directly with the switch silicon, eliminating pluggable transceivers entirely. This shortens the electrical path between chip and fiber, reducing power consumption and latency, while fewer discrete components improve reliability and simplify long-term operations.
In-Network Computing is the ability of NVIDIA Quantum switches to execute data processing operations inside the network fabric, without involving host CPUs or GPUs. The key capability is NVIDIA SHARP, which offloads collective operations like AllReduce directly onto the switches. For distributed AI training, this reduces time spent on inter-node communication and frees GPU cycles for computation. SHARP accelerates the collective operations at the core of most parallel simulation codes, reducing latency and CPU overhead. Applications leveraging NCCL, OpenMPI, UCX, or other standards-compliant MPI stacks benefit with minimal code changes, making In-Network Computing relevant across domains from AI training to molecular dynamics to climate modeling.
For new large-scale AI deployments, the best choice is NVIDIA Quantum-X800 InfiniBand. It delivers 800Gb/s XDR connectivity and is designed for next-generation AI factories that need maximum scale-out bandwidth, low latency, adaptive routing, congestion control, and SHARP in-network computing for AI and HPC collectives. Quantum-2 NDR 400Gb/s remains a strong option for Hopper-generation or cost-optimized deployments, but for new Blackwell-scale clusters, Quantum-X800 provides the best long-term performance runway. The Q3200 is the Quantum-X800 2U air-cooled switch platform and is well suited for large or mixed-generation environments that need high port density and a practical migration path.
NVIDIA Unified Fabric Manager (UFM) is the fabric management platform for NVIDIA Quantum InfiniBand deployments. UFM (offered as an integrated appliance as well as standalone software) handles provisioning, real-time monitoring, diagnostics, and proactive troubleshooting across the fabric. NVIDIA UFM Cyber-AI is a dedicated appliance for detecting anomalous network behavior that could indicate security threats or hardware issues before they impact jobs.
This is a common consideration for organizations expanding existing clusters or building out in phases. In general, NVIDIA Quantum InfiniBand is designed for forward-compatibility: adapters and switches operating at different InfiniBand generations will auto-negotiate to the appropriate common speed. For deployments that need to connect Quantum-X800 (XDR, 800 Gb/s) and Quantum-2 (NDR, 400 Gb/s) environments, NVIDIA recommends the Q3200, a multi-speed switch purpose-built to bridge these two generations. Specific design considerations around topology, port mapping, and fabric configuration using the Q3200 are covered in the NVIDIA InfiniBand switch documentation.
Yes. The NVIDIA Skyway™ appliance is designed specifically for this use case, providing an InfiniBand-to-Ethernet gateway. Skyway allows InfiniBand compute clusters to communicate with Ethernet-attached storage systems, management networks, or other Ethernet infrastructure without requiring changes to either environment. Multiple Skyway appliances can be deployed and scaled incrementally as cross-fabric bandwidth demands grow.
Next Steps
This online tool can help you configure clusters based on fat tree with two levels of switch systems and Dragonfly+ topologies.
Explore deep technical training topics on NVIDIA Quantum InfiniBand networking through the NVIDIA Academy.
Visit the NVIDIA marketplace for more information on how to purchase NVIDIA networking solutions.