Real-time accelerated computing for every quantum processor.
Overview
The open NVIDIA® NVQLink™ platform architecture tightly integrates quantum hardware with state-of-the-art accelerated computing to power the development of quantum processing units (QPUs) at scale. Using real-time APIs within the NVIDIA CUDA-Q™ software platform, researchers can easily leverage NVQLink for the low-latency, high-throughput connections they need to perform control tasks like calibration and quantum error correction (QEC). QPUs equipped with NVQLink allow QPU operators to unify quantum and accelerated compute resources to develop hybrid quantum applications.
The CUDA-Q real-time API allows developers to take advantage of NVQLink’s low-latency, high throughput connection to quantum hardware. A simple remote function API call within CUDA-Q’s kernel-based programming model makes it easy to accelerate hybrid applications and develop scalable QEC workflows.
Rigetti Computing
Highlights
1 With sparsity.
Workloads
Accelerating the quantum workflow from calibration to full fault tolerance.
Benefits
Together, NVQLink and CUDA-Q provide a platform for error-corrected quantum applications.
Develop and deploy error-corrected applications to your QPU using extensible CUDA-Q libraries for QEC and more.
Work across all major quantum controllers and quantum processor modalities.
Move hundreds of gigabites per second of data between the quantum controller and compute host with the industry’s most scalable, low-latency network.
Join a rapidly growing software ecosystem for accelerated quantum supercomputing.
Providers
Resources
NVQLink is a platform architecture for tightly coupling a GPU-accelerated server to a quantum processing unit (QPU).
NVQLink serves two needs:
The elements that define the NVQLink architecture are:
Validation of an NVQLink system is performed by a cudaq-realtime library function that measures the round trip latency of a QSC-Host callback. The recognized implementation of this benchmark is provided in the open source CUDA-Q repository and exercises the core functionality of cudaq-realtime. Because cudaq-realtime is the supported way to build real-time applications on NVQLink, its API is a requisite for NVQLink compatibility.
Users of NVQLink are free to choose among many options for the Real-time Host, the Quantum System Controller, and the Real-time Network. NVIDIA provides a reference implementation for the Real-time Network, and third party network architectures are compatible if they support the cudaq-realtime library.
The reference architecture for the NVQLink network is based on an ultrahigh performance and widespread form of ethernet called RoCE, and has the following elements:
Spectrum-X switch (optional) – If needed, an ethernet switch to expand the network radix and aggregate data to the Real-time Host from many points in the QSC.
There are many types of QPU, and there are various functions that QPU builders and users may want to offload to the Real-time Host with vastly different response time requirements. Because of this, we don’t prescribe a specific latency number.
NVQLink defines a common, open-source benchmark to ensure all compatible systems deliver transparent and reproducible network latency, allowing users to select the best solution for their needs.
In the context of NVQLink, the Real-time Host is intended to perform latency-critical compute to support quantum error correction (QEC) and online autocalibration of the QPU. These workloads have latency requirements from the millisecond range to the microsecond range on various QPU types, but regardless of the QPU type these workloads are essential for maximizing QPU performance and uptime.
NVQLink Real-time Hosts are available from our partners, each of which may have their own differentiated offerings. Ask your vendor what CUDA-Q realtime latency they support.
Creating a Real-time Host also requires support from the connected QSC. Please ensure your QSC provider is supporting the upgrade.
An existing CUDA-capable server can be converted to an NVQLink Real-time Host by ensuring the Real-time Network components are installed: an NVIDIA ConnectX or BlueField NIC and cudaq-realtime (available after March 2026 in the cuda-quantum repository).
A Real-time Host connected to a compatible QSC is validated using the CUDA-Q realtime callback latency benchmark.
Get Started
Connect with NVIDIA Quantum experts to learn more about NVQLink.