AI-RAN (artificial intelligence—radio access network) is a technology that enables the full integration of AI into the radio access network to realize transformative gains in operational performance, deliver new AI-based services, and unlock monetization opportunities. It enhances connectivity across mobile networks by leveraging AI to improve spectral efficiency, dynamic traffic handling, and real-time responsiveness.
AI-RAN enables the full integration of AI into the RAN hardware and software to deliver new AI-based services and monetization opportunities in addition to transformative gains in network utilization, spectral efficiency, and performance. As mobile network traffic grows exponentially due to AI-driven applications, AI-RAN offers a scalable solution to maintain performance while reducing cost.
The underlying infrastructure for AI-RAN is built using a completely homogeneous general-purpose, accelerated computing platform, without any RAN-specific hardware components, so that it can run both cellular and AI workloads concurrently with deterministic performance for each. It embodies cloud-native principles such as on-demand scaling, multi-tenancy, and containerization of both workloads.
The software for AI-RAN is built using fully software-defined and AI-native principles to allow containerization and acceleration of AI and RAN workloads, ensuring full benefits of the underlying accelerated computing infrastructure. AI-RAN supports the concurrent execution of AI models and RAN workloads, unlocking new opportunities for service innovation and infrastructure reuse.
Figure 1: AI-RAN integrates AI and RAN into the same accelerated computing platform.
With this accelerated and unified hardware-software foundation, AI-RAN enables the deployment of 5G/6G RAN and AI workloads on a shared, distributed, and accelerated cloud infrastructure. It converts the RAN infrastructure from a single-purpose to a multipurpose cloud infrastructure.
AI-RAN furthers the goals of Open RAN by leveraging a fully software-defined general-purpose platform architecture that enables open interfaces to deliver flexibility, interoperability, and cost-efficiency for the RAN. AI-RAN fosters a flexible and interoperable ecosystem by aligning with cloud-native and Open RAN principles, inviting innovation across the telecom value chain.
Related
There are three specific areas of AI integration into the RAN, as outlined by the AI-RAN Alliance—a community of telecom companies and academia with the mission to drive innovation and adoption of AI-RAN.
Figure 2: The domains of AI-RAN
AI-RAN lays the technology foundation for the telecommunications industry to integrate the rapid advancements in AI technologies into the cellular telecommunications roadmap. It positions the telecom industry to fully embrace AI integration as it evolves toward 6G.
The surge in AI and generative AI applications is creating increased demands on cellular networks, driving demand for AI inferencing at the edge, and necessitating new approaches to handle these workloads. By enabling intelligent traffic management and low-latency services at the network edge, AI-RAN contributes to more responsive and consistent customer experiences across mobile applications.
At the same time, advances in AI-based radio signal processing techniques are showing compelling results versus traditional techniques and promising transformative gains in radio efficiency and performance.
As the industry begins its 6G journey, AI-RAN built with general-purpose commercial-off-the-shelf (COTS) servers and software-defined acceleration provides enhanced capabilities to process increased AI and non-AI traffic efficiently, compared to traditional RAN systems that are based on purpose-built hardware, whether it be custom application-specific integrated circuits (ASICs) or system on chips (SoCs) with embedded accelerators.
AI-RAN creates new revenue opportunities from hosting AI workloads and enables AI to be integrated into the operations of the RAN to optimize network performance, automate management tasks, and enhance overall user experience. For network operators, AI-RAN enables better resource allocation and monetization opportunities through AI service hosting and improved infrastructure efficiency.
AI-RAN enables the deployment of 5G RAN and AI workloads on a shared, distributed, and accelerated cloud infrastructure, thereby addressing the two key challenges communication service providers (CoSPs) have had for a long time:
AI-RAN’s core mission is maximizing the ROI for service providers by delivering the following key benefits to CoSPs:
For CoSPs, AI-RAN is transformational because it:
The key building blocks for AI-RAN include the following:
NVIDIA provides an AI-RAN reference architecture, built with NVIDIA MGX™ GH200 or Grace Blackwell-based platforms and NVIDIA BF3, CX7/CX8 NICs, and Spectrum-X™ switch fabric, which is fully programmable with a software upgrade and can accommodate the evolving landscape of AI applications and the evolution to future 6G networks.
NVIDIA has worked with our partners to define, build, and validate NVIDIA Cloud Partners (NCP) Telco Reference Architecture (RA). The goal of this RA is to create a blueprint that can drive rapid deployment of AI-RAN for the CoSP customers. The key elements of this RA include:
Figure 3: NVIDIA AI-RAN reference architecture with server rack and network topology