Speech and Translation AI

NVIDIA Riva

Build powerful, real-time voice AI with Nemotron™ speech open models and the NVIDIA® Riva library. Create custom voice agents that can listen, speak, transcribe, and translate instantly in multiple languages. Use Nemotron open models for speech recognition, translation, and text‑to‑speech—all powered by NVIDIA Riva for exceptional accuracy and lightning‑fast performance.

Get Started

Video | For Developers

Speech and Translation AI

NVIDIA Riva

Get Started

Video | Solution Brief | For Developers

Overview
Benefits
Use Cases
Starting Options
Customer Stories
Adopters
Resources
Next Steps

Overview
Benefits
Use Cases
Starting Options
Customer Stories
Adopters
Resources
Next Steps

Get Started

Overview

What Is NVIDIA Riva?

NVIDIA Riva is a collection of GPU-accelerated microservices for building real-time, customizable speech AI applications. With industry-leading automatic speech recognition (ASR), text-to-speech (TTS), and neural machine translation (NMT) developers can deliver accurate, natural, and responsive speech experiences optimized for production. Deploy easily across cloud, data center, or edge environments, and plug directly into Nemotron open models so you can choose the right ASR and multilingual speech translation quality for each application.

NVIDIA Nemotron Speech Models Achieve Top Spots on ASR Leaderboard

State-of-the-art Nemotron ASR models, Canary-Qwen-2.5B and Parakeet-TDT-0.6B-V2, now hold top spots on the Artificial Analysis (AA) ASR leaderboard, including Canary-Qwen-2.5B’s #1 ranking on the VoxPopuli subset.

Try Now

Benefits

Benefits of NVIDIA Riva

Multilingual Transcriptions and Expressive Voice Generation

Achieve high multilingual transcription and translation accuracy, and provide out-of-the-box, expressive, professional female and male voices with state-of-the-art models pretrained on thousands of hours of audio.

Fully Customizable

Customize across ASR pipelines for different languages, accents, domains, vocabulary, and context for the best possible accuracy for your use case and across TTS pipelines for the brand voice and intonation you want.

Flexible Deployments

Provide consistent experiences to hundreds of thousands of concurrent users with higher inference performance than existing technology, and deploy anywhere—in data centers, on premises, in the cloud, at the edge, or in embedded devices.

Enterprise-Grade AI

Accelerate the development and deployment of production-grade, multilingual, voice-enabled AI applications with NVIDIA Riva, part of the NVIDIA AI Enterprise platform for accelerating AI development and deployment.

Use Cases

How Riva Is Being Used

See how NVIDIA AI supports industry use cases, and jump-start your speech AI development with curated examples.

AI Virtual Assistant
Agent Assist
Transcription
AI Translation
AI Robot

AI Virtual Assistant

Companies are deploying AI virtual assistants to automatically address the queries of millions of customers and employees around the clock. With Riva speech and translation AI microservices, these assistants provide helpful and natural responses at every turn of the conversation despite background noise, poor sound quality, and diverse speaker dialects and accents.

Explore AI Virtual Assistants for Telecom

Agent Assist

Consumers expect contact center agents to resolve their issues quickly and efficiently. To meet these expectations and deliver the best customer and agent experiences possible, enterprises across industries are implementing agent-assist technology powered by Riva speech and translation AI.

Learn More About Agent Assist

Transcription

With hundreds of millions of online meetings held daily, video conferencing has become an indispensable tool for enterprises. With Riva real-time transcription, video conferencing applications achieve impressive accuracy in live captioning and meeting summarizations, accommodating users with worldwide accents and diverse, domain-specific vocabulary.

Learn More About Transcription and NVIDIA ASR models

AI Translation

In the global economy, businesses operate across many countries and serve customers with diverse linguistic and cultural backgrounds. This diversity in global languages poses a unique challenge in finding native speakers or training employees in multiple languages. Riva translation empowers accurate and effective communication, facilitating smooth global interactions.

Explore Translation for Contact Centers

AI Robot

AI robots are increasingly found in hospitals, airports, and retail stores worldwide. They aid frontline workers by handling daily repetitive tasks in restaurants and manufacturing facilities, assist customers in locating items in stores, and support physicians and nurses in patient care. With Riva, it’s easy to add speech and translation AI to service robots.

Read How to Add Speech to Robots

Starting Options

Ways to Get Started With NVIDIA Riva

Use the right tools and technologies to build and deploy fully customizable, multilingual speech and translation AI applications.

Try

Experience Riva through a UI-based portal for exploring and prototyping with NVIDIA-managed endpoints, available for free through NVIDIA's API catalog.

Try Now

Deploy

Get a free license to try NVIDIA AI Enterprise for 90 days using your existing infrastructure.

Request a 90-Day License

Experience

Access NVIDIA-hosted infrastructure and guided hands-on labs that include step-by-step instructions and examples, available for free on NVIDIA LaunchPad.

Access Hands-On Labs

Compare Ways to Get Started

Customer Stories

How Industry Leaders Are Driving Innovation With Riva

Speech AI for Award-Winning Customer Care

Customer: T-Mobile

Products: NVIDIA Riva, NVIDIA-Certified Systems

Technologies: NVIDIA Data Center GPUs, NVIDIA NeMo, NVIDIA Riva

Read Case Study

Telecommunications

World-Class Speech AI for the Best Video Conferencing Experience

Customer: RingCentral

Products: NVIDIA DGX, NVIDIA Riva

Technologies: NVIDIA Data Center GPUs, NVIDIA NeMo, NVIDIA Riva, NVIDIA Triton Inference Server

Read Case Study

Academia / Higher Education

Automating Real-Time Arabic Speech Recognition

Customer: Tarteel.ai

Products: NVIDIA Riva, NVIDIA-Certified Systems

Technologies: NVIDIA NeMo, NVIDIA Riva, NVIDIA Data Center GPUs

Read Case Study

Adopters

Leading Adopters Across All Industries

Customers
Partners
Service Delivery Partners

Resources

The Latest in NVIDIA Riva Resources

Blogs
Sessions
Training
Videos

View All Blogs

View More Sessions

Get Started With Highly Accurate Custom ASR

Learn to build, train, fine-tune, and deploy a GPU-accelerated ASR service with Riva that includes customized features.

Enroll Now

Talk to Your Data in Your Native Language

Join AI experts to learn how to build, fine-tune, and deploy production-ready, multilingual speech and translation AI on top of LLM-based applications, enabling your chatbots to speak to your customers in their natural languages.

Watch On-Demand Session

Try Riva on NVIDIA LaunchPad

Have an existing speech AI project? Apply to get hands-on experience testing and prototyping your conversation-based solutions with speech skills in the high-performance Riva software stack that’s deployable today.

Apply Now

Using Speech AI for Transcription, Translation, and Voice

Build world-class, fully customizable, speech AI applications such as intelligent virtual assistants, audio transcription services, and digital avatars.

Watch Now

Reinvent Contact Center Experiences With NVIDIA Riva

By generating an accurate transcript of customer interactions in real time, Riva enables AI to provide contextual insights, measure sentiment, and recommend the next-best action to an agent, ensuring a great personalized experience.

Watch Now

Robot Dog Fetches Snacks Across Town

Watch as Spot uses speech AI to order snacks across town without an internet connection. Instead of uploading voice commands to the cloud and processing them on the server, Spot processes everything locally for seamless, efficient performance and delivery.

Watch Now

View More Videos

Building Speech AI Applications

Explore how to get started with integrating and deploying Riva ASR and TTS models in production with high-performance inference and minimal effort.

Read Ebook

An Introduction to NVIDIA Riva

Learn about Riva’s architecture, key features, and components for building speech and translation AI services.

Read Blog See All Technical Riva Blogs

NVIDIA Parlays Win in Voice Challenge

Read how a team of NVIDIANs won the LIMMITS ’24 challenge, which asked contestants to recreate in real time a speaker’s voice in English or any of six languages spoken in India with the appropriate accent.

Read Blog See All Riva Blogs

Next Steps

Ready to Get Started?

Use the right tools and technologies to build and deploy fully customizable, multilingual, speech and translation AI applications.

For Developers

Explore everything you need to start developing with NVIDIA Riva, including the latest documentation, tutorials, technical blogs, and more.

Start Developing

Get in Touch

Talk to an NVIDIA product specialist about moving from pilot to production with the security, API stability, and support of NVIDIA AI Enterprise.

NVIDIA Riva

NVIDIA Riva

Overview

What Is NVIDIA Riva?

NVIDIA Nemotron Speech Models Achieve Top Spots on ASR Leaderboard

Benefits

Benefits of NVIDIA Riva

Multilingual Transcriptions and Expressive Voice Generation

Fully Customizable

Flexible Deployments

Enterprise-Grade AI

Use Cases

How Riva Is Being Used

AI Virtual Assistant

Agent Assist

Transcription

AI Translation

AI Robot

Starting Options

Ways to Get Started With NVIDIA Riva

Try

Deploy

Experience

Customer Stories

How Industry Leaders Are Driving Innovation With Riva

Speech AI for Award-Winning Customer Care

World-Class Speech AI for the Best Video Conferencing Experience

Automating Real-Time Arabic Speech Recognition

Adopters

Leading Adopters Across All Industries

Resources

The Latest in NVIDIA Riva Resources

Get Started With Highly Accurate Custom ASR

Talk to Your Data in Your Native Language

Try Riva on NVIDIA LaunchPad

Using Speech AI for Transcription, Translation, and Voice

Reinvent Contact Center Experiences With NVIDIA Riva

Robot Dog Fetches Snacks Across Town

AI2Labs

Avaya

C-DAC

NCS

Learn more.

RingCentral

Learn more.

Snap

T-Mobile

Learn more.

Building Speech AI Applications

An Introduction to NVIDIA Riva

NVIDIA Parlays Win in Voice Challenge

Next Steps

Ready to Get Started?

For Developers

Get in Touch