Speech and Translation AI

NVIDIA Riva

Build and deploy fully customizable multilingual speech and translation AI for your large language model and retrieval-augmented generation based applications.

Get Started

Video | Solution Brief | For Developers

Introduction
Demos
Benefits
Starting Options
Case Studies
Adopters
Resources
Next Steps

Introduction
Demos
Benefits
Starting Options
Case Studies
Adopters
Resources
Next Steps

Get Started

What Is NVIDIA Riva?

NVIDIA® Riva is a set of GPU-accelerated multilingual speech and translation microservices for building fully customizable, real-time conversational AI pipelines. Riva includes automatic speech recognition (ASR), text-to-speech (TTS), and neural machine translation (NMT) and is deployable in all clouds, in data centers, at the edge, or on embedded devices. With Riva, organizations can add speech and translation interfaces with large language models (LLMs) and retrieval-augmented generation (RAG) to transform chatbots into engaging and expressive multilingual assistants and avatars.

Unveiling End-To-End Speech and Translation AI Magic

Deliver AI chatbots with the state-of-the-art multilingual transcription, translation, and voices.

Watch Session

See Riva in Action

Speech-to-Text
Text-to-Speech

Try NVIDIA Riva Automatic Speech Recognition

Select the language and check out how Riva ASR delivers highly accurate transcription in real time by providing an input through your microphone or uploading a .wav file from your device.

Note: The duration of each sample is limited to 30 seconds.

Language

Try saying something

Upload .wav

Try NVIDIA Riva Text-to-Speech

Select a voice and type in a test sentence to hear Riva’s out-of-the-box English female or male voice.

Note: Input text is limited to 400 characters.

Use of Riva skills is subject to NVIDIA Riva terms of use. Your data will be used to improve NVIDIA products and services.

NVIDIA Riva Benefits

Highly Accurate and Expressive Multilingual Voices

Achieve high transcription accuracy for bilingual and multilingual translations and deploy out-of-the-box expressive professional female and male voices with state-of-the-art models pretrained on thousands of hours of audio on NVIDIA supercomputers.

Watch Video: World-Class Automatic Speech Recognition (46 Seconds)

Fully Customizable

Customize across ASR pipelines for different languages, accents, domains, vocabulary, and context for the best possible accuracy for your use case and across TTS pipelines for the voice and intonation you want.

Watch Video: Controllable Text-to-Speech (42 seconds)

Flexible Deployments

Provide consistent experiences to your customers for hundreds of thousands of input streams with higher inference performance versus existing technology and on deployment of your choice—in data centers, on premises, in the cloud, at the edge, or in embedded devices.

Watch Video: NVIDIA Riva Embedded Demo (10:44 Minutes)

Starting Options

Get Started With NVIDIA Riva

Use the right tools to build and deploy fully customizable, multilingual speech and translation AI applications.

Experience APIs and Interactive Demos

For individuals looking to experience Riva, the API catalog offers a UI-based playground and access to NVIDIA-managed API endpoints for free as a great starting point.

Experience Now

Try Before You Buy

For enterprises looking to try Riva before purchasing NVIDIA AI Enterprise for production, there are two options to get started for free:

Without Infrastructure:
For those without existing infrastructure, NVIDIA offers free hands-on labs through NVIDIA LaunchPad.

Access Hands-On Labs

With Infrastructure:
For those with existing infrastructure, NVIDIA offers a free evaluation license to try NVIDIA AI Enterprise for 90 days.

Request a 90-Day Trial

Compare Starting Options

Case Studies

T-Mobile uses NVIDIA Riva ASR in their call center to accurately transcribe customer conversations and provide real-time recommendations to help agents quickly resolve customer queries.

Learn More

T-Mobile uses NVIDIA Riva ASR in their call center to accurately transcribe customer conversations and provide real-time recommendations to help agents quickly resolve customer queries.

Learn More

NCS used NVIDIA Riva TTS to customize a Singaporean voice with local pronunciation, tone, and accent for thousands of monthly active Breeze users—a driver’s companion app.

Learn More

Tarteel uses NVIDIA Riva and NVIDIA NeMo™ to provide real-time feedback on Quran recitation at scale, enabling Muslims, instructors, content creators, and researchers to engage with the Quran.

Learn More

With NVIDIA Riva, RingCentral achieved unparalleled real-time transcription accuracy for video meetings, serving millions of users with diverse accents and domain-specific jargon.

Learn More

Data Monsters added a speech pipeline to their Plabook app using NVIDIA Riva to help students read, assess phoneme-level accuracy, and provide individualized feedback.

Learn More

Artisight developed smart hospital solutions that automate check-ins and notify waiting patients via voice-enabled kiosks. These solutions integrate a customized speech AI application and deliver real-time performance using GPU-accelerated NVIDIA Riva text-to-speech skills.

Learn More

Explore More Success Stories

Leading Adopters Across All Industries

Customers
Partners
Service Delivery Partners

Hear From Experts

Speech AI for Impactful Contact Centers

Explore how AT&T, Kore.ai, Deloitte, and Sutherland benefit from using multi-language ASR, translation, and TTS to deliver faster and more accurate customer self-service, enhance live agent productivity, and boost operational efficiencies for enterprises.

Watch Session

The Future of Customer Service With AT&T

Learn from data science and AI technology expert about cutting-edge NVIDIA Riva speech and translation AI solutions that are revolutionizing the industry, from virtual assistants and digital avatars for improved outreach, claims management, ordering, and provisioning to fraud detection systems for risk mitigation.

Watch Session

Build an AI Voice-Enabled Virtual Assistant

Watch this on-demand webinar to learn how to build intelligent virtual assistants in the form of voice-enabled digital agents. We’ll showcase how to deploy flexible, fully customizable solutions to improve customer satisfaction.

Watch Webinar

Unveiling End-to-End Speech and Translation AI Magic

Check out how Motorola and SoftServe deliver the most accurate transcription, translation, and engaging voices at the speed and scale conversational AI experiences demand.

Watch Session

Transform Your Business With Speech AI

Watch this on-demand webinar to learn how speech AI is revolutionizing customer experiences in finance, broadcasting, and retail by removing barriers across languages and dialects, driving operational efficiency, and helping businesses stay ahead by improving accuracy and enhancing performance.

Watch Speech AI Day Session

Telcos Transform Customer Experiences With Conversational AI

Watch Infosys, Quantiphi, Talkmap, and NVIDIA on-demand to learn how telecommunications companies are using AI to improve operational efficiency and enhance customer engagement.

Watch On-Demand Webinar

Enabling Contact Center Agents Through Speech AI

Learn best practices from Infosys and Quantiphi for seamlessly integrating speech and translation AI into agent-assist solutions, ensuring smooth and effective customer-agent communication.

Watch On-Demand Webinar

Speech-to-Text at Scale With T-Mobile

Watch T-Mobile as they walk through their model development with NVIDIA NeMo, cloud deployment with NVIDIA Riva, their efforts to identify and remove bias in their models, and the future of speech-to-text at T-Mobile.

Watch T-Mobile Session

Transform Your Business With Speech AI

Watch Speech AI Day Session

Unveiling End-to-End Speech and Translation AI Magic

Join Motorola and Softserve to learn how to deliver the most accurate transcription, translation, and engaging voices at the speed and scale conversational AI experiences demand.

Watch Speech AI Day Session

Telcos Transform Customer Experiences With Conversational AI

Watch Infosys, Quantiphi, Talkmap, and NVIDIA on-demand to learn how telecommunications companies are using AI to improve operational efficiency and enhance customer engagement.

Watch On-Demand Webinar

Enabling Contact Center Agents Through Speech AI

Learn best practices from Infosys and Quantiphi for seamlessly integrating speech and translation AI into agent-assist solutions, ensuring smooth and effective customer-agent communication.

Watch On-Demand Webinar

View More Sessions

More Resources

Get an Introduction

Understand the key features in Riva that help you build speech and translation AI services.

Read Blog

Explore Getting Started Resources

Get everything you need to start building your speech and translation AI pipelines with NVIDIA Riva, including tutorials, Jupyter Notebooks, and documentation.

Get Started

Explore Technical Blogs

Read a technical walkthrough on how to build and deploy speech and translation AI applications using Riva.

Explore Riva Blogs

Check Out an Ebook

Explore how to get started with integrating and deploying Riva ASR and TTS models in production with high-performance inference and minimal effort.

Read Now

Next Steps

Ready to Get Started?

Find the right license to build and deploy fully customizable, multilingual speech and translation AI applications, or explore more development resources.

Get in Touch

Talk to an NVIDIA product specialist about moving from pilot to production with the security, API stability, and support of NVIDIA AI Enterprise.

Get the Latest on NVIDIA Riva

Stay Informed

NVIDIA Riva

What Is NVIDIA Riva?

Unveiling End-To-End Speech and Translation AI Magic

See Riva in Action

Try NVIDIA Riva Automatic Speech Recognition

Try NVIDIA Riva Text-to-Speech

NVIDIA Riva Benefits

Highly Accurate and Expressive Multilingual Voices

Fully Customizable

Flexible Deployments

Starting Options

Get Started With NVIDIA Riva

Experience APIs and Interactive Demos

Try Before You Buy

Case Studies

Leading Adopters Across All Industries

Hear From Experts

Speech AI for Impactful Contact Centers

The Future of Customer Service With AT&T

Build an AI Voice-Enabled Virtual Assistant

Unveiling End-to-End Speech and Translation AI Magic

Transform Your Business With Speech AI

Telcos Transform Customer Experiences With Conversational AI

Enabling Contact Center Agents Through Speech AI

Speech-to-Text at Scale With T-Mobile

Transform Your Business With Speech AI

Unveiling End-to-End Speech and Translation AI Magic

Telcos Transform Customer Experiences With Conversational AI

Enabling Contact Center Agents Through Speech AI

More Resources

Get an Introduction

Explore Getting Started Resources

Explore Technical Blogs

Check Out an Ebook

Next Steps

Ready to Get Started?

Get in Touch

Get the Latest on NVIDIA Riva

AI2Labs

Avaya

C-DAC

NCS

Learn more.

RingCentral

Learn more.

Snap

T-Mobile

Learn more.

Contact an NVIDIA AI Enterprise Sales Representative

Contact Us