Build and deploy fully customizable multilingual speech and translation AI applications.
NVIDIA® Riva is a GPU-accelerated multilingual speech and translation AI software development kit for building fully customizable, real-time conversational AI pipelines—including automatic speech recognition (ASR), text-to-speech (TTS), and neural machine translation (NMT) applications—that can be deployed in clouds, in data centers, at the edge, or on embedded devices. NVIDIA Riva is part of the NVIDIA AI Enterprise software platform, which streamlines development and deployment of production AI.
Select the language and check out how Riva ASR delivers highly accurate transcription in real time by providing an input through your microphone or uploading a .wav file from your device.
Note: The duration of each sample is limited to 30 seconds.
Select a voice and type in a test sentence to hear Riva’s out-of-the-box English female or male voice.
Note: Input text is limited to 400 characters.
0 / 400
Use of Riva skills is subject to NVIDIA Riva terms of use. Your data will be used to improve NVIDIA products and services.
Achieve high transcription accuracy for bilingual and multilingual translations of English, Spanish, Mandarin, Hindi, Russian, Arabic, Japanese, Korean, German, Portuguese, French, and Italian, and deploy two out-of-the-box expressive professional female and male voices for English, Spanish, German, Italian, and French with state-of-the-art models pretrained on thousands of hours of audio on NVIDIA supercomputers.
Customize across ASR pipelines for different languages, accents, domains, vocabulary, and context for the best possible accuracy for your use case and across TTS pipelines for the voice and intonation you want.
Provide consistent experiences to your customers for hundreds of thousands of input streams with higher inference performance versus existing technology and on deployment of your choice—in data centers, on premises, in the cloud, at the edge, or in embedded devices.
Get unlimited usage on all clouds, access to NVIDIA AI experts, and long-term support for production deployments by purchasing NVIDIA Riva, a premium edition of the NVIDIA AI Enterprise software platform. Or get access to free Riva containers for development for 90 days as a member of the NVIDIA Developer Program. Or apply to try Riva for free on NVIDIA LaunchPad, a program that provides short-term access to enterprise-grade NVIDIA hardware and software via a web browser.
Accelerate development with packaged AI workflows for audio transcription and intelligent virtual assistants. These AI workflows include AI frameworks, and pretrained models, as well as resources such as Helm charts, Jupyter Notebooks, and documentation to help you jump-start building AI solutions.
Have an upcoming speech AI project? Apply for access to NVIDIA Riva with free, curated labs. Access step-by-step guided labs for speech AI with ready-to-use hardware, software, sample data, and applications.
NCS used NVIDIA Riva TTS to customize a Singaporean voice with local pronunciation, tone, and accent for thousands of Breeze—a driver’s companion app—monthly active users.
T-Mobile uses NVIDIA Riva ASR in their call center to accurately transcribe customer conversations and provide real-time recommendations to help agents quickly resolve customer queries.
Data Monsters added a speech pipeline to their Plabook app using NVIDIA Riva to help students read, assess phoneme-level accuracy, and provide individualized feedback.
Artisight developed smart hospital solutions that automate check-in and notify waiting patients via voice-enabled kiosks. These solutions integrate a customized speech AI application and deliver real-time performance using GPU-accelerated NVIDIA Riva text-to-speech skills.
With NVIDIA Riva, RingCentral achieved unparalleled real-time transcription accuracy for video meetings, serving millions of users with diverse accents and domain-specific jargon.
Tarteel uses NVIDIA Riva and NVIDIA NeMo™ to provide real-time feedback on Quran recitation at scale, enabling Muslims, instructors, content creators, and researchers to engage with the Quran.
Floatbot leverages NVIDIA Riva and NVIDIA TAO for their customized Singaporean English voice AI applications, automating call centers for insurance carriers and finance clients globally.
Join a data science and AI technology expert to learn about cutting-edge NVIDIA Riva speech and translation AI solutions that are revolutionizing the industry, from virtual assistants and digital avatars for improved outreach, claims management, ordering, and provisioning to fraud detection systems for risk mitigation.
Join NVIDIA, AT&T, Kore.ai, Deloitte, Appen, and Sutherland to explore the benefits and challenges of using multi-language ASR, translation, and TTS to deliver faster and more accurate customer self-service, enhance live agent productivity, and boost operational efficiencies for enterprises.
Watch this on-demand webinar to learn how to build intelligent virtual assistants in the form of voice-enabled digital agents. We’ll showcase how to deploy flexible, fully customizable solutions to improve customer satisfaction.
Watch T-Mobile as they walk through their model development with NVIDIA NeMo, cloud deployment with NVIDIA Riva, their efforts to identify and remove bias in their models, and the future of speech-to-text at T-Mobile.
Watch this on-demand webinar to learn how speech AI is revolutionizing customer experiences in finance, broadcasting, and retail by removing barriers across languages and dialects, driving operational efficiency, and helping businesses stay ahead by improving accuracy and enhancing performance.
Join Motorola and Softserve to learn how to deliver the most accurate transcription, translation, and engaging voices at the speed and scale conversational AI experiences demand.
Watch Infosys, Quantiphi, Talkmap, and NVIDIA on-demand to learn how telecommunications companies are using AI to improve operational efficiency and enhance customer engagement.
Learn best practices from Infosys and Quantiphi for seamlessly integrating speech and translation AI into agent-assist solutions, ensuring smooth and effective customer-agent communication.
Understand the key features in Riva that help you build speech and translation AI services.
Get everything you need to start building your speech and translation AI pipelines with NVIDIA Riva, including tutorials, Jupyter Notebooks, and documentation.
Read a technical walkthrough on how to build and deploy speech and translation AI applications using Riva.
Explore how to get started with integrating and deploying Riva ASR and TTS models in production with high-performance inference and minimal effort.
In 2021, AI2Labs spun off from Yoozoo Games as a local tech startup in Singapore. AI2Labs innovates, experiments, and develops AI products and applications, enabling efficient processes and improving sustainability and business outcomes.
AI2Labs integrated Riva into their Speakr—domain-specific speech AI—speech recognition API to accommodate the intricacies of Asian speech and business domains and achieved state-of-the-art Singlish translation accuracy.
Avaya specializes in cloud communications and workstream collaboration solutions, providing unified communications, contact center, communications platform as a service (CPaaS), and services with their OneCloud platform.
Avaya integrated the NVIDIA Riva speech-to-text engine for real-time captions at scale. Riva enables better transcription quality, lower word-error rate, and economical delivery.
For over 10 years, the Applied AI Group at C-DAC in Pune, India, has focused on research and development of speech technology. They’ve successfully created a cutting-edge speech-to-text (STT) system for Indic languages such as Hindiand Marathi. The group continues to advance their work by exploring AI-enabled, open-source deep learning frameworks, libraries, and tools for creating STT and speech-enabled applications for other Indic and low-resource languages. Experiments were conducted using various neural network architectures and topologies from NVIDIA’s open-source NeMo framework, with Citrinet and Conformer-CTC network topologies proving to be effective in building and training neural acoustic models for speech recognition. These models were trained on single- and multi-node Param Siddhi AI systems, optimizing training time and performance. Finally, the models were deployed for real-time and batch-mode inference using the Riva GPU-accelerated production pipeline.
NCS, a subsidiary of Singtel Group, is a leading technology services firm with presence in Asia Pacific and partners with governments and enterprises to advance communities through technology. Combining the experience and expertise of its 12,000-strong team across 61 specialisations, NCS provides differentiated and end-to-end technology services to clients with its NEXT capabilities in digital, data, cloud and platforms, as well as core offerings in application, infrastructure, engineering and cybersecurity. NCS also believes in building a strong partner ecosystem with leading technology players, research institutions and start-ups to support open innovation and co-creation.
NCS uses NVIDIA Riva TTS in Breeze—the driver’s companion app—for voice-guided navigation, live traffic and road condition updates, real-time parking rates, and electronic road pricing rates and operating hours, to help Singapore drivers experience smooth driving journeys.
breeze.com.sg/
www.ncs.co
Customer Story
RingCentral, a leading provider of global enterprise cloud communications, collaboration, and contact center solutions, serves millions of users. The RingCentral platform empowers collaboration from any location and device, improving business efficiency and customer satisfaction. RingCentral uses NVIDIA Riva for video conferencing transcription for 200,000 concurrent users on their platform.
www.ringcentral.com
GTC Session
Snap is a camera and social media company that enables multimedia message creation with filters and effects. To create more interactive experiences, Snapchat users play with Lenses—a feature that adds real-time effects into snaps—over 6 billion times per day.
NVIDIA Riva’s noise- and lingo-optimized speech AI service is integrated into Snap AR Lens Studio, enabling creators—artists and developers—to build gripping augmented reality (AR) experiences.
T-Mobile, a supercharged Un-carrier, delivers an advanced 4G LTE and transformative 5G network for the best customer experience. To empower contact center agents, T-Mobile implements Expert Assist. This AI-based software uses NVIDIA Riva to transcribe real-time customer conversations that feed recommenders and assist thousands of agents.
With Riva, T-Mobile fine-tunes automatic speech recognition models on custom datasets and interprets customer jargon accurately across noisy environments.
www.t-mobile.com
We'll answer your questions and help with your organization's needs.
NVIDIA Privacy Policy