NVIDIA Riva provides deep-learning-based automatic speech recognition (ASR) and text-to-speech (TTS) skills for AI practitioners and developers. ASR and TTS serve as the voice interfaces in speech AI applications such as call center agent assist, digital assistants, and video call transcription.
ASR converts speech to text and is usually the first step in a speech pipeline, so its transcription accuracy influences every downstream task. TTS generates human-like speech from text.
NVIDIA Riva is used across many industries, from telecommunications and finance to healthcare, retail, and automotive, because companies in all of them need to interact with their customers.