Hippocratic AI aims to combat clinician shortages in healthcare by harnessing AI agents that augment clinical staff with extra eyes, ears, and a voice. To support this vision, Hippocratic AI has launched a voice-enabled patient engagement platform and a groundbreaking app store where clinicians can develop, validate, and share AI-driven scripts tailored to a wide range of healthcare use cases. Each script undergoes rigorous safety testing before becoming available, so contributing clinicians help expand access to care and improve its quality for patients everywhere.
To achieve this vision, the platform relies on patient-facing, real-time AI inference. This capability demands substantial computing power, including NVIDIA H200 GPUs, and presents challenges in infrastructure scalability and resource allocation. To address these challenges and ensure seamless, low-latency voice interactions, Hippocratic AI is collaborating with NVIDIA to optimize inference performance for on-demand deployment.
Hippocratic AI
AWS
Customized Inference
The current healthcare system is under pressure from aging patient populations, an aging workforce, and a growing need to do more with less. This is evident in practices like triage and risk stratification, which focus on the most urgent cases but can leave some patients at risk of falling through the cracks when it comes to future care needs.
In collaboration with NVIDIA, Hippocratic AI developed Empathy Inference, a technology for swift, natural conversations that forge emotional connections with patients.
AI agents can provide an infinite supply of care, speaking every language, remembering every conversation, and being clinically safe. This can help in providing continuous and comprehensive care to everyone, addressing the limitations of human resources. For example, AI can monitor patients during heat waves, check blood pressure daily, and provide timely interventions.
Hippocratic AI developed specialized generative AI healthcare agents—built on NVIDIA technology and deployed on AWS—to help shape the future of patient care. Expanding upon this, Hippocratic AI has launched its AI Agent App Store, a groundbreaking platform that empowers clinicians to design and deploy customized AI healthcare agents—no coding required. In just under 30 minutes, healthcare professionals can create agents tailored to specific medical tasks and workflows.
The model centers on learning from healthcare experts: clinicians can create and contribute AI scripts for various healthcare use cases. These scripts are validated and safety-tested before being made available, giving patients faster access to safe, reliable care. Clinicians benefit by sharing their expertise at scale and helping to accelerate the development of new use cases.
The App Store debuts with over 300 AI agents spanning 25 medical specialties, supporting use cases such as cervical cancer check-ins, postpartum mental health monitoring, wound care, and diabetes screening. Every AI agent undergoes a rigorous three-step validation process, including licensure verification, development testing, and clinical review by a network of over 6,000 nurses and 300 physicians to ensure safety and efficacy.
“We trained our model differently than others and have always focused on inference, since our runtime is the key environment. Our model is actually 22 models—one gigantic 400B model doing the talking, with 21 others supervising it to ensure it doesn't say anything unsafe. That’s a lot of inference—22 models' worth. We're running 4.2 trillion parameters each time, so we use up a ton.”
Munjal Shah
Co-Founder and CEO of Hippocratic AI
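The figures in the quote above imply a rough size for the supervising models. The short calculation below is only back-of-the-envelope arithmetic on the quoted numbers; the 2-bytes-per-parameter (FP16/BF16) weight size is an assumption added for illustration, not something stated by Hippocratic AI.

```python
# Figures taken directly from the quote above.
TOTAL_PARAMS = 4.2e12     # "4.2 trillion parameters each time"
PRIMARY_PARAMS = 400e9    # "one gigantic 400B model doing the talking"
NUM_SUPERVISORS = 21      # "21 others supervising it"

# Parameters left over for the supervisors, and their average size.
supervisor_total = TOTAL_PARAMS - PRIMARY_PARAMS
avg_supervisor = supervisor_total / NUM_SUPERVISORS

# ASSUMPTION (not from the source): weights stored at 2 bytes per parameter.
BYTES_PER_PARAM = 2
weights_terabytes = TOTAL_PARAMS * BYTES_PER_PARAM / 1e12

print(f"Average supervisor size: {avg_supervisor / 1e9:.0f}B parameters")
print(f"Approximate weight footprint: {weights_terabytes:.1f} TB")
```

Under these numbers the 21 supervisors would average roughly 181B parameters each, which helps explain why inference capacity dominates the workload described in the quote.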
Hippocratic AI is redefining patient engagement by prioritizing safety, accuracy, and empathy in its AI-powered clinical assistants. At the core is its Polaris constellation architecture, which deploys over 25 task-specific large language models (LLMs), each with 70B+ parameters, to reduce hallucinations and ensure clinical safety. These models—totaling over a trillion parameters—run on NVIDIA H200 Tensor Core GPUs using Amazon SageMaker HyperPod, delivering ultra-low latency and deeply empathetic conversations.
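The general idea of one conversational model being checked by several supervising models can be sketched as follows. This is a minimal illustration of the pattern only, not Hippocratic AI's implementation: the names `Verdict`, `supervised_reply`, `dosage_supervisor`, and `diagnosis_supervisor` are hypothetical, and the keyword rules stand in for what would in practice be full LLMs.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Verdict:
    safe: bool
    reason: str = ""


# Hypothetical type: a supervisor inspects a draft reply and returns a verdict.
Supervisor = Callable[[str], Verdict]


def supervised_reply(draft: str, supervisors: List[Supervisor], fallback: str) -> str:
    """Return the draft only if every supervisor approves it; otherwise the fallback."""
    for check in supervisors:
        if not check(draft).safe:
            return fallback
    return draft


# Toy stand-ins for supervisor models (real ones would be LLMs, not keyword rules).
def dosage_supervisor(draft: str) -> Verdict:
    if "double your dose" in draft.lower():
        return Verdict(False, "unsupervised dosage change")
    return Verdict(True)


def diagnosis_supervisor(draft: str) -> Verdict:
    if "you have" in draft.lower():
        return Verdict(False, "states a diagnosis")
    return Verdict(True)


SUPERVISORS = [dosage_supervisor, diagnosis_supervisor]
FALLBACK = "Let me connect you with a nurse who can help with that."

print(supervised_reply("How are you feeling today?", SUPERVISORS, FALLBACK))
print(supervised_reply("You should double your dose tonight.", SUPERVISORS, FALLBACK))
```

In this sketch any single veto replaces the draft with a safe fallback, which is one simple way to trade extra inference for a lower chance of an unsafe statement reaching the patient.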
To enhance the natural flow of dialogue, the system accurately detects when a patient finishes speaking and responds without interruption—critical for building trust in clinical settings. Hippocratic AI leverages TensorRT-LLM to optimize its Polaris constellation of models, making them faster, smaller, and more efficient, which lowers costs and enables more conversations to run on the same hardware.
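The source does not describe how end-of-speech detection is done. As a deliberately simple stand-in, the sketch below uses an energy threshold with a silence timeout, a common baseline for endpointing; the function name `end_of_turn_index` and the threshold values are hypothetical, and a production system would likely use a learned model rather than this rule.

```python
from typing import Optional, Sequence


def end_of_turn_index(
    frame_energies: Sequence[float],
    silence_threshold: float = 0.01,   # hypothetical energy cutoff
    min_silence_frames: int = 25,      # hypothetical: e.g. 25 frames of 20 ms = 0.5 s
) -> Optional[int]:
    """Return the frame index at which the speaker is judged finished, or None.

    The turn ends once speech has been heard and is followed by
    `min_silence_frames` consecutive frames below `silence_threshold`.
    """
    heard_speech = False
    silent_run = 0
    for i, energy in enumerate(frame_energies):
        if energy >= silence_threshold:
            heard_speech = True
            silent_run = 0          # any speech resets the silence counter
        elif heard_speech:
            silent_run += 1
            if silent_run >= min_silence_frames:
                return i
    return None


# Leading silence, speech, a short pause, more speech, then a long silence.
energies = [0.0] * 5 + [0.5] * 10 + [0.0] * 3 + [0.4] * 4 + [0.0] * 30
print(end_of_turn_index(energies))
```

The short pause in the middle does not trigger a response because it is shorter than the timeout, which is the behavior that keeps an agent from interrupting a patient mid-sentence. The timeout also sets a floor on response latency, so faster inference downstream matters for keeping the conversation natural.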
Given the sensitivity of healthcare data and the need for strict HIPAA compliance, Hippocratic AI uses a robust multi-account, multi-cluster AWS strategy that separates production workloads from development environments. This secure, scalable infrastructure enables thousands of real-time patient interactions while maintaining precise control over performance and privacy.
Beyond the technical foundation, the real-world impact is profound. Hippocratic AI’s assistants help relieve clinician burnout by handling time-consuming tasks—from surgical prep to post-discharge follow-ups. During a recent hurricane in Florida, the system contacted 100,000 patients in one day, providing medication checks and preventative care—outreach that would be impossible to achieve manually.
“With generative AI, patient interactions can be seamless, personalized, and conversational—but in order to have the desired impact, the speed of inference has to be incredibly fast. With the latest advances in LLM inference, speech synthesis, and voice recognition software, NVIDIA’s technology stack is critical to achieving this speed and fluidity.”
Munjal Shah
Co-Founder and CEO of Hippocratic AI