Generative AI

NVIDIA NeMo

Build, customize, and deploy generative AI.

Get Started

Video | Solution Brief | For Developers

Introduction
Benefits
Features
Ecosystem
Resources
Next Steps

Introduction
Benefits
Features
Ecosystem
Resources
Next Steps

Learn More

What Is NVIDIA NeMo?

NVIDIA NeMo™ is an end-to-end platform for developing custom generative AI—including large language models (LLMs), multimodal, vision, and speech AI —anywhere. Deliver enterprise-ready models with precise data curation, cutting-edge customization, retrieval-augmented generation (RAG), and accelerated performance. NeMo is a part of the NVIDIA AI Foundry, a platform and service for building custom generative AI models with enterprise data and domain-specific knowledge.

Generative AI Essentials

Get on the fast-track to enterprise transformation with generative AI. This series of on-demand webinars offers a roadmap to accelerated development and deployment, offering the knowledge you need to take full advantage of this breakthrough technology.

Watch Webinars

Explore the Benefits of NVIDIA NeMo for Generative AI

Flexible

Train and deploy generative AI anywhere, across clouds, data centers, and the edge.

Production Ready

Deploy into production with a secure, optimized, full-stack solution that offers support, security, and API stability as part of NVIDIA AI Enterprise.

Increased ROI

Quickly train, customize, and deploy large language models (LLMs), vision, multimodal, and speech AI at scale, reducing time to solution and increasing ROI.

Accelerated Performance

Maximize throughput and minimize LLM training time with multi-node, multi-GPU training and inference.

End-to-End Pipeline

Experience the benefits of a complete solution for the LLM pipeline—from data processing and training to inference of generative AI models.

Complete Solution for Building Enterprise-Ready LLMs

The Features of NVIDIA NeMo

Accelerate the adoption of custom generative AI with NVIDIA NeMo microservices. Our advanced custom generative AI platform is now available as a set of microservices, offering a range of benefits to enterprises across industries.

Accelerate Data Curation

NeMo Curator

NVIDIA NeMo Curator is a GPU-accelerated data-curation tool that enables large-scale, high-quality datasets for pretraining LLMs.

Read the Blog

Try Tutorial Notebooks

Apply for Early Access

NeMo Curator for Accelerated Data Curation for LLMs

Simplify Fine-Tuning

NeMo Customizer

NVIDIA NeMo Customizer is a high-performance, scalable microservice that simplifies fine-tuning and alignment of LLMs for domain-specific use cases, making it easier to adopt generative AI across industries.

Read the Blog

Apply for Early Access

NeMo Customize to Simplify Fine-Tuning of LLMs

Evaluate Models

NeMo Evaluator

NVIDIA NeMo Evaluator provides automatic assessment of custom generative AI models across academic and custom benchmarks on any platform.

Read the Blog

Apply for Early Access

NeMo Evaluator for Evaluating Custom Generative AI Models

Seamless Data Retrieval

NeMo Retriever

NVIDIA NeMo Retriever is a collection of generative AI microservices that enable organizations to seamlessly connect custom models to diverse business data and deliver highly accurate responses.

Check out the following blogs to learn more about NeMo Retriever.

Read the Blog on Developing Production-Grade Text Retrieval Pipelines for RAG

Start Prototyping

NeMo Retriever Generative AI Microservices for Seamless Data Retreival

Generative AI Guardrails

NeMo Guardrails

NVIDIA NeMo Guardrails orchestrates dialog management, ensuring accuracy, appropriateness, and security in smart applications with LLMs. It safeguards organizations overseeing generative AI systems.

See the Demo

Access on GitHub

NeMo Gaurdrails for Safegaurding Generative AI

Generative AI Inference

NVIDIA NIM™

NVIDIA NIM, part of NVIDIA AI Enterprise, is a set of easy-to-use microservices designed for secure, reliable deployment of high-performance AI model inferencing across clouds, data centers, and workstations.

Learn More

Read the Blog

Start Prototyping

NVIDIA Inference Microservices for Generative AI Deployment

Scale Your Business Applications With Generative AI

Experience, prototype, and deploy AI with production-ready APIs that run anywhere.

Get Started

NVIDIA NeMo in the News

Check out the latest NVIDIA press releases to see how NeMo and generative AI are impacting diverse industries, partner collaborations, and more.

Learn More

Leading Adopters Across Industries

Customers
Partners

Resources

Featured

Building and Deploying Generative AI Models

Enterprises are turning to generative AI to revolutionize the way they innovate, optimize operations, and build a competitive advantage. NeMo is an end-to-end, cloud-native framework for curating data, training and customizing foundation models, and running inference at scale. It supports text-to-text, text-to-image, and text-to-3D models and image-to-image generation.

Watch Now

Generative AI for Developers

Unlock the power of generative AI with an accelerated computing platform—including full-stack optimizations, an innovative chip architecture, acceleration libraries, and application development frameworks—and hands-on technical training.

Explore Developer Resources

Elevate Your LLM Skills

Take advantage of our comprehensive LLM learning path, covering fundamental to advanced topics featuring hands-on training developed and delivered by NVIDIA experts. You can opt for the flexibility of self-paced courses or enroll in instructor-led workshops to earn a certificate of competency.

Explore LLM Training

Get Certified by NVIDIA

Showcase your Generative AI skills and advance your career by getting certified by NVIDIA. Our new professional certification program offers two developer exams focusing on proficiency in large language models (LLMs) and multimodal workflow skills.

Learn About Certification

Large Language Models for Enterprise Solutions

Whether you’re a data scientist looking to build custom models or a chief data officer exploring the potential of LLMs for your organization, read on for valuable insights and guidance.

Read Blog See All Enterprise LLM Blogs

Mastering LLM Techniques

Learn what the most pressing challenges are when doing LLM inference. Get a basic understanding of transformer architecture, the attention mechanism in general, and access to practical solutions.

Read Blog See All Technical LLM Blogs

Unlock the Power of Enterprise LLMs With NVIDIA NeMo

See how NVIDIA NeMo helps organizations streamline the development and deployment of custom LLMs, ultimately facilitating seamless integration of AI capabilities within business operations through NVIDIA AI Enterprise.

Read Blog See All NeMo Technical Blogs See All NeMo Enterprise Blogs

Adopt Large Language Models

Explore the latest tools, optimizations, and best practices for large language models.

Watch LLM GTC Sessions

Tap Into the Power of RAG

Tap into the power of retrieval-augmented generation (RAG) with insights and best practices from visionary CEOs, data scientists, and others.

Watch GTC Sessions on RAG

Deploy Large Language Models

Learn how inference for LLMs is driving breakthrough performance for AI-enabled applications and services.

Watch LLM Inference GTC Sessions

Dropbox

Bringing Personalized Generative AI to Customers

Dropbox plans to leverage NVIDIA’s AI foundry to build custom models and improve AI-powered knowledge work with the Dropbox Dash universal search tool and Dropbox AI.

Learn More

Writer

Startup Pens Generative AI Success Story With NeMo

Using NVIDIA NeMo, Writer is building LLMs that are helping hundreds of companies create custom content for enterprise use cases across marketing, training, support, and more.

Learn More

Amdocs

Bringing Custom Generative AI to the Global Telco Industry

Amdocs plans to build custom LLMs for the $1.7 trillion global telecommunications industry using NVIDIA’s AI foundry on Microsoft Azure.

Learn More

Generative AI and LLM News

Stay up to date on the latest breakthroughs and developments, and get notified first when new technologies are available.

Join the Community

Access Developer Resources

Join the NVIDIA Developer Program to get access to generative AI tools, AI models, training, documentation, expert forums, and more.

Join the Developer Program

Accelerate Your Startup

Join the NVIDIA Inception program to get access to generative AI resources, preferred pricing, and exposure to venture capitalists at all stages.

Learn More and Apply

Next Steps

Ready to Get Started?

Get immediate access to training and inference tools to make generative AI model development easy, cost-effective, and fast.

Get Started Apply for Early Access

Multimodal PDF Data Extraction NIM Agent Blueprint

Use NeMo Retriever NIM microservices to unlock highly accurate insights from massive volumes of enterprise data.

Try Now

Get in Touch

Talk to an NVIDIA product specialist about moving from pilot to production with the assurance of security, API stability, and support that comes with NVIDIA AI Enterprise.

Build Custom Generative AI Models With NVIDIA AI Foundry

Access foundation models, enterprise software, accelerated computing, and AI expertise to build, fine-tune, and deploy custom models for your enterprise applications.

Get Started

NVIDIA NeMo

What Is NVIDIA NeMo?

Generative AI Essentials

Explore the Benefits of NVIDIA NeMo for Generative AI

Flexible

Production Ready

Increased ROI

Accelerated Performance

End-to-End Pipeline

Complete Solution for Building Enterprise-Ready LLMs

The Features of NVIDIA NeMo

Accelerate Data Curation

Simplify Fine-Tuning

Evaluate Models

Seamless Data Retrieval

Generative AI Guardrails

Generative AI Inference

Scale Your Business Applications With Generative AI

NVIDIA NeMo in the News

Leading Adopters Across Industries

Customers

Partners

Resources

Building and Deploying Generative AI Models

Generative AI for Developers

Elevate Your LLM Skills

Get Certified by NVIDIA

Large Language Models for Enterprise Solutions

Mastering LLM Techniques

Unlock the Power of Enterprise LLMs With NVIDIA NeMo

Adopt Large Language Models

Tap Into the Power of RAG

Deploy Large Language Models

Bringing Personalized Generative AI to Customers

Startup Pens Generative AI Success Story With NeMo

Bringing Custom Generative AI to the Global Telco Industry

Generative AI and LLM News

Access Developer Resources

Accelerate Your Startup

Next Steps

Ready to Get Started?

Multimodal PDF Data Extraction NIM Agent Blueprint

Get in Touch

Build Custom Generative AI Models With NVIDIA AI Foundry

AI Sweden

Accelerate Industry Applications With LLMs

Amazon

How Amazon and NVIDIA Help Sellers Create Better Product Listings With AI

Amdocs

NVIDIA and Amdocs Bring Custom Generative AI to Global Telco Industry

AWS

NVIDIA Powers Training for Some of the Largest Amazon Titan Foundation Models

Azure

Harnessing the Power of NVIDIA AI Enterprise on Azure Machine Learning

Bria

Bria Builds Responsible Generative AI for Enterprises Using NVIDIA NeMo, Picasso

Cohesity

Unlock Your Data Superpower: NVIDIA Microservices Unleash Enterprise-Grade Secure Generative AI for Cohesity

CrowdStrike

Shaping the Future of AI in the Cybersecurity Domain

Dell

Dell Validated Design for Generative AI With NVIDIA

Deloitte

Unlock the Value of Generative AI Across Enterprise Software Platforms

Domino Data Lab

Domino Offers Production-Ready Generative AI Powered by NVIDIA

Dropbox

Dropbox and NVIDIA to Bring Personalized Generative AI to Millions of Customers

Google Cloud

AI Titans Collaborate to Create Generative AI Magic

HuggingFace

Leading AI Community to Accelerate Data Curation Pipeline

KT

Creating New Customer Experiences With LLMs

Lenovo

New Reference Architecture for Generative AI Based on LLMs

Quantiphi

Enabling Enterprises to Fast-Track Their AI-Driven Journeys

SAP

SAP and NVIDIA Accelerate Generative AI Adoption Across Enterprise Applications Powering Global Industries