NVIDIA Nemotron

Build enterprise agentic AI with benchmark-winning open reasoning and multimodal foundation models.

Overview

What Is NVIDIA Nemotron?

The NVIDIA Nemotron™ family of multimodal models provides state-of-the-art agentic reasoning for graduate-level scientific reasoning, advanced math, coding, instruction following, tool calling, and visual reasoning.

The models are optimized for different computing platforms: Nano for cost-efficiency and edge deployment, Super for balanced accuracy and compute efficiency on a single GPU, and Ultra for maximum accuracy in data centers.

The Nemotron models are commercially viable with an open license that allows for customization and data control.

NVIDIA Partners With Europe Model Builders to Accelerate Region’s Leap Into AI

European model builders are developing sovereign AI models using NVIDIA Nemotron™. These models will be delivered to Perplexity as NVIDIA NIM™ microservices and hosted on AI infrastructure from NVIDIA cloud partners.

Sovereign AI Agents Think Local, Act Global With NVIDIA AI Factories

New NVIDIA NIM capabilities and the expanded suite of NVIDIA AI Blueprints and recipes streamline full-stack AI development for nations and enterprises.

Benefits

What Does Nemotron Bring to Agentic AI?

High Accuracy

Built on popular open reasoning models for their exceptional knowledge, post-trained with high-quality training data, and aligned to reason like humans, Nemotron models achieve the highest accuracy on leading benchmarks.

High Compute Efficiency

Through the pruning of larger models, the Nemotron family is optimized for top compute efficiency, using NVIDIA TensorRT™-LLM to deliver higher throughput and on-or-off reasoning capabilities.

Commercially Viable

NVIDIA’s post-training data and optimization techniques ensure powerful, transparent, and adaptable models for developers and enterprises. Models and training data are published openly on Hugging Face.

Secure and Simple Deployment

The Nemotron model family, available as optimized NIM microservices, offers peak inference performance and flexible deployment options, ensuring superior security, privacy, and portability.

Models

Models for Diverse Workloads

Nemotron models excel in vision for enterprise optical character recognition (OCR) and in reasoning for building agentic AI. Research models are also available for experimentation and customization.

Nano

Provides superior accuracy for PC and edge devices

Super

Offers the highest accuracy and throughput in its size category to run on a single NVIDIA H100 Tensor Core GPU

Ultra

Delivers the highest agentic AI accuracy for complex systems, optimized for multi-GPU data centers

Technology

Building Blocks for Agentic AI

Start building AI agents with NVIDIA NeMo™ for custom agentic AI, NVIDIA NIM™ for fast, enterprise-ready deployment, and NVIDIA Blueprints for accelerating development with customizable reference workflows.

NVIDIA NIM

  • Speed up deployment of performance-optimized generative AI models.
  • Run your business applications with stable and secure APIs backed by enterprise-grade support.

NVIDIA Blueprints

  • Quickly get started with reference applications for generative AI use cases, such as digital humans and multimodal retrieval-augmented generation (RAG).
  • Accelerate development with Blueprints, which include partner microservices, one or more AI agents, reference code, customization documentation, and a Helm chart for deployment.

NVIDIA NeMo

  • Build, customize, and deploy generative AI and agentic AI.
  • Deliver enterprise-ready large language models (LLMs) with precise data curation, cutting-edge customization, scalable data ingestion, RAG, and accelerated performance.
  • Easily build data flywheels and continuously optimize AI agents with the latest information.

Starting Options

Ways to Get Started With Nemotron

Start Prototyping for Free

Get started with easy-to-use API endpoints for NIM, powered by DGX™ Cloud.

  • Access fully accelerated AI infrastructure.
  • Ensure your data isn't used for model training.
  • No credits, just a simple path to build, test and deploy.

Get in Touch

Talk to an NVIDIA AI specialist about moving generative AI pilots to production with the security, API stability, and support that comes with NVIDIA AI Enterprise.

  • Explore your generative AI use cases.
  • Discuss your technical requirements.
  • Align NVIDIA AI solutions to your goals and requirements.

Adopters

Enterprises Using Nemotron

Resources

Explore the Latest in Nemotron

NVIDIA Launches Family of Open Reasoning Models for Building Agentic AI Platforms

Explore the family, post-trained by NVIDIA, built on Llama, and distilled from DeepSeek-R1, and learn how the models meet business needs for deployment-ready AI agents.

Build Enterprise AI Agents With Advanced Open NVIDIA Llama Nemotron Reasoning Models

Read how NVIDIA developed the Llama Nemotron with reasoning model family, built on top of Llama open models and post-trained with the reasoning expertise of DeepSeek-R1.

Build Custom Reasoning Models to Achieve Advanced Agentic AI Autonomy

Learn how to build or customize reasoning models using various techniques including distillation and reinforcement learning

Next Steps

Ready to Get Started?

Use the right tools and technologies to take NVIDIA Nemotron models from development to production.

Get in Touch

Talk to an NVIDIA product specialist about moving from pilot to production with the security, API stability, and support that comes with NVIDIA AI Enterprise.

Stay Up to Date on NVIDIA Agentic AI News

Get the latest agentic AI news, technologies, breakthroughs, and more sent straight to your inbox.