Connecting developers to a global network of GPU compute.
Overview
Meet NVIDIA DGX™ Cloud Lepton, an AI platform that unifies your compute so you can build, train, and deploy without thinking about infrastructure. Designed for AI natives, model builders, and teams that iterate quickly, DGX Cloud Lepton brings compute environments together into one consistent experience for development, training, and inference. You keep a single workflow, avoid rearchitecting when compute changes, and move from prototype to production faster across the regions and providers you choose.
It connects developers to global GPU compute across a network of cloud providers, offering a unified experience and the flexibility to develop and scale AI apps across multi-cloud environments.
How It Works
DGX Cloud Lepton brings your AI compute together to provide a unified experience for development, training, and inference. The platform includes integrated tools that streamline the path from prototype to production without the complexity of managing underlying infrastructure.
The platform connects a global network of NVIDIA Cloud Partners (NCPs), GPU marketplaces, cloud providers, and local environments, so you can discover, develop, and deploy AI workloads from a single, developer-friendly place.
Features and Benefits
Start building with instant access to NVIDIA’s accelerated APIs at build.nvidia.com, including serverless endpoints, prebuilt NVIDIA NIM™ microservices, and GPU-backed compute. When it’s time to scale, NVIDIA DGX Cloud Lepton powers seamless customization and deployment across a global network of GPU cloud providers.
Decouple the AI platform from the underlying infrastructure to deploy AI applications across multi-cloud environments with minimal operational burden, using integrated services for inference, testing, and training workloads.
Bring compute from specific regions to comply with data sovereignty regulations and meet low-latency requirements for sensitive workloads.
Boost productivity with a unified experience across development, training, and inference, including the ability to bring best-fit GPUs to DGX Cloud Lepton.
Ecosystem
Access NVIDIA accelerated computing from your choice of regions through a vast network of cloud providers.
Next Steps