Retail/Consumer Packaged Goods

Instacart Powers Smart Cart and Drives Sales Lift With NVIDIA Jetson and Physical AI

Image courtesy of Instacart

Objective

Eighty percent of grocery sales still happen in physical stores, but most retailers have almost no real-time intelligence about what’s happening inside them. Instacart is closing that gap with physical AI—embedding edge computing and multimodal sensors directly into smart shopping carts to deliver double-digit sales lift for retail partners.

By deploying NVIDIA Jetson™ for real-time sensor fusion, Instacart has turned a standard shopping cart into an AI platform that personalizes every trip and gives retailers visibility they’ve never had before.

Customer

Instacart

Use Case

Edge Computing
Computer Vision / Video Analytics
Recommenders / Personalization

Key Takeaways

  • Cut whole-page ranking latency by 65% and increased click-through rates by more than 5%, using NVIDIA Hopper™ GPUs and Dynamo to migrate ad-ranking workloads from CPU to GPU.
  • Scaled Caper Cart deployments across 100+ cities with 3x year-over-year growth,* using a single vision-language model that serves all stores with minimal store-specific retraining.
    *Based on Caper Cart GMV year-over-year.
  • Delivered sales lift for retail partners, using NVIDIA Jetson Orin™ NX modules for real-time sensor fusion in every Caper Cart.

Closing Retail’s Blind Spot—Where 80% of Sales Still Happen

Instacart processed more than $37 billion in annual gross transaction value in 2025 and has facilitated over 1.6 billion lifetime grocery orders across its online business. But the company’s ambition extends well beyond delivery. When Instacart went public, it led with a vision to power every grocery transaction—in-store and online.

The problem was that physical stores, still accounting for roughly 80% of grocery sales, were largely invisible to the technology that could improve them. Retailers had little real-time understanding of shelf conditions, customer movement, or in-store purchasing behavior. Retailer planograms were often inaccurate. Out-of-stock items went undetected for hours. And shrink—whether from intentional theft or honest scanning errors—was rising industrywide, with camera-only and barcode-based approaches unable to keep pace with the complexity of real basket behavior.

At the same time, consumers accustomed to personalized recommendations online were asking: Why can’t the in-store experience feel this smart? Retail partners echoed the demand, asking Instacart to help drive loyalty adoption, media revenue, and omnichannel engagement inside the store. Instacart needed a way to bring the intelligence of its online platform into the physical store—at the speed customers expect, as well as the scale retailers require.

Instacart

Deploying Sensor Fusion at the Shelf Edge With NVIDIA Jetson

Instacart’s answer is Caper Cart—a smart shopping cart equipped with a digital touchscreen, five cameras, a Weights and Measures-certified scale, location sensors, and an NVIDIA Jetson Orin NX module. Every cart is a mobile AI platform, processing multiple sensor streams simultaneously to recognize items as customers drop them into the basket, track quantities, detect removals, and deliver personalized recommendations—all in hundreds of milliseconds.

The architecture runs two systems in parallel. On the edge, the NVIDIA Jetson board fuses visual data from three basket-facing camera angles with weight data and location inputs to build a real-time picture of basket contents. Two additional outward-facing cameras face the store shelves. Weight serves as an X-ray of the basket: Even when cameras are blocked or items are stacked, the scale captures what vision alone cannot. In the cloud, vision-language model encoders run asynchronously for deeper context, and the two feeds combine to interpret user actions, item identity, and shelf context.

Physical AI dynamically perceives and responds to environments in real time. Unlike camera-only or barcode-scanning systems that rely on a single signal, Instacart’s approach fuses multiple sensor modalities directly on hardware embedded in the store environment. It’s what allows the system to handle the messiness of real retail: blocked cameras, stacked items, noisy weight signals from carts bumping along the floor, and tens of thousands of SKUs that shift from store to store.

Side-facing cameras on each cart also solve the location problem. A single model serves all stores with minimal store-specific retraining, which is critical for scaling across a 100,000-store footprint. On the cloud side, NVIDIA GPUs and NVIDIA Dynamo optimize ad-ranking and recommendation workloads. The result: millions of sensor inputs collected daily, feeding a data flywheel that improves with every trip.

Delivering Sales Lift and Real-Time Store Visibility

The gap between where retailers started—navigating their own stores without real-time insight—and where Caper Cart has taken them is measurable. The AI-powered smart cart is driven by personalized, location-aware recommendations that reach customers at the moment of decision.

By incorporating ranking signals from Instacart’s online platform—drawn from 1.6 billion lifetime grocery orders—into in-store recommendations, Instacart measured more than 1% incremental sales lift in rigorous A/B testing. A pre-checkout “got everything you need?” prompt, which combines purchase history, real-time cart location, and live shelf-mapping data to remind customers of forgotten items, drove nearly 1% additional lift. On the cloud inference side, migrating ad and recommendation workloads from CPU to GPU cut whole-page ranking latency by 65% and item ranking latency by 40%, while click-through rates—a direct proxy for revenue—increased by more than 5%.

The same sensor fusion system also tackles shrink, catching item-swapping, stacking, and multi-item discrepancies that camera-only or barcode-scanning approaches miss. Meanwhile, side-facing cameras continuously build a near real-time map of each store—tracking shelf conditions, item locations, and out-of-stock situations—giving retailers operational visibility they have never had before.

Adoption is accelerating. Caper Carts are now live in more than 100 cities—across Kroger, Wegmans, Coles, Schnucks, and Wakefern—with deployment tripling year over year. What started as a convenience play has become a top-line growth driver, and retailers are scaling accordingly.

“At one point, it flipped from retailers thinking about Caper Cart as an efficiency play to this being a revenue generator. That’s what’s driving the groundswell of adoption—retailers are seeing this as a way to generate new revenue in their stores.”

David McIntosh
Chief Connected Stores Officer, Instacart

The Grocery Store Becomes an Intelligent, Self-Optimizing System

Instacart’s ambitions extend well beyond smart carts. The company is building what it calls a grocery world model—a foundational AI system that combines its commerce graph (more than 1.6 billion lifetime orders, over 2 billion product instances refined into more than 22 million unique items) with real-time sensor data from Caper Carts and Instacart shoppers scanning shelves. The goal: a continuously learning system that understands how products relate to each other, how customers make decisions, and how a store operates physically and commercially.

Instacart is already extending this intelligence into agentic workflows. An AI-powered operations assistant lets store managers query shelf conditions in natural language—surfacing out-of-stock patterns, flagging underperforming displays, and moving toward automated merchant coordination. And as the sensor network grows richer—capturing item weight, 3D representations, and precise shelf location—Instacart sees potential to unlock robotics applications in the store, providing the spatial and product intelligence that autonomous systems need to operate.

By fusing edge AI in the physical store with cloud-scale intelligence online, Instacart and NVIDIA are proving that physical AI isn’t a future concept—it’s already reshaping how grocery retail operates, one cart at a time.

NVIDIA AI Podcast: Inside Instacart's AI-Powered Smart Shopping Cart

Listen how Instacart is using edge AI to digitize the grocery store. David McIntosh, Chief Connected Stores Officer at Instacart, discusses how Caper Carts use NVIDIA Jetson, sensor fusion, and real-time data to recognize items, personalize recommendations, support store operations, and help unify online and in-store shopping experiences.

Explore how NVIDIA AI is powering smarter retail stores.

Related Customer Stories