NVIDIA and Reachy Mini: physical agents with DGX Spark
Today at CES 2026, NVIDIA showed how to turn AI agents into physical desktop companions using DGX Spark and Reachy Mini. Can you imagine an assistant that sees you, talks to you, and can move an arm to help, all controlled by open models and running on your own hardware? Here I walk you through the architecture, the components, and how to replicate the demo step by step.
What NVIDIA presented at CES 2026
NVIDIA combined several open building blocks to create agents that think, see, and act in the real world. Key pieces include:
Compute hardware: DGX Spark for local, accelerated inference.
Physical or simulated endpoint: Reachy Mini (or its simulator).
The idea is simple and powerful: don’t rely on a single "do-it-all" model. Instead, combine specialized models (text, vision, voice), a router that decides where each request goes, and a tools layer that performs physical actions or external queries.
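To make the tools layer concrete, here is a minimal sketch of the pattern: plain Python functions registered under names the agent can invoke. The decorator and function names are illustrative assumptions, not the NeMo Agent Toolkit's actual tool-registration API.
# Illustrative tools layer: functions the agent can call by name.
# The registration mechanism here is an assumption, not the toolkit's real API.
from typing import Callable

TOOLS: dict[str, Callable[..., str]] = {}

def tool(name: str):
    """Register a function under a name the agent can reference."""
    def decorator(fn: Callable[..., str]) -> Callable[..., str]:
        TOOLS[name] = fn
        return fn
    return decorator

@tool("wave_hello")
def wave_hello() -> str:
    # In the real demo this would command Reachy Mini's actuators.
    return "Reachy waves hello."

@tool("web_search")
def web_search(query: str) -> str:
    # Placeholder for an external query the agent might delegate.
    return f"Search results for: {query}"

def call_tool(name: str, **kwargs) -> str:
    """Dispatch a tool call decided by the reasoning LLM."""
    return TOOLS[name](**kwargs)
The reasoning LLM only has to emit a tool name and its arguments; the actual dispatch stays in ordinary, inspectable Python.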
Why Reachy Mini matters
Why a small robot on your desk? Because it turns a chat interface into a multisensory experience. When the agent can see through a camera, speak in natural voice, and move actuators, interaction feels more natural and useful.
Reachy Mini is designed to be hackable: sensors, actuators, and APIs accessible from Python. You can run it in simulation to develop, or on real hardware to deploy, while keeping full control over models, prompts, and actions.
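To give a feel for that hackability, here is a tiny sketch of what desk-side control looks like in spirit. The class and method names are hypothetical stand-ins so the snippet runs without hardware; the real client classes and calls come from the Reachy Mini SDK docs.
# Hypothetical sketch only: a stand-in object instead of the real SDK client,
# so this runs anywhere. Swap it for the actual Reachy Mini SDK class when deploying.
class FakeReachy:
    """Stand-in for the SDK client; prints instead of moving motors."""

    def look_at(self, x: float, y: float) -> None:
        print(f"Looking at ({x}, {y})")

    def say(self, text: str) -> None:
        print(f"Speaking: {text}")

robot = FakeReachy()        # with the real SDK you would connect to the daemon here
robot.look_at(0.2, -0.1)    # orient the head toward a point of interest
robot.say("Hello from my desk!")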
Demo architecture and key components
Components and general responsibilities:
Nemotron reasoning LLM: handles decisions and complex reasoning.
Nemotron VLM: interprets images and answers visual questions.
Router (routing_llm): decides the route for each request (chit_chat, image_understanding, other); see the sketch below.
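Below is a minimal sketch of that routing step. The classifier here is a rule-based stand-in for the routing LLM, and the handler functions are stubs; in the demo those calls go to the Nemotron endpoints.
# Illustrative router: classify a request, then dispatch to the right model.
# The stub handlers stand in for the Nemotron endpoints used in the demo.
def call_small_llm(msg: str) -> str:
    return f"[small chit-chat model] {msg}"

def call_vlm(msg: str) -> str:
    return f"[Nemotron VLM] {msg}"

def call_reasoning_llm(msg: str) -> str:
    return f"[Nemotron reasoning LLM] {msg}"

def classify(user_message: str, has_image: bool) -> str:
    """Stand-in for the routing_llm call: return one of the known routes."""
    if has_image:
        return "image_understanding"
    if len(user_message) < 80:
        return "chit_chat"
    return "other"

def handle(user_message: str, has_image: bool = False) -> str:
    route = classify(user_message, has_image)
    handlers = {
        "chit_chat": call_small_llm,
        "image_understanding": call_vlm,
        "other": call_reasoning_llm,
    }
    return handlers[route](user_message)

print(handle("Hi, how are you?"))
print(handle("Describe this photo", has_image=True))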
Configure keys if you access remote endpoints (not needed if self-hosting):
Create a .env file with:
NVIDIA_API_KEY=your_nvidia_api_key_here
ELEVENLABS_API_KEY=your_elevenlabs_api_key_here
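The demo injects these keys through uv's --env-file flag, so no extra code is needed there. If you write standalone scripts against the remote endpoints, a quick sanity check that the keys are visible looks like this:
import os

# Warn early if a key from .env was not exported into the environment.
for key in ("NVIDIA_API_KEY", "ELEVENLABS_API_KEY"):
    if not os.environ.get(key):
        print(f"Warning: {key} is not set; remote endpoints will reject requests.")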
Start the NeMo Agent Toolkit (nat) to serve the chat workflow:
cd nat
uv venv
uv sync
uv run --env-file ../.env nat serve --config_file src/ces_tutorial/config.yml --port 8001
Verify with a test request (curl example):
curl -s http://localhost:8001/v1/chat/completions -H "Content-Type: application/json" -d '{"model":"test","messages":[{"role":"user","content":"What is the capital of France?"}]}'
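The same check from Python, pointing the standard openai client at the locally served endpoint (the model name is a placeholder, exactly as in the curl call):
from openai import OpenAI

# The nat server exposes an OpenAI-compatible chat completions route.
client = OpenAI(base_url="http://localhost:8001/v1", api_key="not-needed")

resp = client.chat.completions.create(
    model="test",  # placeholder, mirroring the curl example
    messages=[{"role": "user", "content": "What is the capital of France?"}],
)
print(resp.choices[0].message.content)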
Start the Reachy daemon (simulated by default):
cd bot
macOS:
uv run mjpython -m reachy_mini.daemon.app.main --sim --no-localhost-only
Linux:
uv run -m reachy_mini.daemon.app.main --sim --no-localhost-only
Start the bot service (Pipecat client):
cd bot
uv venv
uv sync
uv run --env-file ../.env python main.py
Open the Pipecat Playground at http://localhost:7860/, click CONNECT, and grant microphone/camera access.
You’ll have two key windows: the Reachy simulation and the Pipecat Playground. When the system shows STATUS READY, the bot will greet you and you can start testing text or vision prompts.
Test examples:
Text: "Explain what you can do in one sentence."
Vision: "What am I holding in front of the camera?"
Security and performance considerations
Privacy: running locally on your DGX Spark and controlling endpoints gives you full visibility of the data flow.
Latency: route queries to small models for chit-chat, and only call VLMs when needed to reduce cost and latency.
Profiling: the NeMo Agent Toolkit provides metrics for tokens, latency, and bottlenecks; use them to tune models and routes, and complement them with simple client-side timings like the sketch below.
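A minimal client-side timing sketch, useful for comparing how the router behaves on a light chit-chat prompt versus a heavier reasoning prompt (the toolkit's server-side metrics remain the authoritative source):
import time

from openai import OpenAI

client = OpenAI(base_url="http://localhost:8001/v1", api_key="not-needed")

def timed_request(prompt: str) -> float:
    """Measure wall-clock latency of a single chat request, client side only."""
    start = time.perf_counter()
    client.chat.completions.create(
        model="test",  # placeholder, as in the earlier examples
        messages=[{"role": "user", "content": prompt}],
    )
    return time.perf_counter() - start

# Compare a chit-chat prompt with a heavier reasoning prompt.
for prompt in ("Hi there!", "Plan a three-step approach to tidy my desk, reasoning it out."):
    print(f"{prompt[:40]!r}: {timed_request(prompt):.2f}s")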
Where to go next if you want to dig deeper
Self-host models with NIM or vLLM to reduce endpoint dependency.
Review NVIDIA’s LLM Router example for advanced routing policies.
Explore the Reachy Mini SDK and docs to create more complex physical behaviors.
Contribute to or adapt the repo for community apps and specific use cases.
Turning a text agent into a physical presence controlled by open models changes how we think about personal assistants: it stops being a black box and becomes a system you can control, extend, and explore.