MolmoBot proposes a provocative but practical idea: what if you could train robots that can manipulate real objects without touching a single physical robot during training? AllenAI releases a full suite trained exclusively in simulation that achieves zero-shot transfer to real robots, and sparks an important conversation about how we scale robotics now that perception and reasoning have come so far.
What MolmoBot is and why it matters
MolmoBot is a suite of robotic manipulation policies trained entirely with synthetic data. It's not just a model: it's the whole stack. AllenAI publishes the training data, the tools to generate it (MolmoSpaces), the training code, and a technical report so others can reproduce and extend the work.
Why is this a game changer? Because the biggest practical bottleneck in robotics has been collecting costly, manual real-world data. Projects like Open X-Embodiment and DROID show the scale of that problem: millions of trajectories or hundreds of hours of teleoperation. MolmoBot proposes shifting the bottleneck to designing better virtual worlds—something that scales with compute and open access.
How they trained everything in simulation
The core recipe mixes three concrete ingredients:
- MolmoSpaces: an open platform for procedurally generating environments and trajectories.
- MolmoBot-Data: millions of expert synthetic trajectories generated with MuJoCo, heavy domain randomization, and active variations in objects, lighting, textures, cameras, and dynamics.
- Training from RGB observations only, via behavior cloning on those trajectories.
The strategy is ambitious: instead of using simulation as support, they make it the sole data source. To close the sim-to-real gap they bet on extreme diversity in scenarios and virtual sensors: fully randomized cameras, object models taken from iTHOR and Objaverse, and aggressive physical variations.
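To make the domain-randomization idea concrete, here is a minimal sketch of per-episode scene sampling. All names and parameter ranges below are illustrative placeholders, not values taken from MolmoSpaces:

```python
import random

def sample_scene_config(rng: random.Random) -> dict:
    """Sample one randomized scene configuration per episode.

    Ranges are hypothetical, chosen only to illustrate the idea of
    varying cameras, lighting, textures, and dynamics every episode.
    """
    return {
        # Camera pose jitter: position offset (meters) and yaw (degrees)
        "camera_offset": [rng.uniform(-0.1, 0.1) for _ in range(3)],
        "camera_yaw_deg": rng.uniform(-30.0, 30.0),
        # Lighting: intensity multiplier and a warm/cool RGB tint
        "light_intensity": rng.uniform(0.3, 2.0),
        "light_color": [rng.uniform(0.7, 1.0) for _ in range(3)],
        # Object appearance: texture picked from a hypothetical asset pool
        "texture_id": rng.randrange(10_000),
        # Dynamics: friction and mass scaling to vary contact behavior
        "friction": rng.uniform(0.5, 1.5),
        "mass_scale": rng.uniform(0.8, 1.2),
    }

rng = random.Random(0)
configs = [sample_scene_config(rng) for _ in range(3)]
```

The point is that every trajectory is generated under a fresh draw, so the policy never sees the same camera, lighting, or dynamics twice.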
Relevant technical details
- Physics engine: MuJoCo for realistic contact and manipulation simulation.
- Signals used: during training they can generate depth and privileged metadata, but the policies learn only from RGB, which makes the transfer more notable.
- Supervision: behavior cloning at scale, with no reinforcement learning or real-world fine-tuning.
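Behavior cloning itself reduces to supervised regression on expert actions. The real policies use large vision-language backbones; this toy NumPy sketch (a linear "policy" on flattened features, with hypothetical dimensions) only shows the loss and update:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in: flattened RGB features (D) -> action vector (A)
D, A, N = 64, 7, 32
W = np.zeros((D, A))                      # linear "policy" weights
obs = rng.normal(size=(N, D))             # batch of observation features
expert_actions = rng.normal(size=(N, A))  # expert action labels

def bc_step(W, obs, actions, lr=1e-2):
    """One behavior-cloning step: MSE between predicted and expert actions."""
    pred = obs @ W
    err = pred - actions
    loss = float(np.mean(err ** 2))
    # Gradient of mean squared error with respect to W
    grad = 2.0 * obs.T @ err / (obs.shape[0] * actions.shape[1])
    return W - lr * grad, loss

W, loss0 = bc_step(W, obs, expert_actions)
W, loss1 = bc_step(W, obs, expert_actions)
assert loss1 < loss0  # the training loss decreases on this batch
```

Everything interesting in MolmoBot happens before this step: the quality and diversity of the synthetic (obs, action) pairs, not the loss itself.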
Architectures and tasks
MolmoBot isn't a single network. It's a family that covers different trade-offs of capacity and compute:
- MolmoBot: the main vision-language policy, built on the Molmo2 backbone. It processes multiple timesteps of RGB and natural language instructions and achieves the best metrics.
- MolmoBot-SPOC: a lightweight variant adapted from the SPOC design, parameter-efficient and useful where compute is limited.
- MolmoBot-Pi0: uses the PaliGemma backbone with an action head to compare directly with the π family of Physical Intelligence.
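What the variants share is the interface: a short history of RGB observations plus an instruction, mapped to a low-level action. A hypothetical sketch of that contract (not AllenAI's actual API):

```python
from dataclasses import dataclass
from typing import List

@dataclass
class Observation:
    rgb: bytes          # one encoded RGB frame (placeholder)
    timestamp: float

class ManipulationPolicy:
    """Illustrative interface: RGB history + language -> low-level action."""

    def __init__(self, history_len: int = 4, action_dim: int = 7):
        self.history_len = history_len
        self.action_dim = action_dim

    def act(self, history: List[Observation], instruction: str) -> List[float]:
        # A real variant (MolmoBot, -SPOC, -Pi0) would run its backbone here;
        # this stub just truncates the history and returns a zero action.
        frames = history[-self.history_len:]
        assert frames and instruction
        return [0.0] * self.action_dim

policy = ManipulationPolicy()
obs = [Observation(rgb=b"", timestamp=float(t)) for t in range(6)]
action = policy.act(obs, "pick up the red mug")
```

The trade-off between variants is then purely about what sits inside `act`: backbone size, context length, and compute budget.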
Tasks evaluated (on two real platforms: Rainbow RB-Y1 and Franka FR3):
- Pick-and-place (Franka FR3).
- Manipulation of articulated objects: drawers, microwaves, doors (RB-Y1).
- Door opening and mobile manipulation (RB-Y1).
Tasks can be specified in natural language or with simple commands like "pick", "place" or "close".
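A short command like those above can be thought of as a verb plus a target. This parsing sketch is hypothetical (MolmoBot's actual command interface may differ):

```python
SUPPORTED_VERBS = {"pick", "place", "open", "close"}

def parse_command(command: str) -> dict:
    """Split a simple command like "pick mug" into verb + target object."""
    verb, _, target = command.strip().lower().partition(" ")
    if verb not in SUPPORTED_VERBS:
        raise ValueError(f"unsupported verb: {verb!r}")
    return {"verb": verb, "target": target or None}

spec = parse_command("pick red mug")
# spec == {"verb": "pick", "target": "red mug"}
```

Free-form natural language skips this structure entirely and goes straight to the vision-language backbone.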
Results: zero-shot sim-to-real and comparisons
With no real-world tuning, MolmoBot transfers to both robots and to objects and scenes unseen during training. In pick-and-place it outperforms π0.5 (a model trained with large-scale real data) and performs competitively with π0 under standardized protocols.
They also tested robustness to unseen visual changes: camera variants, lighting shifts, and even a different renderer at evaluation. Those tests show that scale and synthetic diversity can compensate for the lack of real data in many manipulation tasks.
Limitations and open questions
Not everything is solved. A few limitations to keep in mind:
- Scope of transfer: the suite shows that many everyday manipulation tasks are reachable, but it doesn't guarantee that every real-world condition is covered. Edge cases and failures remain instructive.
- Complex physics and contacts: MuJoCo is powerful, but certain fine interactions with deformable materials or emergent behaviors may need better physics models or real validation.
- Task definitions and metrics: paper-to-paper comparisons are sensitive to success criteria and setup details; AllenAI tries to match protocols, but it isn't always trivial.
Does this mean real data collection disappears? Not entirely. It means its role shifts: instead of being the sole source of supervision, real data can be used to calibrate digital twins, validate policies, and close specific gaps.
What changes for researchers and entrepreneurs
- Democratization: labs with fewer resources can experiment with real manipulation without investing in many hours of teleoperation.
- Faster iteration: generating virtual worlds and retraining policies is much cheaper than deploying robot fleets and collecting data.
- Reproducible research: publishing synthetic data, pipelines, and code makes it easier to replicate and compare methods.
If you work on manipulation, sim-to-real, or instruction grounding in the physical world, MolmoBot gives you tools ready to try on your robot or benchmark. The authors explicitly invite others to find the weak points: those failures will guide the next generation.
MolmoBot isn't the final word, but it's a strong proof that simulation, done at scale and with diversity, can shift the main burden of data acquisition. The conversation now moves to how we design rich, varied virtual worlds and how much we can trust policies trained only in simulation.
