DR Tulu launches open recipe for deep research
DR Tulu is an open effort to get models to do deep research: plan, search and synthesize information from many sources to produce long, well-supported answers with clear citations. Sounds complex? It is. Is it useful today? Also yes.
What DR Tulu is and why it matters
DR Tulu is the first open model trained specifically for long-form research tasks using an end-to-end recipe that combines SFT (supervised fine-tuning) and a new variant of RL they call RLER (Reinforcement Learning with Evolving Rubrics). The main idea is to train agents that don’t just answer, but investigate: plan, call search tools, gather evidence and document each claim with verifiable citations.
Why is this relevant now? Because many powerful research agents are proprietary. DR Tulu proposes a reproducible alternative: model, code, agent library and the full recipe under a permissive license.
Practically speaking: DR Tulu-8B ties with or outperforms several proprietary agents on long-form research benchmarks, at a per-query cost thousands of times lower.
How it works: agent, tools and MCP
At inference time, DR Tulu runs an autonomous search loop and chooses among three actions: think to plan, call_tool to invoke search or browsing tools, and answer to produce the final response.
Final answers include citation tags that link to the sources used. That makes it easier to audit the agent’s steps and verify that claims are actually grounded.
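To make the loop concrete, here is a minimal sketch of that action cycle. The action names (think, call_tool, answer) come from the post; everything else (function names, the toy policy) is a stand-in, not the real DR Tulu code:

```python
# Hypothetical sketch of the think / call_tool / answer loop.
# `generate_action` and `run_tool` are placeholders, not the real DR Tulu API.
from dataclasses import dataclass

@dataclass
class Action:
    kind: str          # "think", "call_tool" or "answer"
    payload: str = ""  # plan text, tool query or final answer
    tool: str = ""     # e.g. "google_search", "web_browse", "paper_search"

def generate_action(question: str, history: list) -> Action:
    """Placeholder for the policy model deciding the next step."""
    if not history:
        return Action("think", payload=f"Break down: {question}")
    if len(history) < 3:
        return Action("call_tool", tool="google_search", payload=question)
    return Action("answer", payload="Summary of findings <cite id=1>...</cite>")

def run_tool(action: Action) -> str:
    """Placeholder for an MCP tool call; would hit a real searcher or browser."""
    return f"[snippets for '{action.payload}' from {action.tool}]"

def research(question: str) -> str:
    history = []
    while True:
        action = generate_action(question, history)
        if action.kind == "think":
            history.append(("think", action.payload))
        elif action.kind == "call_tool":
            history.append(("evidence", run_tool(action)))
        else:  # "answer": final response with citation tags
            return action.payload

print(research("What is known about BRCA1 variants of uncertain significance?"))
```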
To handle different domains, DR Tulu uses the Model Context Protocol or MCP, which treats tools as interchangeable components. In the default setup it offers:
google_search for web snippets
web_browse to extract full text from pages
paper_search for relevant paragraphs from open-access papers
Thanks to dr-agent-lib you can swap or add searchers, private databases or domain-specific readers without retraining the model.
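For example, exposing a private search index is just another MCP server. A minimal sketch, assuming the MCP Python SDK's FastMCP helper; the tool name and retrieval logic are placeholders, and how dr-agent-lib registers the new tool is not shown here:

```python
# Minimal MCP tool server sketch (assumes the `mcp` Python SDK's FastMCP helper).
# The tool body is a placeholder; point it at your own database or search index.
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("internal-docs-search")

@mcp.tool()
def internal_docs_search(query: str, top_k: int = 5) -> list[str]:
    """Return the top_k passages from a private document store (placeholder)."""
    # Replace with a real retriever, e.g. a vector store or full-text search.
    return [f"passage {i} matching '{query}'" for i in range(top_k)]

if __name__ == "__main__":
    mcp.run()  # exposes the tool over stdio so an MCP client can call it
```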
The training recipe: SFT + RLER
Training for long-form research has two key challenges: there isn’t a single “correct” answer, and static rubrics break when the model learns to exploit their weaknesses. DR Tulu solves this in two phases.
SFT for the cold start
Before RL, they apply SFT with trajectories generated by GPT-5. Those trajectories include simulated chain-of-thought steps, tool calls and final responses with formatting and citations.
The goal is for the model to learn the protocol: when to call tools, how to structure an answer and how to cite. That warm start avoids the poor exploration and ineffective tool calls that show up when RLER is applied from scratch.
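The post doesn't fix a trajectory schema, but a single SFT example could look roughly like this (field names are purely illustrative):

```python
# Hypothetical shape of one SFT trajectory (field names are illustrative only).
sft_example = {
    "question": "What evidence links gene X to disease Y?",
    "steps": [
        {"action": "think", "content": "Need recent reviews plus primary studies."},
        {"action": "call_tool", "tool": "paper_search",
         "args": {"query": "gene X disease Y association"}},
        {"action": "think", "content": "Two cohort studies found; check effect sizes."},
    ],
    "answer": "Current evidence suggests ... <cite id=\"s1\">Smith 2022</cite> ...",
    "citations": [{"id": "s1", "source": "paper_search", "title": "Smith 2022"}],
}
```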
RLER: rewards that evolve
RLER adapts the reward function during training along three axes; a toy reward sketch follows the list:
Instance-specific rubrics anchored in search. For each question a rubric is generated based on retrieved context.
Positive and negative rubrics that evolve. New good strategies are promoted and failure or hacking modes are penalized (for example pasting retrieved text without synthesizing).
Dynamic buffer of rubrics and auxiliary rewards for format and faithful citation.
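Here is the toy reward sketch mentioned above: it scores a response against positive and negative rubrics and adds an auxiliary citation-format bonus. The judge, rubric texts and weights are placeholders, not the paper's actual implementation:

```python
# Hypothetical RLER-style reward: score a response against an evolving rubric buffer
# plus an auxiliary format/citation check. Weights and judge are illustrative only.
import re

def judge(response: str, rubric: str) -> float:
    """Placeholder LLM judge: 1.0 if the rubric is satisfied, else 0.0."""
    return 1.0 if rubric.lower().split()[0] in response.lower() else 0.0

def rler_reward(response: str, pos_rubrics: list[str], neg_rubrics: list[str]) -> float:
    pos = sum(judge(response, r) for r in pos_rubrics) / max(len(pos_rubrics), 1)
    neg = sum(judge(response, r) for r in neg_rubrics) / max(len(neg_rubrics), 1)
    has_citations = 1.0 if re.search(r"<cite[^>]*>", response) else 0.0
    # Reward satisfied positive rubrics, penalize hacking patterns, add a format bonus.
    return pos - neg + 0.2 * has_citations

pos_rubrics = ["covers the main cohort studies", "compares effect sizes"]
neg_rubrics = ["verbatim pasted snippets without synthesis"]
print(rler_reward("Covers the cohort studies... <cite id='s1'/>", pos_rubrics, neg_rubrics))
```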
They also use a GRPO variant that supports multiple rollouts per prompt and asynchronous training. Execution lets generation and search overlap: when one trajectory makes a tool call it pauses while the others keep generating, making better use of API calls.
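That overlap is easy to picture with plain asyncio: while one trajectory awaits its tool call, the rest keep advancing. A toy scheduler, not the actual trainer:

```python
# Toy illustration of overlapping rollouts: while one trajectory awaits a tool call,
# the event loop keeps the other trajectories progressing. Not the real trainer.
import asyncio, random

async def fake_tool_call(query: str) -> str:
    await asyncio.sleep(random.uniform(0.1, 0.3))   # simulated search latency
    return f"evidence for '{query}'"

async def rollout(idx: int) -> str:
    steps = []
    for step in range(3):
        steps.append(f"think[{idx}.{step}]")          # cheap local generation
        steps.append(await fake_tool_call(f"q{idx}"))  # pauses only this trajectory
    return " -> ".join(steps)

async def main():
    # Multiple rollouts per prompt, gathered concurrently (GRPO-style grouping).
    results = await asyncio.gather(*(rollout(i) for i in range(4)))
    for r in results:
        print(r)

asyncio.run(main())
```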
Results on benchmarks and efficiency
They evaluated DR Tulu-8B (RL) on seven benchmarks (four long-form synthesis and three short-answer) and the results are solid:
ScholarQA-CSv2: 86.7 (DR Tulu-8B) versus 42.5 for WebExplorer-8B and 32.9 for WebThinker-32B-DPO.
ResearchQA and DeepResearch Bench: 71.1 and 41.8 respectively, with clear improvements over open baselines.
On ScholarQA-CSv2 DR Tulu reaches a citation precision of 90.6 and recall of 76.1, which points not only to complete answers but also to better grounding in the literature.
And the cost? A typical evaluated query spends about $0.00008 on external calls. Even with a maximum of 10 searches, the approximate cap was $0.0075 per query. By comparison, proprietary agents can reach $1 or more per query on these tasks.
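Putting only the quoted figures side by side:

```python
# Rough cost comparison using the figures quoted above.
typical_cost = 0.00008   # USD in external calls for a typical query
capped_cost = 0.0075     # USD cap with at most 10 searches
proprietary = 1.00       # USD or more per query for proprietary agents

print(f"typical query is ~{proprietary / typical_cost:,.0f}x cheaper than $1/query")
print(f"even at the cap, ~{proprietary / capped_cost:,.0f}x cheaper")
```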
Clinical case: genetics and current limits
Tested on a realistic question set about genetic variants (GeneticDiseasesQA), DR Tulu beats several open alternatives and some proprietary services in evidence synthesis and answer quality. On evidence synthesis it even competes well with GPT-5 + Search.
However, HealthBench, with its demanding medical queries, remains a challenge. DR Tulu improves on other open options, but it still falls short of expert-level clinical advice.
Practical design: reproducibility and use
They release everything: the DR Tulu-8B checkpoint, the training code, the RLER pipeline and dr-agent-lib. This lets you:
Reproduce the experiments and study how rewards and tools change behavior.
Deploy the agent with your own MCP tools and privacy policies.
Extend the library to specific domains without retraining, by describing new tools to the agent.
If you want to try it in your domain, a typical route is: adjust the MCP tool stack, curate representative questions, and evaluate with specific rubrics before running RL.
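As a starting point, that route could be captured in a small config; every name and field here is hypothetical:

```python
# Hypothetical domain-adaptation checklist expressed as a config dict.
domain_setup = {
    "mcp_tools": ["paper_search", "internal_docs_search"],  # adjust the tool stack
    "eval_questions": "data/genetics_questions.jsonl",      # curated, representative
    "rubrics": {
        "coverage": "mentions all clinically relevant variants",
        "grounding": "every claim carries a citation tag",
    },
    "run_rl": False,  # evaluate with the rubrics first; enable RLER only afterwards
}
```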
Final reflection
DR Tulu shows it’s possible to train open agents for long-form research with a good balance of quality, cost and traceability. The key is combining an SFT-guided warm start with rewards that evolve according to what the agent actually discovers.
Does this mean AI-assisted research solves everything now? No. There are still challenges in clinical domains and in ensuring ethics and robustness. But you now have an open recipe to experiment and improve, not just a proprietary black box.