Bloom: open-source tool to evaluate AI behaviors
Bloom is a toolbox for researchers who want to measure problematic behaviors in frontier AI models quickly and at scale. Why does this matter now? Because manual evaluations are slow, they go out of date, and they can even pollute future training data.
Bloom automates scenario generation and scoring so you can quantify the frequency and severity of a behavior you define. That means you get repeatable numbers instead of handfuls of hand-checked examples — and you can iterate much faster.
What Bloom is and what it's for
Bloom is an open-source, agent-based framework that turns a behavior description and a seed configuration into a full evaluation suite. Instead of relying on a fixed set of examples, Bloom generates multiple scenarios per run, measures the same behavior across all of them, and preserves reproducibility through the seed (a configuration file).
Bloom is designed to test concrete traits. In the launch example, the team evaluated four alignment-relevant behaviors (hallucination and flattery, long-term instructed sabotage, self-preservation, and self-preferential bias) across 16 models. Results come back in days, not months, and include top-level metrics (elicitation rate, mean presence) as well as exportable transcripts.
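The two top-level metrics are simple to compute once a judge has scored every transcript. Here is a minimal sketch, assuming elicitation rate is the fraction of rollouts whose presence score crosses a threshold and mean presence is the average score; both the threshold and the [0, 1] score range are my assumptions, not Bloom's definitions:

```python
from statistics import mean

def summarize_judgments(presence_scores, threshold=0.5):
    """Collapse per-rollout judge scores into suite-level metrics.

    Assumes each score lies in [0, 1]; the 0.5 threshold is illustrative,
    not Bloom's actual definition.
    """
    elicitation_rate = sum(s >= threshold for s in presence_scores) / len(presence_scores)
    return {"elicitation_rate": elicitation_rate, "mean_presence": mean(presence_scores)}

# Example: eight rollouts scored by a judge model
print(summarize_judgments([0.9, 0.1, 0.7, 0.0, 0.4, 0.8, 0.2, 0.6]))
```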
How it works (4-stage architecture)
Bloom runs four automated agents that turn your input into an evaluation suite (sketched in code after this list):
Understanding: analyzes your behavior description and examples to define what to measure and why.
Ideation: generates scenarios designed to trigger the target behavior. Each scenario includes situation, simulated user, system prompt, and interaction context.
Rollout: executes scenarios in parallel; an agent simulates both the user and any tools to elicit the target model's response in multi-turn conversations.
Judgment: a judge model scores each transcript for presence of the behavior and secondary criteria; then a meta-judge produces suite-level analysis.
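To make the flow between stages concrete, here is a minimal sketch with placeholder functions standing in for Bloom's agents. Every name and signature is illustrative, not the framework's actual API; the real stages are driven by LLMs.

```python
from dataclasses import dataclass, field

# Stand-ins for Bloom's four agents. This sketch only shows how data flows
# from one stage to the next; all names here are illustrative.

@dataclass
class Scenario:
    situation: str
    simulated_user: str
    system_prompt: str

@dataclass
class Rollout:
    scenario: Scenario
    transcript: list = field(default_factory=list)

def understanding(behavior_description, examples):
    # Stage 1: distill what to measure and why from the seed.
    return f"measure '{behavior_description}' (informed by {len(examples)} examples)"

def ideation(understanding_summary, n_scenarios):
    # Stage 2: generate scenarios designed to trigger the target behavior.
    return [Scenario(f"situation {i} for {understanding_summary}",
                     "simulated user persona", "system prompt")
            for i in range(n_scenarios)]

def rollout(scenarios):
    # Stage 3: simulate the user (and any tools) against the target model, in parallel.
    return [Rollout(s, ["user: ...", "target model: ..."]) for s in scenarios]

def judgment(rollouts):
    # Stage 4: a judge model scores each transcript for presence of the behavior.
    return [0.0 for _ in rollouts]

summary = understanding("self-preferential bias", ["example transcript"])
scores = judgment(rollout(ideation(summary, n_scenarios=4)))
print(scores)  # a meta-judge would then produce suite-level analysis from these
```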
You can pick which models run each stage, adjust interaction length and modality (for example whether you expose tools to the model), control scenario diversity, and add secondary criteria like realism or elicitation difficulty. Bloom exports transcripts compatible with Inspect and integrates with Weights & Biases for large runs. The repo includes an example seed file to get you started.
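The example seed in the repo is the authoritative reference. Purely to illustrate the kind of information a seed carries (behavior description, examples, models per stage, run parameters), here is a hypothetical sketch in plain Python; the field names and structure are mine, not Bloom's actual schema:

```python
# Hypothetical seed, shown as a plain dict; field names are illustrative
# and do not follow Bloom's actual seed format.
seed = {
    "behavior": {
        "name": "self-preferential bias",
        "description": "The model favors its own outputs when asked to choose "
                       "among options that include something it produced.",
        "examples": [
            "Asked to rank three answers, one of which it wrote, the model "
            "picks its own despite a clearly stronger alternative.",
        ],
    },
    "models": {
        "target": "model-under-test",
        "ideation": "scenario-generation-model",
        "judge": "judge-model",
    },
    "run": {
        "num_scenarios": 50,     # scenario diversity
        "max_turns": 6,          # interaction length
        "expose_tools": False,   # modality: whether the target model sees tools
    },
    "secondary_criteria": ["realism", "elicitation difficulty"],
}
```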
Reproducibility and configurability
Bloom generates different scenarios on each run unless you use static evaluations. Reproducibility comes from sharing the seed, which defines the behavior description, the examples, and the key parameters. That lets you iterate locally on samples until the evaluation captures what you want, then sweep models at scale.
Validation: how much can you trust Bloom?
Anthropic validated Bloom with two key questions:
Does it separate models with different behavioral tendencies? Yes: when evaluating production models against intentionally misaligned 'model organisms', Bloom correctly separated the two in 9 out of 10 quirk tests; in the tenth case, manual review showed the base model exhibited the quirk at a similar rate.
How well does the automated judge align with human judgments? Comparing 40 hand-annotated transcripts against 11 judge models, Claude Opus 4.1 showed the highest Spearman correlation with human judgment (0.86), followed by Claude Sonnet 4.5 (0.75). Opus 4.1 also agrees better at the extremes, which is useful when you use thresholds to decide presence or absence of a behavior.
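If you want to run the same kind of check on your own judge, the comparison itself is a one-liner once you have paired scores. A minimal sketch, assuming one judge score and one human score per hand-annotated transcript (the numbers below are toy values):

```python
from scipy.stats import spearmanr

# One presence score per hand-annotated transcript (toy values).
human_scores = [0.0, 0.2, 0.9, 1.0, 0.4, 0.7, 0.1, 0.8]
judge_scores = [0.1, 0.3, 0.8, 0.9, 0.5, 0.6, 0.0, 0.9]

rho, p_value = spearmanr(human_scores, judge_scores)
print(f"Spearman correlation, judge vs. human: {rho:.2f} (p = {p_value:.3f})")
```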
These metrics aren't an absolute guarantee, but they indicate Bloom can replicate human judgments in many cases and distinguish models with real behavioral differences.
Case study: self-preferential bias
In the self-preferential bias study (a model's tendency to favor itself when making choices), Bloom reproduced the model ranking from Claude Sonnet 4.5's system card: Sonnet 4.5 showed the least bias among the models tested. But Bloom also let researchers go further: raising the evaluated model's reasoning effort reduced the bias, with the biggest drop between medium and high effort.
Interesting detail: the reduction didn't come from Sonnet randomly picking other models. Instead, it started to recognize the conflict of interest and declined to evaluate its own option.
They also found that filtering rollouts with unwanted traits (for example, lack of realism or awareness of the evaluation) improves both elicitation rate and judgment quality. And while absolute metrics change with configuration (number of examples, conversation length, judge effort), relative model rankings tend to remain stable.
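A minimal sketch of that filtering step, assuming each rollout record carries judge-assigned flags for the secondary criteria; the field names and the 0.5 threshold are illustrative, not Bloom's:

```python
from statistics import mean

# Toy rollout records with judge-assigned secondary criteria (illustrative fields).
rollouts = [
    {"presence": 0.8, "realistic": True,  "eval_aware": False},
    {"presence": 0.1, "realistic": False, "eval_aware": False},  # unrealistic: drop
    {"presence": 0.9, "realistic": True,  "eval_aware": True},   # noticed the eval: drop
    {"presence": 0.6, "realistic": True,  "eval_aware": False},
]

kept = [r for r in rollouts if r["realistic"] and not r["eval_aware"]]
elicitation_rate = sum(r["presence"] >= 0.5 for r in kept) / len(kept)
mean_presence = mean(r["presence"] for r in kept)
print(f"kept {len(kept)}/{len(rollouts)} rollouts, "
      f"elicitation rate {elicitation_rate:.2f}, mean presence {mean_presence:.2f}")
```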
How to get started and best practices
Clone the repo and review the example seed to understand the structure.
Iterate locally: generate samples, review the scenarios, and tweak the behavior description until the samples reflect what you want to measure (see the sketch after this list).
Choose models for each stage carefully: the right judge is crucial for correlating with humans.
Control tool exposure and interaction length if you want to test behaviors that only appear with tool access or long dialogues.
Document and share the seed when you publish metrics so others can reproduce your measurements.
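The repo and its docs are the source of truth for how Bloom is actually invoked. Purely as a shape for the iterate-then-sweep workflow above, here is a hypothetical sketch in which generate_samples and run_suite are placeholder helpers, not Bloom's API:

```python
# Hypothetical workflow; generate_samples and run_suite are placeholders for
# whatever entry points the Bloom repo actually exposes.

def generate_samples(seed, n):
    """Placeholder: produce a small batch of scenarios from a seed."""
    return [f"scenario {i} probing {seed['behavior']}" for i in range(n)]

def run_suite(seed, models):
    """Placeholder: run the full pipeline per model and return elicitation rates."""
    return {m: 0.0 for m in models}

seed = {"behavior": "self-preferential bias", "examples": ["..."]}

# 1) Iterate locally on a handful of samples until they capture the behavior.
for scenario in generate_samples(seed, n=5):
    print(scenario)  # review by hand, then refine the behavior description and rerun

# 2) Only then sweep the models you care about at scale, and publish the seed.
print(run_suite(seed, models=["model-a", "model-b", "model-c"]))
```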
Limitations and risks to consider
Judge calibration: while some judges correlate well with humans, not all do; validate with manual annotations for critical cases.
Contamination and evolution: automated evaluations save time, but generated scenarios can eventually be exploited or reflect biases in the procedure itself.
Realism of simulations: rollouts depend on how faithfully agents simulate users and tools; biases there affect results.
Not a silver bullet: Bloom is great for measuring frequencies and comparing models, but it still needs careful experimental design and human review to support strong interpretations.
Bloom is already used to study nested jailbreak vulnerabilities, hardcoding, evaluation awareness, and traces of sabotage. If you work on alignment, it's a practical tool to speed evaluation cycles and dig into why models behave the way they do.
Thinking of evaluations as dynamic processes rather than fixed tests changes how we measure risk. Bloom proposes exactly that: generate and measure systematically, with configuration and reproducibility. Ready to try it and see what new behaviors you discover in your models?