FlexOlmo is a proposal from the Allen Institute for AI that lets data owners take part in training language models without giving up control of their data. Instead of sending their texts to a central repository, each organization trains expert modules locally and connects them to a shared model whenever it chooses. (allenai.org)
What is FlexOlmo and why does it matter?
Can you imagine contributing your database without ever having to publish it? That’s the core idea behind FlexOlmo. The proposal combines a public anchor model with several experts trained independently on closed data. These experts plug into a larger model through a mixture-of-experts architecture, letting data modules be turned on or off at inference time without retraining the whole system. (allenai.org)
This tackles real problems you probably worry about: losing control of data, not being able to remove sensitive information after training, and contributors not getting credit. FlexOlmo enables dynamic opt-in and opt-out, and proposes a way for contributors to receive attribution when their modules are used. (allenai.org)
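To make the opt-in/opt-out idea concrete, here is a minimal sketch of a mixture-of-experts layer whose experts can be switched off at inference time. It is not FlexOlmo's implementation; the class name `ToggleableMoE`, the `active` flags, and all dimensions are illustrative assumptions.

```python
# Minimal sketch (assumption, not FlexOlmo's code): a MoE layer where each
# expert can be deactivated at inference time without retraining anything.
import torch
import torch.nn as nn

class ToggleableMoE(nn.Module):
    def __init__(self, d_model: int, n_experts: int):
        super().__init__()
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, 4 * d_model),
                          nn.GELU(),
                          nn.Linear(4 * d_model, d_model))
            for _ in range(n_experts)
        )
        self.router = nn.Linear(d_model, n_experts)
        # One flag per expert: data owners can opt in or out at any time.
        self.active = [True] * n_experts

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        logits = self.router(x)                                   # (batch, n_experts)
        # Mask out experts whose owners have opted out.
        mask = torch.tensor(self.active, dtype=torch.bool, device=x.device)
        logits = logits.masked_fill(~mask, float("-inf"))
        weights = torch.softmax(logits, dim=-1)
        outputs = torch.stack([e(x) for e in self.experts], dim=1)  # (batch, n_experts, d_model)
        return torch.einsum("be,bed->bd", weights, outputs)

moe = ToggleableMoE(d_model=64, n_experts=3)
x = torch.randn(2, 64)
moe.active[1] = False   # expert 1's owner withdraws: no retraining required
y = moe(x)
```

The point of the sketch is only the control flow: removing a contribution is a matter of flipping a flag, not of retraining the shared model.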
How it works in simple terms
Think of three main pieces: a public model that acts as an anchor, multiple experts trained locally by different data owners, and a router that decides which expert to call based on context. Each expert is trained alongside a frozen copy of the anchor so that all experts can coordinate later even though they weren’t trained together. The router uses domain-informed embeddings to assign queries to experts, so it doesn’t have to be trained jointly with them. (arxiv.org)
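As a rough illustration of that routing idea, the sketch below picks the expert whose domain embedding is closest to the query embedding, with no joint training. The embedding function and the domain vectors are placeholders I made up, not the paper's actual components.

```python
# Hedged sketch of domain-informed routing: choose the expert whose domain
# embedding best matches the query. Everything here is a stand-in.
import numpy as np

rng = np.random.default_rng(0)
d = 128

# Illustrative domain embeddings, e.g. the mean embedding of each owner's corpus.
domain_embeddings = {
    "public_anchor": rng.normal(size=d),
    "clinical_expert": rng.normal(size=d),
    "finance_expert": rng.normal(size=d),
}

def embed(text: str) -> np.ndarray:
    """Placeholder for a real text embedder (assumption, not part of FlexOlmo)."""
    seeded = np.random.default_rng(abs(hash(text)) % (2**32))
    return seeded.normal(size=d)

def cosine(a: np.ndarray, b: np.ndarray) -> float:
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def route(query: str) -> str:
    q = embed(query)
    scores = {name: cosine(q, emb) for name, emb in domain_embeddings.items()}
    return max(scores, key=scores.get)

print(route("Patient presents with elevated troponin levels"))
```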
What about privacy? You share the module, not the raw texts. Worried about extraction attacks? You can apply techniques like differential privacy when training an expert. The authors tested extraction attacks and found low rates in reasonable scenarios, though they urge caution and recommend complementary practices. (allenai.org)
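For readers unfamiliar with what "applying differential privacy during training" looks like in practice, here is a minimal DP-SGD-style sketch: per-example gradient clipping plus Gaussian noise. The tiny model, hyperparameters, and helper name are illustrative assumptions, not FlexOlmo's training recipe.

```python
# Minimal DP-SGD-style sketch (assumption, not FlexOlmo's code): clip each
# example's gradient, add Gaussian noise, then apply the averaged update.
import torch
import torch.nn as nn

model = nn.Linear(16, 2)          # stand-in for a locally trained expert
loss_fn = nn.CrossEntropyLoss()
clip_norm, noise_multiplier, lr = 1.0, 1.1, 0.1

def dp_sgd_step(batch_x: torch.Tensor, batch_y: torch.Tensor) -> None:
    summed = [torch.zeros_like(p) for p in model.parameters()]
    for x, y in zip(batch_x, batch_y):                 # per-example gradients
        model.zero_grad()
        loss = loss_fn(model(x.unsqueeze(0)), y.unsqueeze(0))
        loss.backward()
        grads = [p.grad.detach().clone() for p in model.parameters()]
        total_norm = torch.sqrt(sum(g.pow(2).sum() for g in grads))
        scale = torch.clamp(clip_norm / (total_norm + 1e-6), max=1.0)  # bound sensitivity
        for s, g in zip(summed, grads):
            s += g * scale
    batch_size = len(batch_x)
    with torch.no_grad():
        for p, s in zip(model.parameters(), summed):
            noise = torch.randn_like(s) * noise_multiplier * clip_norm
            p -= lr * (s + noise) / batch_size          # noisy averaged update

dp_sgd_step(torch.randn(8, 16), torch.randint(0, 2, (8,)))
```

In practice a data owner would more likely use a maintained library for this, but the clip-then-noise loop is the core mechanism the recommendation refers to.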
Key results and validation
In experiments, FlexOlmo was trained at scales of up to 37 billion parameters, showing notable gains when the public model is combined with private experts. The results report meaningful average improvements over the public model alone and advantages over previous model-merging techniques. They also show the system approaching the performance of a hypothetical model trained on all the data combined. (arxiv.org)
On data-extraction tests, the paper reports a 0.7% extraction rate in a controlled scenario that simulates moderate overfitting, while heavily overfitted cases can pose much higher risks. That’s why the authors recommend combining the architecture with measures like differential privacy when stronger protection is needed. (allenai.org)
Who benefits and when does it make sense?
FlexOlmo targets sectors where data is sensitive or hard to share. For example:
- Health care, where hospitals and labs hold valuable but regulated records. (allenai.org)
- Government and the public sector, which deal with information under legal constraints. (allenai.org)
- Finance and academia, where data value and privacy are high priorities. (allenai.org)
The architecture makes it easier for organizations with closed data to contribute to open models without giving up ownership or losing the ability to withdraw their contribution. That can speed up adoption and collaboration in regulated settings. Journalistic pieces suggest this approach could change how private material is incorporated into open AI research. (wired.com)
Limitations and risks you should consider
No solution is magic. FlexOlmo reduces the risk of raw-data disclosure, but it doesn’t eliminate it. Publishing modules isn’t the same as publishing nothing, and data extraction remains a theoretical and practical concern in extreme scenarios. (allenai.org)
Also, integrating asynchronous experts and governing who can activate which module adds operational and audit complexity. For this to work in practice you need clear rules for attribution, version control, and procedures to apply differential privacy when necessary. (allenai.org)
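As a purely illustrative sketch of what such governance might require, an organization could keep registry metadata per expert module to support attribution, versioning, and opt-out audits. None of these field names come from FlexOlmo; they are assumptions about the kind of bookkeeping involved.

```python
# Illustrative registry entry (assumption): per-module metadata for attribution,
# version control, and opt-in status that an activation check can consult.
expert_registry = {
    "clinical_expert": {
        "owner": "Example Hospital Consortium",
        "version": "1.2.0",
        "trained_with_dp": True,            # was differential privacy applied?
        "opted_in": True,                   # flip to False to withdraw the module
        "attribution_contact": "ml-governance@example.org",
        "last_audit": "2025-01-15",
    },
}

def can_activate(name: str) -> bool:
    entry = expert_registry.get(name)
    return bool(entry and entry["opted_in"])
```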
What this means for open AI
FlexOlmo opens a practical path for private data to participate in shared models without giving away total control. That can encourage collaboration between universities, companies, and public organizations while keeping more transparency and options for attribution. Does this erase ethical or legal dilemmas? No—but it does provide more flexible tools to manage them. (allenai.org)
If you want to dive deeper into the technical details, read the original paper on arXiv. (arxiv.org)
In short, FlexOlmo doesn’t promise definitive answers, but it does propose a paradigm shift: enabling real collaboration without forcing the absolute handover of data. That might sound technical, but you could feel its impact in health, government, and education projects where trust and control matter most. (allenai.org)