OpenAI publishes policies to protect teens with AI
Today, March 24, 2026, OpenAI publishes a set of prompt-based safety policies to help developers build appropriate protections for teens. They're designed to be used with the open-weight model gpt-oss-safeguard and aim to turn safety goals into operational rules that actually work in real systems.
What was published
OpenAI released safety policies structured as ready-to-use prompts for reasoning models like gpt-oss-safeguard. In practice, that means developers and product teams get clear templates for turning broad risk definitions into classifiers they can apply to user-generated content.
The first version covers concrete, high-risk areas for teens, including:
Graphic violent content
Graphic sexual content
Harmful body ideals and dangerous behaviors related to body image
Dangerous activities and viral challenges
Romantic or violent roleplay
Age-restricted goods and services
These prompts can be used for both real-time filtering and offline content analysis. They're built to fit easily into existing workflows and adapt to different use cases.
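As an illustration, one of these policy prompts could be paired with user content to form a single classification request. The sketch below is an assumption about the overall shape, not the published prompt format; the function name, layout, and labels are hypothetical:

```python
# Minimal sketch: composing a safety policy and a piece of user content
# into one classification prompt for a reasoning model such as
# gpt-oss-safeguard. Layout and labels here are illustrative only.

def build_classifier_prompt(policy_text: str, content: str) -> str:
    """Combine a safety policy and user content into one classifier prompt."""
    return (
        "You are a content-safety classifier. Apply the POLICY below to the "
        "CONTENT and decide whether it violates the policy.\n\n"
        f"POLICY:\n{policy_text}\n\n"
        f"CONTENT:\n{content}\n\n"
        "Respond with exactly one label, VIOLATES or ALLOWED, "
        "followed by a brief rationale."
    )

# Example: pair a placeholder policy with a message to be screened.
prompt = build_classifier_prompt(
    policy_text="Disallow graphic violent content for teen audiences.",
    content="A user message to be screened.",
)
```

The same builder serves both modes: call it per message for real-time filtering, or map it over a batch of stored content for offline analysis.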
Why this matters now
Releasing open-weight models democratizes innovation, but it also increases responsibility. If anyone can run powerful models, how do we make sure young people aren't exposed to specific risks? That's exactly the gap these policies aim to close: they help translate high-level safety goals into concrete, operational rules.
Many teams—even experienced ones—struggle to convert broad principles into precise rules. That leads to incomplete protections, inconsistent enforcement, or filters that are too strict and block legitimate content. These policies offer a reusable minimum floor for the whole ecosystem.
How to use it in practice
Think of an educational app with a chatbot, or a social platform with teen rooms. With these policies you can:
Implement a classifier that flags and blocks graphic content in real time.
Run periodic audits via offline analysis to spot emerging risk trends.
Integrate product-designed responses: clear warnings, referrals to help resources, parental controls.
It’s not just about pasting a prompt. The useful part is combining these policies with product design choices: user controls, teen-friendly transparency, monitoring systems, and age-tailored responses. You can also translate them, extend them to other risk areas, and adapt them to your audience's cultural context.
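Concretely, the product layer on top of a classifier verdict might look like the following sketch. The label names, category strings, and response fields are hypothetical, chosen to mirror the design choices mentioned above (warnings, help resources, parental controls); they are not part of the published policies:

```python
from dataclasses import dataclass

@dataclass
class Verdict:
    """Hypothetical classifier output: a label plus the policy area it matched."""
    label: str     # "ALLOWED" or "VIOLATES"
    category: str  # e.g. "graphic_violence", "dangerous_challenges"

def product_response(verdict: Verdict) -> dict:
    """Map a classifier verdict onto a teen-appropriate product action."""
    if verdict.label == "VIOLATES":
        return {
            "show_content": False,
            "warning": "This content isn't available.",
            # Referrals to help resources and parental-control hooks are
            # product design choices layered on top of the raw classifier.
            "help_resources": True,
            "notify_parental_controls": verdict.category == "dangerous_challenges",
        }
    return {"show_content": True}
```

The point of separating the verdict from the response is that the same policy prompt can back very different products: a learning app might block silently, while a social platform might warn and surface reporting options.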
OpenAI worked with outside organizations like Common Sense Media and everyone.AI to broaden coverage and edge cases, so this isn’t an isolated exercise: it’s collaboration between experts and the community.
"One of the biggest gaps in AI safety for teens has been the lack of clear, operational policies that developers can build from..."
Robbie Torney, Head of AI & Digital Assessments, Common Sense Media
"Efforts like this that make youth safety policies more operational are valuable because they help translate expert knowledge into guidance that can be used in real systems."
Dr. Mathilde Cerioli, Chief Scientist at everyone.AI
Limitations and recommendations
These policies are a starting point, not a complete safety guarantee. Every product has different risks and audiences: what works for a learning app might not be enough for an open social network.
Practical recommendations:
Tune and test the policies with real data from your product.
Combine them with design controls, transparency for teens, and reporting options.
Keep continuous monitoring and metrics to detect false positives and negatives.
Participate in the community: the policies are released open source through the ROOST Model Community (RMC) repository on GitHub, which accepts feedback and contributions.
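For the monitoring recommendation, false positives and false negatives can be measured against a hand-labeled audit sample. A minimal sketch, using standard precision and recall (the function and variable names are illustrative):

```python
def precision_recall(predicted: list[bool], actual: list[bool]) -> tuple[float, float]:
    """Precision and recall for a blocking classifier against human labels.

    predicted[i] is True if the classifier flagged item i;
    actual[i] is True if a human reviewer confirmed it violates policy.
    """
    tp = sum(p and a for p, a in zip(predicted, actual))       # correct blocks
    fp = sum(p and not a for p, a in zip(predicted, actual))   # false positives
    fn = sum(a and not p for p, a in zip(predicted, actual))   # false negatives
    precision = tp / (tp + fp) if (tp + fp) else 1.0
    recall = tp / (tp + fn) if (tp + fn) else 1.0
    return precision, recall

# Example audit: 4 items flagged, 3 truly violating, 1 violation missed.
p, r = precision_recall(
    predicted=[True, True, True, True, False, False],
    actual=[True, True, True, False, True, False],
)
# precision = 0.75 (one legitimate item blocked: a false positive)
# recall    = 0.75 (one violation slipped through: a false negative)
```

Low precision means the filter is too strict and blocks legitimate content; low recall means protections are incomplete. Tracking both over time on real product data is what turns the published policies from a starting point into a tuned system.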
A practical step in a necessary direction
This isn't magic or a one-size-fits-all fix. It is, however, a concrete tool so developers don't start from zero when protecting young users. If you build products that reach teenagers, these policies give you a reusable, collaborative framework to improve safety day by day.