AI is no longer just about quick answers. You're increasingly asking agents to work for hours or even days on complex projects, but limited context windows break continuity. How do you get an agent to make coherent progress when every session starts with no memory of the previous one?
The problem of long-running agents
Imagine a software project where every new shift arrives with no memory of what came before. Sounds chaotic, right? That's exactly what happens when an agent has to work across multiple context windows: each session is a discrete unit, and the next one remembers nothing of the last.
Advanced models (for example Opus 4.5 in the Claude Agent SDK) have tools like compaction to save tokens and carry relevant information forward. But compaction isn't enough: the agent might try to do everything in a single session and leave work half-done, or declare the project finished before it actually is. The result? Incomplete code, missing documentation, and sessions that burn time just reconstructing what happened before.
Anthropic proposes a harness with two key pieces: an initializer agent that prepares the environment in the first session, and a coding agent that, in each subsequent session, makes incremental progress and leaves clear artifacts for the next session.
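To make the shape of that harness concrete, here is a minimal shell sketch, where run_session is a hypothetical wrapper that launches one Agent SDK session with a role and a prompt (it is not an SDK command):
# One initializer session, then coding sessions until every feature passes.
run_session initializer "Set up the repo: init.sh, feature_list.json, claude-progress.txt, first commit."
while jq -e 'any(.[]; .passes == false)' feature_list.json > /dev/null; do
  run_session coder "Read claude-progress.txt, feature_list.json and the git log. Pick ONE failing feature, implement it, verify it end-to-end, commit, and update claude-progress.txt."
done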
Initializer agent
Creates the initial repo with an init.sh to start the server.
Generates a claude-progress.txt file that records what each session did.
Produces a feature_list.json with an extensive list of end-to-end requirements (in their demo, more than 200 features for a claude.ai clone).
Makes the first git commit to leave an initial history.
The idea is to keep enough context outside the model (in files and commits) that any new session can quickly understand how far the work has progressed.
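As a rough illustration (the commands and file contents are placeholders, not Anthropic's actual setup), the artifacts the initializer leaves behind could look like this:
git init
cat > init.sh << 'EOF'
#!/usr/bin/env bash
# Start the app and run a basic health check (commands are illustrative).
npm install
npm run dev &
sleep 5
curl -sf http://localhost:3000 > /dev/null && echo "server is up"
EOF
chmod +x init.sh
echo "Session 1: scaffolded the project, wrote init.sh and feature_list.json" > claude-progress.txt
# feature_list.json itself is written by the model: 200+ end-to-end requirements.
git add -A
git commit -m "Initial scaffold: init.sh, feature_list.json, claude-progress.txt"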
Coding agent
At the start of each session it runs basic commands like pwd, reads claude-progress.txt, feature_list.json and the git log.
Picks a single high-priority feature that still fails.
Implements that feature, tests it end-to-end (ideally with browser automation), makes a commit with a descriptive message and writes an update in claude-progress.txt.
This approach avoids the "one-shot" attempt and forces small, verifiable increments that make recovery from errors much easier.
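In shell, that session-start ritual might look like this sketch, using jq to pull the first failing feature (the exact commands the agent runs will vary):
pwd
cat claude-progress.txt
git log --oneline -20
./init.sh
# Select the first feature whose passes flag is still false.
jq -c '[.[] | select(.passes == false)][0]' feature_list.json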
Environment management and key files
Anthropic found that certain files reduce ambiguity between sessions:
init.sh: script to start the server and run basic checks.
feature_list.json: list of features as JSON objects that include steps and a passes (boolean) field.
claude-progress.txt: a log of changes and decisions, readable by humans and by the next session.
Git history: lets you revert to stable states.
Example entry in feature_list.json (simplified):
{ "category": "functional", "description": "New chat button creates a fresh conversation", "steps": ["Navigate to main interface", "Click the 'New Chat' button", "Verify a new conversation is created"], "passes": false }
Anthropic recommends not allowing agents to delete or rewrite the tests; they should only change passes to true after rigorous verification.
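One way to enforce that rule mechanically (a sketch, assuming feature_list.json is a JSON array of entries like the one above; Anthropic doesn't describe this particular check) is to verify that, apart from the passes flags, the committed list and the working copy are identical:
# Any difference beyond the passes field means the tests were rewritten.
git show HEAD:feature_list.json | jq 'map(del(.passes))' > /tmp/features_before.json
jq 'map(del(.passes))' feature_list.json > /tmp/features_after.json
diff -q /tmp/features_before.json /tmp/features_after.json \
  && echo "OK: only passes flags changed" \
  || echo "WARNING: feature definitions were modified"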
Testing and verification
A common failure mode was marking features as complete without end-to-end verification. The fix was to give the agent testing tools (for example the Puppeteer MCP server) and explicitly instruct it to test like a user: open the app, create a chat, send a message, and verify the response.
Limitations: the model's vision and automation tools aren't perfect. For example, Puppeteer might not expose native browser modals, and those cases remain fragile.
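Purely as an illustration (the real verification goes through the browser, and the /api/chats endpoint below is hypothetical), the create-a-chat, send-a-message, check-the-response loop can be approximated with a plain HTTP smoke test:
# Hypothetical endpoints; the point is verifying behavior end-to-end, not just that code compiles.
chat_id=$(curl -sf -X POST http://localhost:3000/api/chats | jq -r '.id')
curl -sf -X POST "http://localhost:3000/api/chats/$chat_id/messages" \
  -H 'Content-Type: application/json' \
  -d '{"content": "Hello"}' | jq -e '.reply | length > 0' > /dev/null \
  && echo "chat round-trip OK"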
Typical session flow
[Assistant] I'll catch up with the project state.
[Tool Use] bash - pwd
[Tool Use] read - claude-progress.txt
[Tool Use] read - feature_list.json
[Tool Use] bash - git log --oneline -20
[Tool Use] bash - ./init.sh (verify the server is running)
[Assistant] Run basic functionality tests.
[Tool Use] Start implementing the selected feature.
[Assistant] Commit and write the update in claude-progress.txt.
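Sketched in shell (the feature name, session number and messages reuse the earlier example and are illustrative), that closing step amounts to:
# Flip the verified feature's passes flag, then leave artifacts for the next session.
jq '(.[] | select(.description == "New chat button creates a fresh conversation") | .passes) = true' \
  feature_list.json > feature_list.tmp && mv feature_list.tmp feature_list.json
echo "Session 7: implemented and verified the New Chat button end-to-end" >> claude-progress.txt
git add -A
git commit -m "Add New Chat button; verified end-to-end, passes=true"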
This pattern cuts wasted tokens and prioritizes restoring a stable state before adding new code.
Failure modes and countermeasures
Problem: the agent declares the project finished too soon.
Initializer: generates feature_list.json with end-to-end requirements.
Coding agent: reads the list and works on a single feature.
Problem: messy or buggy code at the end of a session.
Initializer: leaves git history and progress notes.
Coding agent: runs tests, makes descriptive commits and updates claude-progress.txt.
Problem: marking features as done without tests.
Initializer: structures the feature list.
Coding agent: self-verifies and only sets passes=true after passing tests.
Problem: the agent doesn't know how to run the app.
Initializer: writes init.sh.
Coding agent: runs it at the start.
Future and open questions
It remains to be seen whether a single generalist agent beats a specialized multi-agent architecture (a testing agent, a QA agent, a cleanup agent). It also remains to be validated whether these practices generalize beyond full-stack web apps, for example to scientific research or financial modeling.
The practical lesson is clear: to make AI autonomy work over long horizons, you have to move part of the memory to disk and to version control. Sounds obvious? Maybe. Does it work? In Anthropic's experiments, yes: it improves coherence, reduces wasted time and makes recovering from errors easier.