In many Rails monoliths, development prioritizes new features and tests get left for later. The result? Untested code, hard-to-reproduce bugs, and hours lost debugging.
Mistral introduced an autonomous agent that closes that gap: it reads files in a Rails project, generates or improves RSpec specs, validates style and coverage, and runs everything inside the CI/CD pipeline without human intervention. Sounds like magic? It's applied engineering: it automates repetitive steps and forces tests to actually run.
What problem it solves
Can you imagine a codebase where half the files never had tests? That was the starting point. Teams that write features but skip tests create growing technical debt. The agent acts on concrete files (models, controllers, serializers, mailers, helpers) and takes care of generating or improving the associated specs.
Ruby is dynamic: there is no compilation step that catches errors. That complicates the agent's job: the only reliable way to verify a spec's syntax and validity is to run it. That's why Mistral runs the tests as part of the agent's workflow.
How the agent works (practical summary)
- Reads the source file and any available documentation.
- Looks for an associated spec; the naming convention between `app/...` and `spec/...` makes this easier.
- Picks a skill specialized by file type (for example, controllers have different rules than models).
- Reuses or creates `factories` when needed, carefully, to avoid breaking other tests.
- Runs linters and coverage tools: `RuboCop` for style and `SimpleCov` together with RSpec to ensure tests run and cover the important lines.
- Self-reviews: it counts public methods and asks "did I test all public methods?" before finishing.
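The `app/...` to `spec/...` convention in the second step can be sketched as a tiny helper (hypothetical code, not Mistral's):

```ruby
# Hypothetical helper illustrating the Rails naming convention the agent
# relies on to locate (or create) a file's associated spec:
#   app/foo/bar.rb -> spec/foo/bar_spec.rb
# (Controllers are a special case in this workflow: their specs live under
# spec/requests, per the controller skill.)
def spec_path_for(source_path)
  source_path
    .sub(%r{\Aapp/}, "spec/")
    .sub(/\.rb\z/, "_spec.rb")
end

spec_path_for("app/models/user.rb") # => "spec/models/user_spec.rb"
```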
What if a test fails when executed? The agent rewrites and fixes it until the test passes or the quality criteria are met.
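That fix-until-green loop can be sketched like this (all names and the retry budget are hypothetical; the assumption is simply that the agent runs the spec, reads the failure output, and asks the model to rewrite):

```ruby
# Sketch of the agent's repair loop (hypothetical interface): run the spec,
# and on failure hand the output back to the model for a rewrite, up to a
# fixed retry budget.
MAX_ATTEMPTS = 3

def fix_until_green(spec_path, runner:, rewriter:)
  MAX_ATTEMPTS.times do
    result = runner.call(spec_path)            # e.g. runs `bundle exec rspec <path>`
    return true if result[:passed]
    rewriter.call(spec_path, result[:output])  # model rewrites the failing spec
  end
  false # retry budget exhausted; a human takes over
end
```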
Vibe: the platform where the agent runs
They built this flow on Vibe, Mistral's open source coding assistant. Instead of a generic prompt, they used three levers:
- repository-level context (an `AGENTS.md` file with an execution plan),
- skills specialized by file type,
- and custom tools that can execute commands and read results.
The `AGENTS.md` included a step-by-step list with concrete success criteria and RSpec style rules (for example, avoid vague matchers like `be_truthy` and prefer `eq(exact_value)`). That single file raised the agent's quality score from 0.68 to 0.74.
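The difference between the vague and the precise matcher is easy to see in plain Ruby terms (the buggy value below is invented for illustration):

```ruby
# Vague:   expect(user.full_name).to be_truthy       -- passes for ANY truthy value
# Precise: expect(user.full_name).to eq("Ada Lovelace")

wrong_value = "Ada" # imagine a buggy full_name that drops the last name
!!wrong_value                 # => true  (be_truthy would pass and hide the bug)
wrong_value == "Ada Lovelace" # => false (eq would fail and catch it)
```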
Skills and rules for each file type
A skill is a recipe for how to write tests for a file type. For example, the controller skill specifies:
- where to create the spec (`spec/requests/..._spec.rb`),
- rules: assert JSON content, verify state changes with `change { Model.count }.by(1)`, and test both authorized and unauthorized routes,
- actions to cover: index, show, create, update, destroy, and custom actions.
Separating instructions by type prevents a single mediocre prompt from guessing how to test everything.
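One way to picture a skill is as an entry in a per-type registry (a hypothetical sketch: the controller rules come from the text, the model rules are invented for illustration):

```ruby
# Hypothetical skill registry: each file type gets its own spec location
# and rule set, instead of one generic prompt guessing how to test everything.
SKILLS = {
  controller: {
    spec_dir: "spec/requests",
    rules: [
      "assert JSON content",
      "verify state changes with change { Model.count }.by(1)",
      "test authorized and unauthorized routes"
    ]
  },
  model: {
    spec_dir: "spec/models",
    rules: ["test every public method", "cover validations and edge cases"] # invented
  }
}.freeze

def skill_for(source_path)
  source_path.include?("/controllers/") ? SKILLS[:controller] : SKILLS[:model]
end
```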
Key tools: RuboCop and SimpleCov (and why to run them)
Two practical decisions made the difference:
- use RuboCop to detect and fix style violations, and
- use SimpleCov integrated with RSpec to force each test to run and measure coverage per file.
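A standard SimpleCov setup looks like this (the `"rails"` profile and the threshold are illustrative values from the gem's documented configuration pattern, not the experiment's exact settings):

```ruby
# spec/spec_helper.rb -- SimpleCov must start BEFORE application code loads,
# or the loaded files won't be tracked.
require "simplecov"
SimpleCov.start "rails" do
  minimum_coverage 90 # illustrative threshold: fail the run below 90% coverage
end
```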
Without running tests, a spec can look perfect on paper but fail to run due to syntax errors, missing dependencies, or nonexistent factories. By running everything, the agent could self-correct: only 1/3 of tests passed on the first attempt; after automatic iterations they reached 100% passing tests in the experiment.
Running the tests is not a luxury: it's what separates pretty tests on paper from tests that actually protect the code.
LLM-as-a-judge: evaluating quality beyond metrics
Besides classic metrics (percentage of passing tests, RuboCop violations, SimpleCov coverage), they used an approach called LLM-as-a-judge: ask a model to score a spec between 0 and 1 based on clear rules (does it cover errors? does it use precise assertions? does it test edge cases?).
This helps capture qualitative quality, but has limits: the evaluation isn't 100% deterministic and can fail if a test has good intent but doesn't run due to a syntax error.
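A rubric like the one described can be aggregated very simply (a sketch: the three criteria come from the text, the equal weighting is an assumption):

```ruby
# Hypothetical judge rubric: each yes/no criterion contributes equally
# to the 0..1 score.
CRITERIA = %i[covers_errors precise_assertions tests_edge_cases].freeze

def judge_score(answers)
  CRITERIA.count { |criterion| answers[criterion] }.fdiv(CRITERIA.size).round(2)
end

judge_score(covers_errors: true, precise_assertions: true, tests_edge_cases: false)
# => 0.67
```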
The missing parenthesis problem (a useful example)
```ruby
RSpec.describe User, "#full_name" do
  it "combines first and last name" do
    user = User.new(first_name: "Ada", last_name: "Lovelace" # missing closing parenthesis
    expect(user.full_name).to eq("Ada Lovelace")
  end
end
```
This spec is well written in intention and assertion, but it will never run because of a missing parenthesis. An LLM can score it high if it only evaluates intent, which is why actual execution with SimpleCov was key to detect and fix these failures.
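This failure mode also suggests a cheap pre-flight check; here is a sketch using Ruby's standard-library `Ripper` (my illustration, not Mistral's tooling):

```ruby
require "ripper"

# Ripper.sexp returns nil when the source cannot be parsed, so it catches
# the missing parenthesis without loading RSpec or the Rails app at all.
def syntactically_valid?(source)
  !Ripper.sexp(source).nil?
end

syntactically_valid?('User.new(first_name: "Ada")') # => true
syntactically_valid?('User.new(first_name: "Ada"')  # => false (missing paren)
```

A syntax check is only a first filter, of course: a spec can parse cleanly and still fail, which is why actually executing it remains the ground truth.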
Experiment results
Metrics from the experiment on a repository with 275 files:
- Files processed: 275
- Tests passing: 100%
- Average coverage per file (SimpleCov): 100%
- RuboCop violations after fixes: 0
- LLM-as-a-judge score (average): 0.74
Breakdown by file type (LLM-as-a-judge):
- Models: 0.81
- Controllers: 0.67
- Serializers: 0.80
Models were easier because their logic is self-contained; controllers require handling HTTP, authentication, and more dependencies.
Limitations and practical warnings
- It's not a magic solution: you must maintain the `factories` and testing dependencies.
- Changes in factories can break tests across the suite: the agent modifies factories cautiously.
- LLM evaluations are noisy; aggregating at scale mitigates variability.
- Incompatible gem versions or syntax can cause errors that require human intervention in complex cases.
Why this matters for you
If you work on a large Rails codebase, this means less repetitive manual work and less accumulated technical debt. Want tests to exist and actually serve a purpose? Make them run, measure them, and automate feedback. That's the simple principle that turned an experiment into a practical tool.
