Kaggle Benchmarks now lets you design and test AI evaluations directly from your local environment, without relying solely on the web notebook editor. Sounds like a small change? It has real impact: it speeds up the idea-to-proof cycle and puts tools in the hands of the people who use models every day.
What changes with local development
Until now, creating evaluation tasks on Kaggle meant working in its web notebook editor. Great if you live there, but awkward if your workflow is VSCode, Cursor, Antigravity, or code agents.
With the new local development feature you can create, validate, push, run, and download tasks from your machine. In short, the infrastructure adapts to your stack, not the other way around. Less friction, more experimentation.
How it works in simple terms
The integration uses the kaggle-benchmarks SDK and new commands in the Kaggle CLI. You don’t need to be an expert: you can write the evaluation in natural language and let a coding agent transform it into a task ready for Kaggle.
