Mistral AI has released Mistral OCR 3, a new version of its OCR model that promises to change how businesses and developers turn documents into usable data. Why does this matter? Because accurate text extraction alone is no longer enough; structure, complex tables, and handwriting count too.
What is Mistral OCR 3
Mistral OCR 3 is a model designed to extract text and embedded images from a wide variety of documents with high fidelity. It can produce output in markdown and rebuild tables using HTML tags with colspan and rowspan, which makes it easier for downstream systems to interpret not just the content but the document's structure.
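To illustrate why HTML tables with colspan and rowspan help downstream systems, here is a minimal sketch that expands such a table into a rectangular grid using only Python's standard library. The class name `TableGridParser`, the helper `table_to_grid`, and the sample table are hypothetical and not part of any Mistral tooling; the sketch assumes a single, well-formed table with explicit closing tags.

```python
from html.parser import HTMLParser


class TableGridParser(HTMLParser):
    """Expand one HTML table into a rectangular grid, repeating spanned cells."""

    def __init__(self):
        super().__init__()
        self.grid = []       # list of rows, each a list of cell strings
        self._pending = {}   # (row, col) -> text reserved by an earlier rowspan
        self._row = -1
        self._col = 0
        self._cell = None    # (text parts, colspan, rowspan) while inside a cell

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "tr":
            self._row += 1
            self._col = 0
            self.grid.append([])
        elif tag in ("td", "th"):
            colspan = int(attrs.get("colspan", 1))
            rowspan = int(attrs.get("rowspan", 1))
            self._cell = ([], colspan, rowspan)

    def handle_data(self, data):
        # Only collect text that appears inside a cell.
        if self._cell is not None:
            self._cell[0].append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            parts, colspan, rowspan = self._cell
            text = "".join(parts).strip()
            self._fill_pending()
            for dr in range(rowspan):
                for dc in range(colspan):
                    if dr == 0:
                        self.grid[self._row].append(text)
                    else:
                        # Reserve the slot in a later row for this spanned cell.
                        self._pending[(self._row + dr, self._col + dc)] = text
            self._col += colspan
            self._cell = None
        elif tag == "tr":
            self._fill_pending()

    def _fill_pending(self):
        # Emit cells reserved by rowspans from earlier rows at the current column.
        while (self._row, self._col) in self._pending:
            self.grid[self._row].append(self._pending.pop((self._row, self._col)))
            self._col += 1


def table_to_grid(html):
    parser = TableGridParser()
    parser.feed(html)
    parser.close()
    return parser.grid


# Hypothetical sample resembling a table with merged header cells.
SAMPLE = """
<table>
<tr><th rowspan="2">Region</th><th colspan="2">Revenue</th></tr>
<tr><th>Q1</th><th>Q2</th></tr>
<tr><td>EMEA</td><td>10</td><td>12</td></tr>
</table>
"""

grid = table_to_grid(SAMPLE)
```

Repeating the spanned text in every covered slot is one design choice among several; a consumer that needs to distinguish merged cells from repeated values would keep the span metadata instead.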
The model is available as mistral-ocr-2512 and can be integrated via API. Mistral AI also offers the Document AI Playground, a drag-and-drop interface that converts PDFs and images into clean text or structured JSON.
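As a rough sketch of what an API integration might look like, the snippet below builds an HTTP request with Python's standard library. Only the model name mistral-ocr-2512 comes from the article. The endpoint URL, the payload shape, the bearer-token header, the `MISTRAL_API_KEY` environment variable, and the helper names are assumptions that should be checked against Mistral's official API documentation before use.

```python
import json
import os
import urllib.request

# Assumed endpoint; verify against the official API reference.
OCR_ENDPOINT = "https://api.mistral.ai/v1/ocr"
# Model name as given in the article.
MODEL = "mistral-ocr-2512"


def build_ocr_request(document_url, api_key):
    """Build (but do not send) a POST request for a document hosted at a URL."""
    # Assumed payload shape: a model name plus a document reference.
    payload = {
        "model": MODEL,
        "document": {"type": "document_url", "document_url": document_url},
    }
    return urllib.request.Request(
        OCR_ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def run_ocr(document_url):
    """Send the request and return the decoded JSON response."""
    # Assumes the API key is stored in the MISTRAL_API_KEY environment variable.
    request = build_ocr_request(document_url, os.environ["MISTRAL_API_KEY"])
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling `run_ocr("https://example.com/sample.pdf")` with a valid key would return the parsed response. Its exact structure is not described in the article, so inspect it before relying on any particular field.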
Performance and benchmarks
Mistral reports a significant leap over its previous generation. In its internal tests, the company claims a 74% overall win rate against Mistral OCR 2 across forms, scanned documents, complex tables, and handwriting. To measure this, it used internal benchmarks that reflect real business cases, scored with a fuzzy-match metric against ground-truth data.
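Mistral has not published the exact metric, so the following is only an illustration of how a fuzzy-match win rate could be computed. It uses `difflib.SequenceMatcher` from Python's standard library as a stand-in similarity measure. The function names, the tie-handling rule, and the sample strings are all hypothetical.

```python
from difflib import SequenceMatcher


def fuzzy_score(predicted, truth):
    """Similarity in [0, 1] between OCR output and ground truth."""
    return SequenceMatcher(None, predicted, truth).ratio()


def win_rate(samples):
    """Share of decided comparisons won by the new model.

    samples: iterable of (truth, new_output, old_output) strings.
    Ties are excluded from the denominator (an assumption of this sketch).
    """
    wins = losses = 0
    for truth, new_output, old_output in samples:
        new_score = fuzzy_score(new_output, truth)
        old_score = fuzzy_score(old_output, truth)
        if new_score > old_score:
            wins += 1
        elif new_score < old_score:
            losses += 1
    decided = wins + losses
    return wins / decided if decided else 0.0


# Made-up examples: one win, one tie, one loss for the "new" output.
samples = [
    ("Invoice 1042", "Invoice 1042", "lnvoice 1O42"),
    ("Total: 99.50", "Total: 99.50", "Total: 99.50"),
    ("Date 2024", "Dote 2O24", "Date 2024"),
]
rate = win_rate(samples)
```

How ties are treated changes the headline number noticeably, which is one reason a reported win rate is hard to interpret without the benchmark definition.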
