Transformers.js in Chrome Extensions under Manifest V3

While you were rebuilding the architecture of the Gemma 4 browser assistant, you probably asked yourself: where do I run the model, how do I manage state, and what happens if the service worker is suspended? This technical guide explains the practical recipe to run local inference with Transformers.js inside a Chrome extension under Manifest V3, using the published extension as a deployment map.

Arquitectura general

The division of responsibilities is the project's backbone. In public/manifest.json three clear entry points are defined:

background.service_worker -> compiled file background.js (control and models)
side_panel.default_path -> sidebar.html (persistent chat UI)
content_scripts[] -> content.js (bridge with the webpage)

The design rule: keep heavy orchestration in the background and keep the UI and content scripts lightweight. What do you gain? A single model instance per extension, lower memory usage and security limits respected.

Arquitectura general

¿Quién hace qué?

Mensajería y contratos entre runtimes

Modelos, pipelines y ejecución en Transformers.js

Herramientas y llamadas de función desde el modelo

Ciclo de vida de modelos y resiliencia MV3

Estado y almacenamiento local

Permisos y privacidad

Build y despliegue

Patrones y variaciones prácticas

Recomendaciones finales para desarrolladores

Fuente original

Stay up to date!

Transformers.js in Chrome Extensions under Manifest V3