
DiffusionGemma speeds up text generation 4x
DiffusionGemma is an experimental 26B MoE model that generates blocks of text in parallel, achieving up to 4x speed on dedicated GPUs. It's ideal for local, interactive flows and low-latency use cases but trades off some quality versus Gemma 4 in exchange for speed.
