Gemini Diffusion

Published on May 30, 2025

First impressions

I just got access to Gemini Diffusion. The speed is mind-blowing. From what I can tell, the model diffuses blocks of text sequentially, about two paragraphs per block, and each finished block is fed back in as context for the next one.
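
A toy sketch of that block-by-block loop, purely for illustration. It is a guess at the control flow based on the observation above, not Gemini Diffusion's actual implementation: the names `denoise_block` and `generate`, the mask symbol, the step counts, and the fake "tokens" are all hypothetical, and a real model would predict tokens with a neural network rather than a formula.

```python
import random

MASK = "_"  # hypothetical placeholder for a not-yet-denoised position


def denoise_block(context, block_len, steps, rng):
    """Toy stand-in for a diffusion denoiser.

    Starts from a fully masked block and fills in a few positions per step,
    in random order, instead of left to right.
    """
    block = [MASK] * block_len
    masked = list(range(block_len))
    rng.shuffle(masked)
    per_step = -(-block_len // steps)  # ceiling division
    for _ in range(steps):
        for pos in masked[:per_step]:
            # Fake "prediction": the token only encodes its absolute position.
            block[pos] = f"t{len(context) + pos}"
        masked = masked[per_step:]
    return block


def generate(prompt_tokens, num_blocks, block_len=8, steps=4, seed=0):
    """Generate blocks one after another, feeding each back in as context."""
    rng = random.Random(seed)
    context = list(prompt_tokens)
    for _ in range(num_blocks):
        block = denoise_block(context, block_len, steps, rng)
        context.extend(block)  # the finished block becomes context for the next
    return context


if __name__ == "__main__":
    print(generate(["p0", "p1"], num_blocks=3, block_len=4, steps=2))
```

The point of the sketch is the shape of the loop: positions inside a block are filled in parallel over a few refinement steps, while the blocks themselves are still produced in sequence. If that picture is right, it would fit with both the speed and the occasional repetition between neighboring paragraphs.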

I tried it on a handful of tasks:

Formatting can be a bit messy, and you sometimes get repetition between neighboring paragraphs. It also struggles to surface some of the relevant nuggets of information when the user prompt is huge (e.g. the car rental agreement). Still, overall performance is quite impressive.

Riddle and Chain-of-Thought

I also tried to get the model to solve a riddle. It got it wrong.

[Screenshot: Gemini Diffusion showing the riddle and the wrong answer]

My next thought was to try Chain-of-Thought (CoT) prompting. Two things happened:

[Screenshot: Gemini Diffusion showing the riddle prompt]

[Screenshot: Gemini Diffusion showing the riddle reply]

Questions and thoughts