May 6, 2026
AI/MLDFlash: Pushing Gemma 4 to Warp Speed
When Google shipped native multi-token prediction in Gemma 4, the community celebrated. Then DFlash dropped — achieving up to 6x lossless acceleration with block diffusion speculative decoding. Here's how it works and why it matters.