on Mar 10, 2025 i took...

LLaDA—Diffusion LLM at first glance

On Feb 27, @InceptionAILabs introduce Mecury to the world, and said it's diffusion large language models.

"We are excited to introduce Mercury, the first commercial-grade diffusion large language model (dLLM)! dLLMs push the frontier of intelligence and speed with parallel, coarse-to-fine text generation." - @InceptionAILabs

What is Diffusion?

Similar with generate image in Stable Diffusion, the model use a parallel method of transforming noise into unmasked, by progressively refining random noise into coherent sequence.

What's the difference?

References

Footnotes

  1. See "Table 3. Comparison in the Poem Completion Task." - https://arxiv.org/pdf/2502.09992