Hacker News new | past | comments | ask | show | jobs | submit

Diffusion Forcing: Next-Token Prediction Meets Full-Sequence Diffusion

https://boyuan.space/diffusion-forcing/
loading story #40873818
loading story #40873378
I work in the field and the work is presented in an extremely obtuse manner.

What is the problem you're trying to solve? Are you proposing a new generative model?

loading story #40881679
loading story #40875826
loading story #40872702
Cool work! Curious if this can be applied back in LLMs as a discrete diffusion model with partial masking.
Very cool, but why is it called diffusion forcing?
loading story #40879514