Diffusion Forcing: Next-Token Prediction Meets Full-Sequence Diffusion
https://boyuan.space/diffusion-forcing/loading story #40873818
loading story #40873378
I work in the field and the work is presented in an extremely obtuse manner.
What is the problem you're trying to solve? Are you proposing a new generative model?
loading story #40881679
loading story #40875826
loading story #40872702
Cool work! Curious if this can be applied back in LLMs as a discrete diffusion model with partial masking.