News Score: Score the News, Sort the News, Rewrite the Headlines

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

View PDF Abstract:Diffusion language models offer unique benefits over autoregressive models due to their potential for parallelized generation and controllability, yet they lag in likelihood modeling and are limited to fixed-length generation. In this work, we introduce a class of block diffusion language models that interpolate between discrete denoising diffusion and autoregressive models. Block diffusion overcomes key limitations of both approaches by supporting flexible-length generation an...

Read more at arxiv.org

© News Score  score the news, sort the news, rewrite the headlines