News Score: Score the News, Sort the News, Rewrite the Headlines

Diffusion models are interesting

I stumbled across this tweet a week or so back where this company called Inception Labs released a Diffusion LLM (dLLM). Instead of being autoregressive and predicting tokens left to right, here you start all at once and then gradually come up with sensible words simultaneously (start/finish/middle etc. all at once). Something which worked historically for image and video models is now outperforming similar-sized LLMs in code generation.The company also claims 5-10x improvement across speed and ...

Read more at rnikhil.com

© News Score  score the news, sort the news, rewrite the headlines