News Score: Score the News, Sort the News, Rewrite the Headlines

Mercury: Ultra-Fast Language Models Based on Diffusion

Authors:Inception Labs, Samar Khanna, Siddhant Kharbanda, Shufan Li, Harshit Varma, Eric Wang, Sawyer Birnbaum, Ziyang Luo, Yanis Miraoui, Akash Palrecha, Stefano Ermon, Aditya Grover, Volodymyr Kuleshov View PDF HTML (experimental) Abstract:We present Mercury, a new generation of commercial-scale large language models (LLMs) based on diffusion. These models are parameterized via the Transformer architecture and trained to predict multiple tokens in parallel. In this report, we detail Mercury Co...

Read more at arxiv.org

© News Score  score the news, sort the news, rewrite the headlines