DeepSeek-R1: New Open-Weights AI Model Excels at Reasoning, Uses Novel Training Method to Generate Long Chains of Thought

The Illustrated DeepSeek-R1

[Draft post, updates to come. Please let me know if you have any suggestions or feedback here or on Bluesky or X/Twitter.]

DeepSeek-R1 is the latest resounding beat in the steady drumroll of AI progress. For the ML R&D community, it is a major release for reasons including: it is an open-weights model with smaller, distilled versions, and it shares and reflects upon a training method to reproduce a reasoning model like OpenAI O1. In this post, we’ll see how it was built.

Contents: Recap: How LLMs are...

Read more at newsletter.languagemodels.co
