Open-R1: a fully open reproduction of DeepSeek-R1
What is DeepSeek-R1?
How did they do it?
Open-R1: the missing pieces
What is DeepSeek-R1?
If you’ve ever struggled with a tough math problem, you know how useful it is to think a little longer and work through it carefully. OpenAI’s o1 model showed that when LLMs are trained to do the same—by using more compute during inference—they get significantly better at solving reasoning tasks like mathematics, coding, and logic.
However, the recipe behind OpenAI’s reasoning models has be...
Read more at huggingface.co