GitHub - huggingface/open-r1: Fully open reproduction of DeepSeek-R1
Open R1
A fully open reproduction of DeepSeek-R1. This repo is a work in progress, let's build it together!
Overview
The goal of this repo is to build the missing pieces of the R1 pipeline such that everybody can reproduce and build on top of it. The project is simple by design and mostly consists of:
src/open_r1 contains the scripts to train and evaluate models as well as generate synthetic data:
grpo.py: trains a model with GRPO on a given dataset.
sft.py: simple SFT of a model on a dataset.
e...
Read more at github.com