deepseek-ai/DeepSeek-Math-V2 · Hugging Face
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning
1. Introduction
Large language models have made significant progress in mathematical reasoning, which serves as an important testbed for AI and could impact scientific research if further advanced.
By scaling reasoning with reinforcement learning that rewards correct final answers, LLMs have improved from poor performance to saturating quantitative reasoning competitions like AIME and HMMT in one year.
However, this approach faces f...
Read more at huggingface.co