DeepSeek launches DeepSeekMath-V2 AI model with self-verification capabilities, achieving gold-level scores on IMO 2025, CMO 2024; scores 118/120 on Putnam 2024 using theorem-proving and reinforcement learning techniques

deepseek-ai/DeepSeek-Math-V2 · Hugging Face

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning 1. Introduction Large language models have made significant progress in mathematical reasoning, which serves as an important testbed for AI and could impact scientific research if further advanced. By scaling reasoning with reinforcement learning that rewards correct final answers, LLMs have improved from poor performance to saturating quantitative reasoning competitions like AIME and HMMT in one year. However, this approach faces f...