Winning Gold at IMO 2025 with a Model-Agnostic Verification-and-Refinement Pipeline
View PDF
HTML (experimental)
Abstract:The International Mathematical Olympiad (IMO) is widely regarded as the world championship of high-school mathematics. IMO problems are renowned for their difficulty and novelty, demanding deep insight, creativity, and rigor. Although large language models perform well on many mathematical benchmarks, they often struggle with Olympiad-level problems. Using carefully designed prompts, we construct a model-agnostic, verification-and-refinement pipeline. We dem...
Read more at arxiv.org