OpenAI secretly funded AI math benchmark FrontierMath, then set a record with its o3 model, sparking transparency concerns

OpenAI quietly funded independent math benchmark before setting record with o3

OpenAI's involvement in funding FrontierMath, a leading AI math benchmark, only came to light when the company announced its record-breaking performance on the test. Now the benchmark's developer, Epoch AI, acknowledges that it should have been more transparent about the relationship. FrontierMath, introduced in November 2024, tests how well AI systems can tackle complex mathematical problems that require advanced reasoning and problem-solving skills, the kind of tasks that typically stump even the...