Proving Olympiad Algebraic Inequalities without Human Demonstrations
Chenrui Wei, Mengzhou Sun, Wei Wang
TL;DR
This work tackles the challenge of solving Olympiad-level algebraic inequalities with AI by introducing AIPS, a system that fuses a symbolic deductive engine with a neural value network to navigate vast inequality spaces. It generates a large synthetic corpus of high-quality theorems (about 191k) through forward reasoning and pattern matching, and trains the value network with curriculum learning to guide proof search. On a 20-problem IMO-level inequality benchmark (MO-INT-20), AIPS solves 10 problems, outperforming state-of-the-art baselines and several large language models, with some synthetic theorems reaching IMO-level difficulty. AIPS not only solves problems but also produces human-readable proofs and a set of non-trivial theorems, including one selected for a major city’s Mathematical Olympiad, underscoring its potential to augment mathematical discovery while highlighting current limitations in autonomous definition generation.
Abstract
Solving Olympiad-level mathematical problems represents a significant advancement in machine intelligence and automated reasoning. Current machine learning methods, however, struggle to solve Olympiad-level problems beyond Euclidean plane geometry due to a lack of large-scale, high-quality datasets. The challenge is even greater in algebraic systems, which involve infinite reasoning spaces within finite conditions. To address these issues, we propose AIPS, an Algebraic Inequality Proving System capable of autonomously generating complex inequality theorems and effectively solving Olympiad-level inequality problems without requiring human demonstrations. During proof search in a mixed reasoning manner, a value curriculum learning strategy on generated datasets is implemented to improve proving performance, demonstrating strong mathematical intuitions. On a test set of 20 International Mathematical Olympiad-level inequality problems, AIPS successfully solved 10, outperforming state-of-the-art methods. Furthermore, AIPS automatically generated a vast array of non-trivial theorems without human intervention, some of which have been evaluated by professional contestants and deemed to reach the level of the International Mathematical Olympiad. Notably, one theorem was selected as a competition problem in a major city 2024 Mathematical Olympiad.
