Learning Conjecturing from Scratch

Thibault Gauthier; Josef Urban

Learning Conjecturing from Scratch

Thibault Gauthier, Josef Urban

TL;DR

The paper addresses automated conjecturing for induction predicates needed to prove equivalences between OEIS-derived programs. It introduces a self-learning loop that couples neural translation-based predicate generation with SMT evaluation, iteratively refining the predicate set via minimization and selection. On the OEIS ATP Benchmark, the approach discovers and tests second-order induction axioms whose instances are evaluated by SMT, achieving a substantial improvement by solving 5565 of 16197 problems, outperforming traditional induction strategies. The work yields a strong NMT+ATP system and a large dataset of useful induction predicates, offering a scalable path toward autonomous mathematical conjecturing and potential applications in program verification and algorithm optimization.

Abstract

We develop a self-learning approach for conjecturing of induction predicates on a dataset of 16197 problems derived from the OEIS. These problems are hard for today's SMT and ATP systems because they require a combination of inductive and arithmetical reasoning. Starting from scratch, our approach consists of a feedback loop that iterates between (i) training a neural translator to learn the correspondence between the problems solved so far and the induction predicates useful for them, (ii) using the trained neural system to generate many new induction predicates for the problems, (iii) fast runs of the z3 prover attempting to prove the problems using the generated predicates, (iv) using heuristics such as predicate size and solution speed on the proved problems to choose the best predicates for the next iteration of training. The algorithm discovers on its own many interesting induction predicates, ultimately solving 5565 problems, compared to 2265 problems solved by CVC5, Vampire or Z3 in 60 seconds.

Learning Conjecturing from Scratch

TL;DR

Abstract

Learning Conjecturing from Scratch

TL;DR

Abstract

Paper Structure

Table of Contents