Regularization in Spider-Style Strategy Discovery and Schedule Construction
Filip Bártek, Karel Chvalovský, Martin Suda
TL;DR
The paper investigates Spider-style strategy discovery and schedule construction for Vampire on the FOF subset of the TPTP library, demonstrating that a large, diverse pool of strategies can be leveraged to build effective schedules. It shows that a polynomial-time greedy schedule Construction (SCP) can rival or exceed optimal schedules in practice, especially when augmented with regularization techniques that improve generalization to unseen problems. The study provides detailed experimental results across multiple budgets, reveals the trade-offs between training and test performance, and highlights temporal reward and slice-extension regularizations as particularly effective. The findings suggest that robust, fast, and reusable scheduling methods can greatly improve practical theorem proving performance and generalization without resorting to expensive optimal solvers. The work also contributes a dataset and methodology to facilitate further research in strategy discovery and scheduling for ATPs.
Abstract
To achieve the best performance, automatic theorem provers often rely on schedules of diverse proving strategies to be tried out (either sequentially or in parallel) on a given problem. In this paper, we report on a large-scale experiment with discovering strategies for the Vampire prover, targeting the FOF fragment of the TPTP library and constructing a schedule for it, based on the ideas of Andrei Voronkov's system Spider. We examine the process from various angles, discuss the difficulty (or ease) of obtaining a strong Vampire schedule for the CASC competition, and establish how well a schedule can be expected to generalize to unseen problems and what factors influence this property.
