Computational Algebra with Attention: Transformer Oracles for Border Basis Algorithms
Hiroshi Kera, Nico Pelleriti, Yuki Ishihara, Max Zimmer, Sebastian Pokutta
TL;DR
The paper tackles solving zero-dimensional polynomial systems by augmenting border-basis computations with a Transformer-based oracle that identifies and prunes expensive reductions while guaranteeing correct results. It introduces the Oracle Border Basis Algorithm (OBBA), a data-efficient supervised learning framework that learns to select a small subset of reductions during the final, most time-consuming stage of computation. The authors develop a monomial-centered input representation, border-basis sampling via a new sampling theorem, and an ideal-invariant generator transform to create diverse training data. Empirically, OBBA achieves up to $3.5\times$ speedups over the traditional approach, with solid performance in both in-distribution and out-of-distribution settings, demonstrating a practical integration of learning into symbolic computation.
Abstract
Solving systems of polynomial equations, particularly those with finitely many solutions, is a crucial challenge across many scientific fields. Traditional methods like Gröbner and Border bases are fundamental but suffer from high computational costs, which have motivated recent Deep Learning approaches to improve efficiency, albeit at the expense of output correctness. In this work, we introduce the Oracle Border Basis Algorithm, the first Deep Learning approach that accelerates Border basis computation while maintaining output guarantees. To this end, we design and train a Transformer-based oracle that identifies and eliminates computationally expensive reduction steps, which we find to dominate the algorithm's runtime. By selectively invoking this oracle during critical phases of computation, we achieve substantial speedup factors of up to 3.5x compared to the base algorithm, without compromising the correctness of results. To generate the training data, we develop a sampling method and provide the first sampling theorem for border bases. We construct a tokenization and embedding scheme tailored to monomial-centered algebraic computations, resulting in a compact and expressive input representation, which reduces the number of tokens to encode an $n$-variate polynomial by a factor of $O(n)$. Our learning approach is data efficient, stable, and a practical enhancement to traditional computer algebra algorithms and symbolic computation.
