Advancing mathematics research with generative AI
Lisa Carbone
TL;DR
The work surveys how generative AI—particularly LLMs, LRMs, and LCMs—can assist mathematical research by identifying patterns, guiding computational work, and enabling interactive collaboration, while acknowledging fundamental limits in formal deduction and verification. It details the mechanics of tokens, embeddings, and context windows, and discusses how prompt design, chain-of-thought reasoning, and neuro-symbolic verification enhance reliability. The paper also covers practical integrations with Computer Algebra Systems and formal proof assistants (e.g., Lean), case studies in Combinatorial Group Theory, and a future vision of iterative human–AI collaboration that accelerates discovery. Overall, it argues for a hybrid, carefully verified workflow where AI acts as a creative partner, not a sole oracle, in mathematical practice.
Abstract
The main drawback of using generative AI models for advanced mathematics is that these models are not primarily logical reasoning engines. However, Large Language Models, and their refinements, can pick up on patterns in higher mathematics that are difficult for humans to see. By putting the design of generative AI models to their advantage, mathematicians may use them as powerful interactive assistants that can carry out laborious tasks, generate and debug code, check examples, formulate conjectures and more. We discuss how generative AI models can be used to advance mathematics research. We also discuss their integration with neuro-symbolic solvers, Computer Algebra Systems and formal proof assistants such as Lean.
