SIGMA: Search-Augmented On-Demand Knowledge Integration for Agentic Mathematical Reasoning
Ali Asgarov, Umid Suleymanov, Aadyant Khatri
TL;DR
Mathematical reasoning requires both accurate knowledge and careful multi-step deduction, and single-perspective retrieval often fails on knowledge-intensive tasks. SIGMA introduces a multi-agent framework with four specialist agents (Factual, Logical, Computational, Completeness) plus a moderator to perform independent reasoning-search cycles and integrate results via Hypothetical Document Enhancement for on-demand retrieval. The approach yields consistent gains over strong baselines on $MATH500$, $AIME$, $AMC$, and $GPQA$, including an absolute improvement of $7.4\%$ over competing methods, while remaining efficient at smaller model scales. This work demonstrates that on-demand, multi-perspective knowledge integration is a scalable and effective paradigm for complex mathematical problem solving and potentially other knowledge-intensive domains.
Abstract
Solving mathematical reasoning problems requires not only accurate access to relevant knowledge but also careful, multi-step thinking. However, current retrieval-augmented models often rely on a single perspective, follow inflexible search strategies, and struggle to effectively combine information from multiple sources. We introduce SIGMA (Search-Augmented On-Demand Knowledge Integration for AGentic Mathematical reAsoning), a unified framework that orchestrates specialized agents to independently reason, perform targeted searches, and synthesize findings through a moderator mechanism. Each agent generates hypothetical passages to optimize retrieval for its analytic perspective, ensuring knowledge integration is both context-sensitive and computation-efficient. When evaluated on challenging benchmarks such as MATH500, AIME, and PhD-level science QA GPQA, SIGMA consistently outperforms both open- and closed-source systems, achieving an absolute performance improvement of 7.4%. Our results demonstrate that multi-agent, on-demand knowledge integration significantly enhances both reasoning accuracy and efficiency, offering a scalable approach for complex, knowledge-intensive problem-solving. We will release the code upon publication.
