GAPS: Geometry-Aware Problem Solver
Jiaxin Zhang, Yinghui Jiang, Yashar Moshfeghi
TL;DR
GAPS introduces a geometry-aware problem solver that jointly handles geometry CAL and PRV problems via a unified solution-program generator, guided by a problem-type classifier and a joint diagram-text encoder. It advances geometry reasoning by separately generating operators and operands from domain-specific languages, and by enhancing geometry element recognition through a geometry elements enhancement technique, plus a Hierarchical Beam Search to explore operator-operand combinations. Empirical results on UniGeo and PGPS9K show substantial gains, most notably a 5.3% CAL improvement and a 41.1% PRV improvement over Geoformer, with PRV accuracy reaching 97.5%. The work demonstrates that a single generator with a discriminative problem-type module, along with targeted enhancements and data augmentation, can robustly solve diverse geometry problems and guide future multi-type geometry reasoning systems.
Abstract
Geometry problem solving presents a formidable challenge within the NLP community. Existing approaches often rely on models designed for solving math word problems, neglecting the unique characteristics of geometry math problems. Additionally, the current research predominantly focuses on geometry calculation problems, while overlooking other essential aspects like proving. In this study, we address these limitations by proposing the Geometry-Aware Problem Solver (GAPS) model. GAPS is specifically designed to generate solution programs for geometry math problems of various types with the help of its unique problem-type classifier. To achieve this, GAPS treats the solution program as a composition of operators and operands, segregating their generation processes. Furthermore, we introduce the geometry elements enhancement method, which enhances the ability of GAPS to recognize geometry elements accurately. By leveraging these improvements, GAPS showcases remarkable performance in resolving geometry math problems. Our experiments conducted on the UniGeo dataset demonstrate the superiority of GAPS over the state-of-the-art model, Geoformer. Specifically, GAPS achieves an accuracy improvement of more than 5.3% for calculation tasks and an impressive 41.1% for proving tasks. Notably, GAPS achieves an impressive accuracy of 97.5% on proving problems, representing a significant advancement in solving geometry proving tasks.
