SAAS: Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models
Hyeonwoo Kim, Gyoungjin Gim, Yungi Kim, Jihoo Kim, Byungju Kim, Wonseok Lee, Chanjun Park
TL;DR
This work addresses the gap in mathematical reasoning in large language models by introducing SAAS, a sequential learning framework that first trains on Chain-of-Thought (CoT) reasoning and then on Program-of-Thought (PoT) with a cognitive retention mechanism. The approach aims to amplify solving ability by solidifying logical skills before delegating computation to code-like PoT reasoning, and by mixing CoT samples into the PoT phase to prevent cognitive forgetting. Extensive experiments on GSM8K, MATH, and related benchmarks show SAAS achieving state-of-the-art results across multiple model scales, with ablations highlighting the importance of sequencing and retention. The findings suggest that carefully ordered CoT-to-PoT training, safeguarded by cognitive retention, can substantially enhance both mathematical reasoning and computational accuracy in LLMs, offering a practical route toward more reliable mathematical problem solving in real-world applications.
Abstract
This study presents a novel learning approach designed to enhance both mathematical reasoning and problem-solving abilities of Large Language Models (LLMs). We focus on integrating the Chain-of-Thought (CoT) and the Program-of-Thought (PoT) learning, hypothesizing that prioritizing the learning of mathematical reasoning ability is helpful for the amplification of problem-solving ability. Thus, the initial learning with CoT is essential for solving challenging mathematical problems. To this end, we propose a sequential learning approach, named SAAS (Solving Ability Amplification Strategy), which strategically transitions from CoT learning to PoT learning. Our empirical study, involving an extensive performance comparison using several benchmarks, demonstrates that our SAAS achieves state-of-the-art (SOTA) performance. The results underscore the effectiveness of our sequential learning approach, marking a significant advancement in the field of mathematical reasoning in LLMs.
