Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models

Xinyu Pang; Ruixin Hong; Zhanke Zhou; Fangrui Lv; Xinwei Yang; Zhilong Liang; Bo Han; Changshui Zhang

Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models

Xinyu Pang, Ruixin Hong, Zhanke Zhou, Fangrui Lv, Xinwei Yang, Zhilong Liang, Bo Han, Changshui Zhang

TL;DR

This work tackles the knowledge gaps and misapplication failures of large language models on physics problems by introducing Physics Reasoner, a knowledge-augmented framework that combines a $122$-formula formula set with task-specific checklists and Python-based computation. The system operates in three stages—problem analysis, formula retrieval, and guided reasoning—guided by checklists to improve both knowledge acquisition and correct application, and it achieves a $5.8\%$ average accuracy improvement on the SciBench benchmark. Empirical results show state-of-the-art performance across multiple models and datasets, with ablation studies confirming the necessity of the formula set and checklists, as well as favorable token efficiency. The approach demonstrates the value of explicit physics knowledge and guided application for enhancing LLM-based physics problem solving and suggests that expanding the formula set could further improve capabilities.

Abstract

Physics problems constitute a significant aspect of reasoning, necessitating complicated reasoning ability and abundant physics knowledge. However, existing large language models (LLMs) frequently fail due to a lack of knowledge or incorrect knowledge application. To mitigate these issues, we propose Physics Reasoner, a knowledge-augmented framework to solve physics problems with LLMs. Specifically, the proposed framework constructs a comprehensive formula set to provide explicit physics knowledge and utilizes checklists containing detailed instructions to guide effective knowledge application. Namely, given a physics problem, Physics Reasoner solves it through three stages: problem analysis, formula retrieval, and guided reasoning. During the process, checklists are employed to enhance LLMs' self-improvement in the analysis and reasoning stages. Empirically, Physics Reasoner mitigates the issues of insufficient knowledge and incorrect application, achieving state-of-the-art performance on SciBench with an average accuracy improvement of 5.8%.

Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models

TL;DR

This work tackles the knowledge gaps and misapplication failures of large language models on physics problems by introducing Physics Reasoner, a knowledge-augmented framework that combines a

-formula formula set with task-specific checklists and Python-based computation. The system operates in three stages—problem analysis, formula retrieval, and guided reasoning—guided by checklists to improve both knowledge acquisition and correct application, and it achieves a

average accuracy improvement on the SciBench benchmark. Empirical results show state-of-the-art performance across multiple models and datasets, with ablation studies confirming the necessity of the formula set and checklists, as well as favorable token efficiency. The approach demonstrates the value of explicit physics knowledge and guided application for enhancing LLM-based physics problem solving and suggests that expanding the formula set could further improve capabilities.

Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models

TL;DR

Abstract

Physics Reasoner: Knowledge-Augmented Reasoning for Solving Physics Problems with Large Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)