Interpreting and Improving Large Language Models in Arithmetic Calculation

Wei Zhang; Chaoqun Wan; Yonggang Zhang; Yiu-ming Cheung; Xinmei Tian; Xu Shen; Jieping Ye

Interpreting and Improving Large Language Models in Arithmetic Calculation

Wei Zhang, Chaoqun Wan, Yonggang Zhang, Yiu-ming Cheung, Xinmei Tian, Xu Shen, Jieping Ye

TL;DR

This work serves as a preliminary exploration into the arithmetic calculation abilities inherent in LLMs, laying a solid foundation to reveal more intricate mathematical tasks.

Abstract

Large language models (LLMs) have demonstrated remarkable potential across numerous applications and have shown an emergent ability to tackle complex reasoning tasks, such as mathematical computations. However, even for the simplest arithmetic calculations, the intrinsic mechanisms behind LLMs remain mysterious, making it challenging to ensure reliability. In this work, we delve into uncovering a specific mechanism by which LLMs execute calculations. Through comprehensive experiments, we find that LLMs frequently involve a small fraction (< 5%) of attention heads, which play a pivotal role in focusing on operands and operators during calculation processes. Subsequently, the information from these operands is processed through multi-layer perceptrons (MLPs), progressively leading to the final solution. These pivotal heads/MLPs, though identified on a specific dataset, exhibit transferability across different datasets and even distinct tasks. This insight prompted us to investigate the potential benefits of selectively fine-tuning these essential heads/MLPs to boost the LLMs' computational performance. We empirically find that such precise tuning can yield notable enhancements on mathematical prowess, without compromising the performance on non-mathematical tasks. Our work serves as a preliminary exploration into the arithmetic calculation abilities inherent in LLMs, laying a solid foundation to reveal more intricate mathematical tasks.

Interpreting and Improving Large Language Models in Arithmetic Calculation

TL;DR

This work serves as a preliminary exploration into the arithmetic calculation abilities inherent in LLMs, laying a solid foundation to reveal more intricate mathematical tasks.

Abstract

Paper Structure (21 sections, 17 figures, 5 tables, 2 algorithms)

This paper contains 21 sections, 17 figures, 5 tables, 2 algorithms.

Introduction
Related Works
Preliminary
Method
Key Components Identification.
Pattern Analysis.
Precise Fine-tuning.
Experiments
Identifying Calculation-related Components.
Understanding Calculation-related Component Behaviors.
Precise SFT on Calculation-related Components.
Conclusion
Appendix
Templates
Evaluate the Effect of Attention Heads.
...and 6 more sections

Figures (17)

Figure 1: The pipeline involves three steps: 1) identify the key components attributed to arithmetic calculations in black-box LLMs, 2) analyze the working mechanism of the key components towards human-understandable explanations, 3) fine-tune the key components to precisely improve the mathematical capability of LLMs.
Figure 2: We conduct path patching experiments on LLaMA2-7B across four mathematical tasks, by searching for each head and MLP directly affecting the logit of the right answer. For each head/MLP, a darker color indicates a larger logit difference from the model before patching.
Figure 3: The influence on prediction accuracy after knocking out top-k attention heads that are sorted by the effect of each head on logits ("effect-rank"), and knocking out randomly-sorted top-k heads ("random-rank").
Figure 4: After knocking out the key heads, LLaMA2-7B predicts incorrectly on the cases of SVAMP dataset and other data formats of multi-digit integers, rational numbers.
Figure 5: The attention score distribution of key heads across four calculation tasks. The key heads (e.g., $13.11$, $14.2$) attend to number operands and calculation operators.
...and 12 more figures

Interpreting and Improving Large Language Models in Arithmetic Calculation

TL;DR

Abstract

Interpreting and Improving Large Language Models in Arithmetic Calculation

Authors

TL;DR

Abstract

Table of Contents

Figures (17)