Large Language Models have Intrinsic Self-Correction Ability
Dancheng Liu, Amir Nassereldine, Ziming Yang, Chenhui Xu, Yuting Hu, Jiajie Li, Utkarsh Kumar, Changjae Lee, Ruiyang Qin, Yiyu Shi, Jinjun Xiong
TL;DR
This paper argues that intrinsic self-correction in large language models is a real capability, not an artifact of external feedback. It develops a theoretical and empirical framework linking intrinsic SC to chain-of-thought-like reasoning and self-verification, and identifies zero temperature and unbiased prompts as key enablers. Through experiments across multiple models and benchmarks, the authors show that intrinsic SC can improve accuracy when prompts are fair and temperature is kept at zero, with larger models exhibiting stronger SC effects. The work provides practical guidelines for prompting intrinsic SC and contributes to the theoretical understanding of LLM self-correction mechanisms.
Abstract
Large language models (LLMs) have attracted significant attention for their exceptional abilities in various natural language processing tasks, but they suffer from hallucinations that will cause performance degradation. One promising solution to improve the LLMs' performance is to ask LLMs to revise their answer after generation, a technique known as self-correction. Among the two types of self-correction, intrinsic self-correction is considered a promising direction because it does not utilize external knowledge. However, recent works doubt the validity of LLM's ability to conduct intrinsic self-correction. In this paper, we present a novel perspective on the intrinsic self-correction capabilities of LLMs through theoretical analyses and empirical experiments. In addition, we identify two critical factors for successful self-correction: zero temperature and fair prompts. Leveraging these factors, we demonstrate that intrinsic self-correction ability is exhibited across multiple existing LLMs. Our findings offer insights into the fundamental theories underlying the self-correction behavior of LLMs and remark on the importance of unbiased prompts and zero temperature settings in harnessing their full potential.
