Improve Decoding Factuality by Token-wise Cross Layer Entropy of Large Language Models
Jialiang Wu, Yi Shen, Sijia Liu, Yi Tang, Sen Song, Xiaoyi Wang, Longjun Cai
TL;DR
This work tackles the persistent hallucination problem in large language models by introducing END, a training-free decoding method that leverages token-wise cross-layer prediction changes to quantify the factual knowledge required for each candidate token. END computes a cross-layer distribution across intermediate layers, derives a cross-layer entropy as a factuality signal, and reweights the final token distribution to favor factually grounded tokens. Extensive experiments on TruthfulQA, FACTOR, TriviaQA, and Natural Questions demonstrate that END improves factuality and informativeness while preserving core QA performance, with evidence of generalization across model scales and backbones. The approach offers a new lens on the relationship between internal hidden states and output truthfulness at the token level, and provides a practical, training-free tool for reducing hallucinations in real-world generation tasks.
Abstract
Despite their impressive capacities, Large language models (LLMs) often struggle with the hallucination issue of generating inaccurate or fabricated content even when they possess correct knowledge. In this paper, we extend the exploration of the correlation between hidden-state prediction changes and output factuality into a deeper, token-wise level. Based on the insights , we propose cross-layer Entropy eNhanced Decoding (END), a decoding method that mitigates hallucinations without requiring extra training. END leverages inner probability changes across layers to individually quantify the factual knowledge required for each candidate token, and adjusts the final predicting distribution to prioritize tokens with higher factuality. Experiments on both hallucination and QA benchmarks demonstrate that END significantly enhances the truthfulness and informativeness of generated content while maintaining robust QA accuracy. Moreover, our work provides a deeper perspective on understanding the correlations between inherent knowledge and output factuality.
