Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions

Yoonah Park; Haesung Pyun; Yohan Jo

Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions

Yoonah Park, Haesung Pyun, Yohan Jo

TL;DR

The paper tackles the known discrepancy where LLMs internalize correct knowledge but falter on MCQs. It uncovers a low-dimensional, geometry-based knowledge–prediction subspace in residual streams, spanned by a knowledge basis and a prediction basis. Through KAPPA, a parameter-free, inference-time affine transformation aligns predictions with latent knowledge, yielding substantial gains on binary and multi-choice MCQs and extending to free-form generation. The approach demonstrates cross-dataset generalization and maintains or improves general capabilities, offering a practical method to elicit more faithful knowledge usage from LLMs. This work provides both a geometric understanding of the gap and a lightweight technique for more accurate model behavior in knowledge-intensive tasks.

Abstract

Large Language Models (LLMs) often fail on multiple-choice questions (MCQs) despite demonstrating correct knowledge in other contexts, such as free-form generation. To investigate the mechanism underlying this knowledge-prediction gap on MCQs and alleviate it, we conduct a probing analysis and find that residual streams in certain layers contain a subspace spanned by two important bases: a \emph{knowledge basis} that encodes the probability of the ground-truth answer for a given MCQ and a \emph{prediction basis} that encodes the probability of the answer choice predicted by the model. We observe that incorrect predictions arise from a misalignment of the model's hidden states along these two bases. Hence, we introduce \textbf{KAPPA} (Knowledge-Aligned Prediction through Projection-based Adjustment), a parameter-free intervention that transforms the hidden states to align the prediction coordinate with the knowledge coordinate within this subspace. Experiments on binary-choice reformulations of Big-Bench-Hard and ARC-Challenge show that KAPPA substantially improves accuracy and consistently outperforms baselines. While optimal subspaces differ across tasks, subspaces generalize to some extent, as supported by cross-dataset experiments. Moreover, KAPPA extends its effectiveness to free-form questions beyond MCQs. Our work provides a new geometric understanding of the knowledge-prediction gap and offers a practical method for better aligning model behavior with its latent knowledge.

Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions

TL;DR

Abstract

Bridging the Knowledge-Prediction Gap in LLMs on Multiple-Choice Questions

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)