Uncovering Context Reliance in Unstructured Knowledge Editing
Zisheng Zhou, Mengqi Zhang, Shiguang Wu, Xiaotian Ye, Chi Zhang, Zhumin Chen, Pengjie Ren
TL;DR
The paper identifies Context Reliance as a fundamental failure mode of next-token prediction (NTP) when editing unstructured knowledge in LLMs, where learned facts become tethered to preceding text and fail to recall when that context is absent. It provides empirical evidence and a gradient-based theoretical explanation for why this context dependence arises, then proposes COIN, a simple framework combining Context Alignment Loss $\mathcal{L}_{\text{align}}$ and Knowledge Consistency Loss $\mathcal{L}_{\text{cons}}$ to decouple knowledge from contextual patterns. Empirical results on Llama3-8B and Qwen2.5-7B across AKEW, UnKEBench, and MQuAKE show COIN substantially mitigates Context Reliance (about 45.2% reduction) and improves unstructured editing success by 25.6%, with strong generalization to structured, multi-hop tasks. This work highlights a path toward robust, context-robust unstructured knowledge editing by rethinking training/edit objectives to internalize knowledge independently of its textual surroundings.
Abstract
Editing Large language models (LLMs) with real-world, unstructured knowledge is essential for correcting and updating their internal parametric knowledge. In this work, we revisit the fundamental next-token prediction (NTP) as a candidate paradigm for unstructured editing. We identify Context Reliance as a critical failure mode of NTP-based approaches, where knowledge acquired from edited text becomes highly dependent on its preceding context, leading to recall failures when that context is absent during inference. This hypothesis is supported by our empirical validation that prepending context during inference recovers knowledge recall. We further theoretically demonstrate that Context Reliance is an inherent consequence of gradient-based optimization, which tends to bind acquired knowledge to a specific aggregated contextual representation. To address this, we propose a simple yet effective COntext-INdependent editing framework (COIN), encouraging model to focus on knowledge within local scope rather than memorizing contextual patterns. Evaluations show that COIN reduces Context Reliance by 45.2% and outperforms strong baselines by 23.6% in editing success rate, highlighting the vital role of mitigating Context Reliance for robust editing.
