In-Context Editing: Learning Knowledge from Self-Induced Distributions
Siyuan Qi, Bangcheng Yang, Kailin Jiang, Xiaobo Wang, Jiaqi Li, Yifan Zhong, Yaodong Yang, Zilong Zheng
TL;DR
Consistent In-Context Editing (ICE) tackles efficient knowledge updates in large language models by steering learning toward a contextual distribution induced by targeted knowledge, rather than optimizing to a one-hot target. The method combines a standard fine-tuning objective with a contextual consistency loss, enforcing $p_ heta(oldsymbol{x}|ig[oldsymbol{c},oldsymbol{q}ig]) \approx p_ heta(oldsymbol{x}|oldsymbol{q})$ while sampling context-conditioned sequences to guide updates. Empirical results on KnowEdit datasets show ICE achieves high edit accuracy and favorable locality and portability, with strong linguistic quality and robust continual editing capabilities, often outperforming baselines like ROME, MEMIT, FT-L, and FT-M. The work demonstrates that learning toward a contextual distribution yields more stable, generalizable updates, preserving the model’s existing knowledge while incorporating new information, albeit with higher computational costs due to in-context sampling and optional external-context generation. Overall, ICE offers a practical, gradient-based framework for continual knowledge editing with promising implications for personalized and updatable LLM deployments.
Abstract
In scenarios where language models must incorporate new information efficiently without extensive retraining, traditional fine-tuning methods are prone to overfitting, degraded generalization, and unnatural language generation. To address these limitations, we introduce Consistent In-Context Editing (ICE), a novel approach leveraging the model's in-context learning capability to optimize toward a contextual distribution rather than a one-hot target. ICE introduces a simple yet effective optimization framework for the model to internalize new knowledge by aligning its output distributions with and without additional context. This method enhances the robustness and effectiveness of gradient-based tuning methods, preventing overfitting and preserving the model's integrity. We analyze ICE across four critical aspects of knowledge editing: accuracy, locality, generalization, and linguistic quality, demonstrating its advantages. Experimental results confirm the effectiveness of ICE and demonstrate its potential for continual editing, ensuring that the integrity of the model is preserved while updating information.
