AnyEdit: Edit Any Knowledge Encoded in Language Models
Houcheng Jiang, Junfeng Fang, Ningyu Zhang, Guojun Ma, Mingyang Wan, Xiang Wang, Xiangnan He, Tat-seng Chua
TL;DR
AnyEdit introduces an autoregressive, chunk-based knowledge editing framework to update long-form and diverse-formatted knowledge encoded in large language models. Grounded in the chain rule of mutual information, it sequentially edits chunks to avoid interference and scale to arbitrary lengths, while remaining plug-and-play with existing editing methods. Empirically, AnyEdit achieves about a 21.5% average improvement on long-form and diverse-form benchmarks and demonstrates strong generalization across domains, with insights into chunk sizing for practical deployment. The work also contributes the EditEverything dataset and shows that chunk-based autoregressive editing significantly broadens the scope and practicality of knowledge editing in LLMs.
Abstract
Large language models (LLMs) often produce incorrect or outdated information, necessitating efficient and precise knowledge updates. Current model editing methods, however, struggle with long-form knowledge in diverse formats, such as poetry, code snippets, and mathematical derivations. These limitations arise from their reliance on editing a single token's hidden state, a limitation we term "efficacy barrier". To solve this, we propose AnyEdit, a new autoregressive editing paradigm. It decomposes long-form knowledge into sequential chunks and iteratively edits the key token in each chunk, ensuring consistent and accurate outputs. Theoretically, we ground AnyEdit in the Chain Rule of Mutual Information, showing its ability to update any knowledge within LLMs. Empirically, it outperforms strong baselines by 21.5% on benchmarks including UnKEBench, AKEW, and our new EditEverything dataset for long-form diverse-formatted knowledge. Additionally, AnyEdit serves as a plug-and-play framework, enabling current editing methods to update knowledge with arbitrary length and format, significantly advancing the scope and practicality of LLM knowledge editing.
