Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
Thomas Hartvigsen, Swami Sankaranarayanan, Hamid Palangi, Yoon Kim, Marzyeh Ghassemi
TL;DR
Aging with GRACE introduces lifelong model editing by attaching discrete codebook-based Adaptors to pre-trained transformers, allowing targeted, input-driven edits without altering weights. Through a deferral-based retrieval mechanism and progressive codebook maintenance, GRACE achieves thousands of sequential edits while preserving training-data behavior and prior corrections. Empirical results across T5, BERT, and GPT2-XL show GRACE outperforms baselines on edit success and retention metrics with compact codebooks and modest inference-time overhead. The approach offers a plug-and-play, parameter-efficient solution for mitigating deployment-time errors such as hallucinations or label shifts, with interpretability via detachable codebooks. Overall, GRACE demonstrates scalable, generalizable, and efficient lifelong editing for large-scale language models.
Abstract
Deployed language models decay over time due to shifting inputs, changing user needs, or emergent world-knowledge gaps. When such problems are identified, we want to make targeted edits while avoiding expensive retraining. However, current model editors, which modify such behaviors of pre-trained models, degrade model performance quickly across multiple, sequential edits. We propose GRACE, a lifelong model editing method, which implements spot-fixes on streaming errors of a deployed model, ensuring minimal impact on unrelated inputs. GRACE writes new mappings into a pre-trained model's latent space, creating a discrete, local codebook of edits without altering model weights. This is the first method enabling thousands of sequential edits using only streaming errors. Our experiments on T5, BERT, and GPT models show GRACE's state-of-the-art performance in making and retaining edits, while generalizing to unseen inputs. Our code is available at https://www.github.com/thartvigsen/grace}.
