Continual Memorization of Factoids in Language Models
Howard Chen, Jiayi Geng, Adithya Bhaskar, Dan Friedman, Danqi Chen
TL;DR
This work introduces continual memorization, a two-stage fine-tuning framework in which a model must memorize factoids learned in stage one and retain them after stage two with potentially conflicting data. It reveals that forgetting is especially severe when stage two is another factoid dataset and that replay alone cannot fully mitigate this forgetting. The authors propose REMIX, a simple yet effective data-mixing strategy that incorporates random word sequences and generic pretraining data into training stages, yielding substantial retention gains over baselines and revealing that memorized facts are stored in earlier and more diversified layers. Through analytical and probing methods (e.g., Logit Lens), REMIX is shown to alter memorization dynamics, enabling easier recall and manipulation of facts with minimal impact on downstream performance. The results offer practical guidance for preserving knowledge in LMs during continual updating and open avenues for further study on stability and safety of memorized knowledge.
Abstract
As new knowledge rapidly accumulates, language models (LMs) with pretrained knowledge quickly become obsolete. A common approach to updating LMs is fine-tuning them directly on new knowledge. However, recent studies have shown that fine-tuning for memorization may be ineffective in storing knowledge or may exacerbate hallucinations. In this work, we introduce a setting we call continual memorization, where a model must memorize and retain a set of factoids through multiple stages of fine-tuning on subsequent datasets. We characterized the forgetting patterns through extensive experiments and show that LMs widely suffer from forgetting, especially when needing to memorize factoids in the second stage. We posit that forgetting can be alleviated by modifying training dynamics: (1) protecting the memorization process when learning factoids or (2) reducing interference from subsequent training stages. Intriguingly, we find that mixing randomly generated word sequences or generic data sampled from pretraining corpora at different training stages effectively mitigates forgetting REMIX: Random and Generic Data Mixing). REMIX can recover performance from severe forgetting, outperforming replay methods and other continual learning baselines. We analyze how REMIX influences the learning process and find that robust memorization follows a distinct pattern: the model stores factoids in earlier layers than usual and diversifies the layers that retain them, which results in easier recall and manipulate of the learned factoids.
