DP-CRE: Continual Relation Extraction via Decoupled Contrastive Learning and Memory Structure Preservation
Mengyi Huang, Meng Xiao, Ludi Wang, Yi Du
TL;DR
DP-CRE tackles catastrophic forgetting in continual relation extraction by decoupling the preservation of prior knowledge from the acquisition of new knowledge. It applies decoupled contrastive learning to new tasks and a change-amount constraint to preserve the structure of memory samples, supported by multi-task balancing and memory-guided prototypes. Experiments on FewRel and TACRED show accuracy gains over prior CRE baselines, along with memory efficiency and robustness to task imbalance. By stabilizing representations as the relation space evolves, the approach makes continual relation extraction more practical for NLP applications.
Abstract
Continual Relation Extraction (CRE) aims to incrementally learn relation knowledge from a non-stationary stream of data. Because newly introduced relation tasks can overshadow previously learned information, catastrophic forgetting is a significant challenge in this setting. Current replay-based training paradigms give all data equal priority and train on memory samples for multiple rounds, which leads to overfitting on old tasks and a pronounced bias toward new tasks because the replay set is imbalanced. To address this problem, we introduce the DecouPled CRE (DP-CRE) framework, which decouples the preservation of prior information from the acquisition of new knowledge. The framework examines how the embedding space changes as new relation classes emerge and handles preservation and acquisition separately. Extensive experiments show that DP-CRE significantly outperforms other CRE baselines on two datasets.
