Federated Incremental Named Entity Recognition

Duzhen Zhang; Yahan Yu; Chenxing Li; Jiahua Dong; Dong Yu

Federated Incremental Named Entity Recognition

Duzhen Zhang, Yahan Yu, Chenxing Li, Jiahua Dong, Dong Yu

TL;DR

FINER addresses practical federated NER with streaming new entity types and non-IID clients. The LGFD framework integrates structural knowledge distillation, pseudo-label-guided inter-type contrastive learning, and a task-switching monitor to combat forgetting from both intra- and inter-client perspectives, validated on I2B2 and OntoNotes5. Empirical results show LGFD consistently outperforms state-of-the-art INER baselines under FINER across multiple settings, with ablations confirming the contributions of SKD and ITC and robustness to entity-type order. The work offers a privacy-preserving, scalable approach for continual NER in realistic federated environments, with implications for medical and large-scale language understanding tasks.

Abstract

Federated Named Entity Recognition (FNER) boosts model training within each local client by aggregating the model updates of decentralized local clients, without sharing their private data. However, existing FNER methods assume fixed entity types and local clients in advance, leading to their ineffectiveness in practical applications. In a more realistic scenario, local clients receive new entity types continuously, while new local clients collecting novel data may irregularly join the global FNER training. This challenging setup, referred to here as Federated Incremental NER, renders the global model suffering from heterogeneous forgetting of old entity types from both intra-client and inter-client perspectives. To overcome these challenges, we propose a Local-Global Forgetting Defense (LGFD) model. Specifically, to address intra-client forgetting, we develop a structural knowledge distillation loss to retain the latent space's feature structure and a pseudo-label-guided inter-type contrastive loss to enhance discriminative capability over different entity types, effectively preserving previously learned knowledge within local clients. To tackle inter-client forgetting, we propose a task switching monitor that can automatically identify new entity types under privacy protection and store the latest old global model for knowledge distillation and pseudo-labeling. Experiments demonstrate significant improvement of our LGFD model over comparison methods.

Federated Incremental Named Entity Recognition

TL;DR

Abstract

Federated Incremental Named Entity Recognition

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)