Diverse Representation Embedding for Lifelong Person Re-Identification
Shiben Liu, Huijie Fan, Qiang Wang, Xiai Chen, Zhi Han, Yandong Tang
TL;DR
This work tackles lifelong person re-identification (LReID), where models must preserve previously learned knowledge while adapting to sequential new tasks with limited old-task data and domain shifts. It introduces Diverse Representation Embedding (DRE), a transformer-based framework that generates multiple overlapping representations per instance via multiple class tokens and a Maximum Embedding (ME) mechanism, and uses an Adaptive Constraint Module (ACM) to integrate and separate these representations. It further proposes Knowledge Update (KU) and Knowledge Preservation (KP) strategies to balance adaptive learning for new information and retention of old knowledge through representation- and logit-level mechanisms, including losses such as $L_{id}$, $L_{trip}$, $L_{LLD}$, $L_{con}$, and $LLS$, with an EMA-based slow-update strategy. Extensive experiments across seen and unseen domains, including occluded datasets, demonstrate that DRE outperforms state-of-the-art methods and exhibits strong generalization and robustness in lifelong settings.
Abstract
Lifelong Person Re-Identification (LReID) aims to continuously learn from successive data streams, matching individuals across multiple cameras. The key challenge for LReID is how to effectively preserve old knowledge while incrementally learning new information, which is caused by task-level domain gaps and limited old task datasets. Existing methods based on CNN backbone are insufficient to explore the representation of each instance from different perspectives, limiting model performance on limited old task datasets and new task datasets. Unlike these methods, we propose a Diverse Representations Embedding (DRE) framework that first explores a pure transformer for LReID. The proposed DRE preserves old knowledge while adapting to new information based on instance-level and task-level layout. Concretely, an Adaptive Constraint Module (ACM) is proposed to implement integration and push away operations between multiple overlapping representations generated by transformer-based backbone, obtaining rich and discriminative representations for each instance to improve adaptive ability of LReID. Based on the processed diverse representations, we propose Knowledge Update (KU) and Knowledge Preservation (KP) strategies at the task-level layout by introducing the adjustment model and the learner model. KU strategy enhances the adaptive learning ability of learner models for new information under the adjustment model prior, and KP strategy preserves old knowledge operated by representation-level alignment and logit-level supervision in limited old task datasets while guaranteeing the adaptive learning information capacity of the LReID model. Compared to state-of-the-art methods, our method achieves significantly improved performance in holistic, large-scale, and occluded datasets.
