Mix-Modality Person Re-Identification: A New and Practical Paradigm

Wei Liu; Xin Xu; Hua Chang; Xin Yuan; Zheng Wang

Mix-Modality Person Re-Identification: A New and Practical Paradigm

Wei Liu, Xin Xu, Hua Chang, Xin Yuan, Zheng Wang

TL;DR

This work addresses the gap in cross-modality person re-identification by introducing Mix-Modality Re-Identification (MM-ReID), where both query and gallery contain mixed visible and infrared images. It proposes two technical solutions, Cross-Identity Discrimination Harmonization Loss (CIDHL) and Modality Bridge Similarity Optimization Strategy (MBSOS), to mitigate modality confusion and refine cross-modality distances using identity centers and bridge samples, respectively, within a hyperspherical feature space. The approach is validated on RegDB, SYSU-MM01, and LLCM, showing consistent improvements over state-of-the-art VI-ReID methods under mixed-modality testing. The proposed paradigm and methods offer a practical pathway to robust cross-modality retrieval in real-world surveillance, with potential for deployment and further refinement in mixed-modality settings.

Abstract

Current visible-infrared cross-modality person re-identification research has only focused on exploring the bi-modality mutual retrieval paradigm, and we propose a new and more practical mix-modality retrieval paradigm. Existing Visible-Infrared person re-identification (VI-ReID) methods have achieved some results in the bi-modality mutual retrieval paradigm by learning the correspondence between visible and infrared modalities. However, significant performance degradation occurs due to the modality confusion problem when these methods are applied to the new mix-modality paradigm. Therefore, this paper proposes a Mix-Modality person re-identification (MM-ReID) task, explores the influence of modality mixing ratio on performance, and constructs mix-modality test sets for existing datasets according to the new mix-modality testing paradigm. To solve the modality confusion problem in MM-ReID, we propose a Cross-Identity Discrimination Harmonization Loss (CIDHL) adjusting the distribution of samples in the hyperspherical feature space, pulling the centers of samples with the same identity closer, and pushing away the centers of samples with different identities while aggregating samples with the same modality and the same identity. Furthermore, we propose a Modality Bridge Similarity Optimization Strategy (MBSOS) to optimize the cross-modality similarity between the query and queried samples with the help of the similar bridge sample in the gallery. Extensive experiments demonstrate that compared to the original performance of existing cross-modality methods on MM-ReID, the addition of our CIDHL and MBSOS demonstrates a general improvement.

Mix-Modality Person Re-Identification: A New and Practical Paradigm

TL;DR

Abstract

Mix-Modality Person Re-Identification: A New and Practical Paradigm

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)