What really matters for person re-identification? A Mixture-of-Experts Framework for Semantic Attribute Importance
Athena Psalta, Vasileios Tsironis, Konstantinos Karantzalos
TL;DR
<p>We address the interpretability gap in person re-identification by introducing MoSAIC-ReID, a Mixture-of-Experts framework where attribute-specific LoRA experts are gated by an oracle router to enable controlled, attribute-wise attribution. Integrated into a CLIP-based Transformer backbone as a residual MoE in the last layers, MoSAIC-ReID permits principled measurement of each semantic attribute's impact on re-ID accuracy while preserving core model capacity. Through GLM, RF-based permutation importance, SHAP values, and hypothesis testing on Market-1501 and DukeMTMC with rich attribute annotations, we find that clothing color—especially lower-body color—and intrinsic attributes are the most influential cues, whereas rare accessories contribute less. The framework provides a rigorous, interpretable methodology for incorporating explicit semantic knowledge into ReID and offers practical guidance for deploying attribute-informed systems across transformer-based backbones.
Abstract
State-of-the-art person re-identification methods achieve impressive accuracy but remain largely opaque, leaving open the question: which high-level semantic attributes do these models actually rely on? We propose MoSAIC-ReID, a Mixture-of-Experts framework that systematically quantifies the importance of pedestrian attributes for re-identification. Our approach uses LoRA-based experts, each linked to a single attribute, and an oracle router that enables controlled attribution analysis. While MoSAIC-ReID achieves competitive performance on Market-1501 and DukeMTMC under the assumption that attribute annotations are available at test time, its primary value lies in providing a large-scale, quantitative study of attribute importance across intrinsic and extrinsic cues. Using generalized linear models, statistical tests, and feature-importance analyses, we reveal which attributes, such as clothing colors and intrinsic characteristics, contribute most strongly, while infrequent cues (e.g. accessories) have limited effect. This work offers a principled framework for interpretable ReID and highlights the requirements for integrating explicit semantic knowledge in practice. Code is available at https://github.com/psaltaath/MoSAIC-ReID
