MixBCT: Towards Self-Adapting Backward-Compatible Training
Yu Liang, Yufeng Zhang, Shiliang Zhang, Yaowei Wang, Sheng Xiao, Rong Xiao, Xiaoyu Wang
TL;DR
MixBCT tackles backward-compatible training for large-scale retrieval by enabling a new model to leverage distribution information from old features without updating the old gallery. It introduces a simple, unified approach that mixes old and new features and trains the new classifier on these mixed representations, with an adaptive constraint informed by old feature dispersion. The method uses a single loss term, optionally denoises old features to form a more robust mixed representation, and demonstrates strong gains over prior prototype-based and instance-based BCT methods on MS1Mv3 and IJB-C across open-set and cross-model scenarios. This yields a practical, backfill-free deployment path that remains effective across varying old-model qualities and data-split configurations.
Abstract
Backward-compatible training circumvents the need for expensive updates to the old gallery database when deploying an advanced new model in the retrieval system. Previous methods achieved backward compatibility by aligning prototypes of the new model with the old one, yet they often overlooked the distribution of old features, limiting their effectiveness when the low quality of the old model results in a weakly feature discriminability. Instance-based methods like L2 regression take into account the distribution of old features but impose strong constraints on the performance of the new model itself. In this paper, we propose MixBCT, a simple yet highly effective backward-compatible training method that serves as a unified framework for old models of varying qualities. We construct a single loss function applied to mixed old and new features to facilitate backward-compatible training, which adaptively adjusts the constraint domain for new features based on the distribution of old features. We conducted extensive experiments on the large-scale face recognition datasets MS1Mv3 and IJB-C to verify the effectiveness of our method. The experimental results clearly demonstrate its superiority over previous methods. Code is available at https://github.com/yuleung/MixBCT .
