Metric Compatible Training for Online Backfilling in Large-Scale Retrieval
Seonguk Seo, Mustafa Gokhan Uzunbas, Bohyung Han, Sara Cao, Ser-Nam Lim
TL;DR
The paper addresses the high cost and downtime of re-indexing gallery embeddings after large-scale image retrieval model upgrades. It introduces online backfilling via distance rank merge, a reverse query transform for single-pass inference, and metric-compatible contrastive learning to calibrate distances across old and new embedding spaces, with an optional learnable new embedding. The approach achieves monotonic performance gains during backfilling and preserves or surpasses final offline performance across four benchmarks, outperforming prior backward-compatible and online backfilling methods. The work offers a practical, low-overhead solution for rapid, high-quality model upgrades in real-world retrieval systems, with strong robustness to homogeneous, heterogeneous, and open-class settings.
Abstract
Backfilling is the process of re-extracting all gallery embeddings from upgraded models in image retrieval systems. It inevitably requires a prohibitively large amount of computational cost and even entails the downtime of the service. Although backward-compatible learning sidesteps this challenge by tackling query-side representations, this leads to suboptimal solutions in principle because gallery embeddings cannot benefit from model upgrades. We address this dilemma by introducing an online backfilling algorithm, which enables us to achieve a progressive performance improvement during the backfilling process while not sacrificing the final performance of new model after the completion of backfilling. To this end, we first propose a simple distance rank merge technique for online backfilling. Then, we incorporate a reverse transformation module for more effective and efficient merging, which is further enhanced by adopting a metric-compatible contrastive learning approach. These two components help to make the distances of old and new models compatible, resulting in desirable merge results during backfilling with no extra computational overhead. Extensive experiments show the effectiveness of our framework on four standard benchmarks in various settings.
