Learning Unified Distance Metric Across Diverse Data Distributions with Parameter-Efficient Transfer Learning
Sungyeon Kim, Donghyun Kim, Suha Kwak
TL;DR
This work addresses the problem of learning a unified distance metric across multiple heterogeneous data distributions, a scenario common in real world applications. It introduces PUMA, a parameter efficient approach that freezes a pre trained Vision Transformer and adds stochastic adapters plus a conditional prompt pool to capture both shared and dataset specific information without bias toward dominant distributions. The authors certify their method on a new eight dataset UML benchmark, showing superior universal accuracy and harmonic mean performance while using up to 69x fewer trainable parameters than dataset specific or naive unified baselines. The results indicate that parameter efficient transfer learning can achieve scalable, cross distribution retrieval with strong performance, enabling a single model deployment across diverse data regimes.
Abstract
A common practice in metric learning is to train and test an embedding model for each dataset. This dataset-specific approach fails to simulate real-world scenarios that involve multiple heterogeneous distributions of data. In this regard, we explore a new metric learning paradigm, called Unified Metric Learning (UML), which learns a unified distance metric capable of capturing relations across multiple data distributions. UML presents new challenges, such as imbalanced data distribution and bias towards dominant distributions. These issues cause standard metric learning methods to fail in learning a unified metric. To address these challenges, we propose Parameter-efficient Unified Metric leArning (PUMA), which consists of a pre-trained frozen model and two additional modules, stochastic adapter and prompt pool. These modules enable to capture dataset-specific knowledge while avoiding bias towards dominant distributions. Additionally, we compile a new unified metric learning benchmark with a total of 8 different datasets. PUMA outperforms the state-of-the-art dataset-specific models while using about 69 times fewer trainable parameters.
