Lighter, Better, Faster Multi-Source Domain Adaptation with Gaussian Mixture Models and Optimal Transport
Eduardo Fernandes Montesuma, Fred Ngolè Mboula, Antoine Souloumiac
TL;DR
The paper tackles Multi-Source Domain Adaptation under distribution shift by proposing an optimal-transport-based framework built on Gaussian Mixture Models. It introduces two strategies, GMM-Wasserstein Barycenter Transport (GMM-WBT) and GMM-DaDiL, alongside a parametric approach to compute mixture-Wasserstein barycenters and a supervised variant that incorporates class labels. The core contributions include a first-order mapping of GMM components under MW2, a supervised mixture-Wasserstein distance, and efficient barycenter algorithms tailored for GMMs, enabling lighter and faster MSDA. Empirical results on four benchmarks show that the proposed methods outperform or match prior art while reducing parameter counts and computation time, highlighting the approach's scalability and practical impact for domain adaptation tasks.
Abstract
In this paper, we tackle Multi-Source Domain Adaptation (MSDA), a task in transfer learning where one adapts multiple heterogeneous, labeled source probability measures towards a different, unlabeled target measure. We propose a novel framework for MSDA, based on Optimal Transport (OT) and Gaussian Mixture Models (GMMs). Our framework has two key advantages. First, OT between GMMs can be solved efficiently via linear programming. Second, it provides a convenient model for supervised learning, especially classification, as components in the GMM can be associated with existing classes. Based on the GMM-OT problem, we propose a novel technique for calculating barycenters of GMMs. Based on this novel algorithm, we propose two new strategies for MSDA: GMM-Wasserstein Barycenter Transport (WBT) and GMM-Dataset Dictionary Learning (DaDiL). We empirically evaluate our proposed methods on four benchmarks in image classification and fault diagnosis, showing that we improve over the prior art while being faster and involving fewer parameters. Our code is publicly available at https://github.com/eddardd/gmm_msda
