Domain Adaptation for Multi-label Image Classification: a Discriminator-free Approach
Inder Pal Singh, Enjie Ghorbel, Anis Kacem, Djamila Aouada
TL;DR
This work tackles unsupervised domain adaptation for multi-label image classification by removing the need for a separate domain discriminator. It reuses the task classifier as an implicit adversarial critic and models the distribution of classifier outputs with a two-component Gaussian Mixture Model, using the Fréchet distance between source and target components as the domain discrepancy. To enable end-to-end learning, it introduces DeepEM, a differentiable EM-inspired block that estimates GMM parameters in a single forward pass, eliminating iterative EM costs. Across diverse domain shifts, the approach achieves state-of-the-art mean Average Precision with fewer parameters and reduced training time, and the authors release the code for practitioners.
Abstract
This paper introduces a discriminator-free adversarial-based approach termed DDA-MLIC for Unsupervised Domain Adaptation (UDA) in the context of Multi-Label Image Classification (MLIC). While recent efforts have explored adversarial-based UDA methods for MLIC, they typically include an additional discriminator subnet. Nevertheless, decoupling the classification and the discrimination tasks may harm their task-specific discriminative power. Herein, we address this challenge by presenting a novel adversarial critic directly derived from the task-specific classifier. Specifically, we employ a two-component Gaussian Mixture Model (GMM) to model both source and target predictions, distinguishing between two distinct clusters. Instead of using the traditional Expectation Maximization (EM) algorithm, our approach utilizes a Deep Neural Network (DNN) to estimate the parameters of each GMM component. Subsequently, the source and target GMM parameters are leveraged to formulate an adversarial loss using the Fréchet distance. The proposed framework is therefore not only fully differentiable but is also cost-effective as it avoids the expensive iterative process usually induced by the standard EM method. The proposed method is evaluated on several multi-label image datasets covering three different types of domain shift. The obtained results demonstrate that DDA-MLIC outperforms existing state-of-the-art methods in terms of precision while requiring a lower number of parameters. The code is made publicly available at github.com/cvi2snt/DDA-MLIC.
