Multi-label Image Classification using Adaptive Graph Convolutional Networks: from a Single Domain to Multiple Domains
Indel Pal Singh, Enjie Ghorbel, Oyebade Oyedotun, Djamila Aouada
TL;DR
This work introduces Multi-Label Adaptive Graph Convolutional Network (ML-AGCN), which replaces heuristically predefined label graphs with end-to-end learned adjacency matrices that capture both edge importance (attention-based) and preserved feature similarity (cosine-based). The approach yields improved multi-label image classification (single-domain) and extends to unsupervised domain adaptation (DA-AGCN) via adversarial training to align source and target domains. Empirical results on MS-COCO, VG-500, and VOC demonstrate competitive mAP with substantially smaller model sizes, and the DA extension shows strong gains across aerial and cross-domain benchmarks, often outperforming state-of-the-art baselines. The combination of adaptive graph topology learning and domain-alignment mechanisms provides a compact yet effective framework for MLIC across domains, with the potential for open-set extensions in future work.
Abstract
This paper proposes an adaptive graph-based approach for multi-label image classification. Graph-based methods have been largely exploited in the field of multi-label classification, given their ability to model label correlations. Specifically, their effectiveness has been proven not only when considering a single domain but also when taking into account multiple domains. However, the topology of the used graph is not optimal as it is pre-defined heuristically. In addition, consecutive Graph Convolutional Network (GCN) aggregations tend to destroy the feature similarity. To overcome these issues, an architecture for learning the graph connectivity in an end-to-end fashion is introduced. This is done by integrating an attention-based mechanism and a similarity-preserving strategy. The proposed framework is then extended to multiple domains using an adversarial training scheme. Numerous experiments are reported on well-known single-domain and multi-domain benchmarks. The results demonstrate that our approach achieves competitive results in terms of mean Average Precision (mAP) and model size as compared to the state-of-the-art. The code will be made publicly available.
