VGS-ATD: Robust Distributed Learning for Multi-Label Medical Image Classification Under Heterogeneous and Imbalanced Conditions

Zehui Zhao; Laith Alzubaidi; Haider A. Alwzwazy; Jinglan Zhang; Yuantong Gu

TL;DR

VGS-ATD introduces a privacy-preserving, distributed learning framework for robust, scalable multi-label medical image classification under data heterogeneity and imbalance. By combining AI-To-Data with Vision Transformers and a one-time backbone aggregation, it enables horizontal, vertical, and hierarchical configurations that support flexible, incremental node addition without full retraining. Across 30 datasets and 80 classes, VGS-ATD consistently outperforms centralized, federated, and swarm baselines in accuracy while substantially reducing training time and computational cost. Its hierarchical ATD-in-ATD design also makes it resilient to catastrophic forgetting. The work demonstrates practical potential for real-world, privacy-preserving medical AI systems capable of continuous learning in dynamic clinical environments.

Abstract

In recent years, advanced deep learning architectures have shown strong performance in medical imaging tasks. However, the traditional centralized learning paradigm poses serious privacy risks because all data is collected on a single server for training. To mitigate this risk, decentralized approaches such as federated learning and swarm learning have emerged, allowing models to be trained on local nodes while sharing only model weights. While these methods enhance privacy, they struggle with heterogeneous and imbalanced data and suffer from inefficiencies due to frequent communication and weight aggregation. More critically, the dynamic and complex nature of clinical environments demands scalable AI systems capable of continuously learning from diverse modalities and multiple labels. Yet both centralized and decentralized models are prone to catastrophic forgetting during system expansion, often requiring full model retraining to incorporate new data. To address these limitations, we propose VGS-ATD, a novel distributed learning framework. In experiments spanning 30 datasets and 80 independent labels across distributed nodes, VGS-ATD achieved an overall accuracy of 92.7%, outperforming centralized learning (84.9%) and swarm learning (72.99%), while federated learning failed under these conditions because of its high computational resource requirements. VGS-ATD also demonstrated strong scalability, with only a 1% drop in accuracy on existing nodes after expansion, compared to a 20% drop in centralized learning, highlighting its resilience to catastrophic forgetting. Additionally, it reduced computational costs by up to 50% relative to both centralized and swarm learning, confirming its superior efficiency and scalability.
