Table of Contents
Fetching ...

Impact of domain adaptation in deep learning for medical image classifications

Yihang Wu, Ahmad Chaddad

TL;DR

DA addresses domain shifts in medical image classification by aligning source and target distributions in a shared feature space to improve $p(Y|X)$ on the target when labels are scarce. The study simulates 10 deep CNNs with common DA techniques across four public datasets (SC HAM10000, BT MRI, CC Chest X-ray, MC multi-modality) and evaluates across noise, federated learning, interpretability, and calibration tasks. Key findings include about $4.7\%$ BT gains with ResNet34, about $3\%$ gains under Gaussian noise, and $2\%$–$3\%$ improvements on MC, plus improved interpretability (GradCAM++) and calibration (ECE reductions around 2–7%), while some cases show negative transfer or limited FL gains. These results suggest that DA is beneficial for MRI/CT brain-tumor and multi-modality tasks but requires dataset- and architecture-specific tuning, particularly for imbalanced data and federated settings.

Abstract

Domain adaptation (DA) is a quickly expanding area in machine learning that involves adjusting a model trained in one domain to perform well in another domain. While there have been notable progressions, the fundamental concept of numerous DA methodologies has persisted: aligning the data from various domains into a shared feature space. In this space, knowledge acquired from labeled source data can improve the model training on target data that lacks sufficient labels. In this study, we demonstrate the use of 10 deep learning models to simulate common DA techniques and explore their application in four medical image datasets. We have considered various situations such as multi-modality, noisy data, federated learning (FL), interpretability analysis, and classifier calibration. The experimental results indicate that using DA with ResNet34 in a brain tumor (BT) data set results in an enhancement of 4.7\% in model performance. Similarly, the use of DA can reduce the impact of Gaussian noise, as it provides $\sim 3\%$ accuracy increase using ResNet34 on a BT dataset. Furthermore, simply introducing DA into FL framework shows limited potential (e.g., $\sim 0.3\%$ increase in performance) for skin cancer classification. In addition, the DA method can improve the interpretability of the models using the gradcam++ technique, which offers clinical values. Calibration analysis also demonstrates that using DA provides a lower expected calibration error (ECE) value $\sim 2\%$ compared to CNN alone on a multi-modality dataset.

Impact of domain adaptation in deep learning for medical image classifications

TL;DR

DA addresses domain shifts in medical image classification by aligning source and target distributions in a shared feature space to improve on the target when labels are scarce. The study simulates 10 deep CNNs with common DA techniques across four public datasets (SC HAM10000, BT MRI, CC Chest X-ray, MC multi-modality) and evaluates across noise, federated learning, interpretability, and calibration tasks. Key findings include about BT gains with ResNet34, about gains under Gaussian noise, and improvements on MC, plus improved interpretability (GradCAM++) and calibration (ECE reductions around 2–7%), while some cases show negative transfer or limited FL gains. These results suggest that DA is beneficial for MRI/CT brain-tumor and multi-modality tasks but requires dataset- and architecture-specific tuning, particularly for imbalanced data and federated settings.

Abstract

Domain adaptation (DA) is a quickly expanding area in machine learning that involves adjusting a model trained in one domain to perform well in another domain. While there have been notable progressions, the fundamental concept of numerous DA methodologies has persisted: aligning the data from various domains into a shared feature space. In this space, knowledge acquired from labeled source data can improve the model training on target data that lacks sufficient labels. In this study, we demonstrate the use of 10 deep learning models to simulate common DA techniques and explore their application in four medical image datasets. We have considered various situations such as multi-modality, noisy data, federated learning (FL), interpretability analysis, and classifier calibration. The experimental results indicate that using DA with ResNet34 in a brain tumor (BT) data set results in an enhancement of 4.7\% in model performance. Similarly, the use of DA can reduce the impact of Gaussian noise, as it provides accuracy increase using ResNet34 on a BT dataset. Furthermore, simply introducing DA into FL framework shows limited potential (e.g., increase in performance) for skin cancer classification. In addition, the DA method can improve the interpretability of the models using the gradcam++ technique, which offers clinical values. Calibration analysis also demonstrates that using DA provides a lower expected calibration error (ECE) value compared to CNN alone on a multi-modality dataset.
Paper Structure (9 sections, 2 equations, 7 figures, 3 tables)

This paper contains 9 sections, 2 equations, 7 figures, 3 tables.

Figures (7)

  • Figure 1: Pipeline of our framework. The model is trained using source data, and the weights of the model are shared to test on the target data. The adaption loss is calculated using the bottleneck layer (256 dimension).
  • Figure 2: The visualizations of the learned representations using t-SNE for brain tumor, skin cancer and chest cancer. Red points are source samples and blue are target samples. We use ResNet34 as neural network backbone.
  • Figure 3: ROC curves with corresponding AUC values in brain tumor, skin cancer and chest cancer datasets. TPR and FPR represent true positive rate and false positive rate, respectively.
  • Figure 4: Test accuracy (AVG $\pm$ STD) of 10 deep models on BT dataset (noisy training set).
  • Figure 5: Reliability diagram with ECE value of 10 deep models using DA and without DA on Brain Tumor dataset.
  • ...and 2 more figures