Exploring connections of spectral analysis and transfer learning in medical imaging
Yucheng Lu, Dovile Juodelyte, Jonathan D. Victor, Veronika Cheplygina
TL;DR
This work examines how transfer learning for medical imaging is influenced by spectral properties of pre-trained models, particularly their learning priorities across frequency bands. By analyzing gradient-based power spectrum density ($\mathrm{PSD}$) and introducing artificial frequency shortcuts, the authors show clear differences between models pre-trained on natural data (ImageNet) versus medical data (RadImageNet), and that these priors persist during fine-tuning. They demonstrate that alignment between a model’s learning priorities and the spectral content of artifacts leads to shortcut learning, and that source data editing can modulate robustness to these shortcuts. The findings suggest spectral-domain interventions in pre-training could improve transfer-learning robustness in medical imaging, while also raising questions about how source data statistics shape learning priors and how to manipulate them safely and effectively.
Abstract
In this paper, we use spectral analysis to investigate transfer learning and study model sensitivity to frequency shortcuts in medical imaging. By analyzing the power spectrum density of both pre-trained and fine-tuned model gradients, as well as artificially generated frequency shortcuts, we observe notable differences in learning priorities between models pre-trained on natural vs medical images, which generally persist during fine-tuning. We find that when a model's learning priority aligns with the power spectrum density of an artifact, it results in overfitting to that artifact. Based on these observations, we show that source data editing can alter the model's resistance to shortcut learning.
