MIMM-X: Disentangling Spurious Correlations for Medical Image Analysis
Louisa Fay, Hajer Reguigui, Bin Yang, Sergios Gatidis, Thomas Küstner
TL;DR
The paper tackles shortcut learning in medical imaging caused by dataset heterogeneity by introducing MIMM-X, a framework that disentangles a primary task from multiple spurious correlations via mutual information minimization. It extends prior MI-based disentanglement with a Confounder Attention Weighter and GradNorm-inspired dynamic loss scaling to handle several spurions simultaneously. Validation across brain MRI (NAKO/UKB) and chest X-ray (CheXpert) demonstrates improved generalization under induced and natural distribution shifts and effective disentanglement of causal and spurious features. This work advances causal representation learning in clinical imaging, offering a scalable approach to robust, fair predictions without altering data distributions.
Abstract
Deep learning models can excel on medical tasks, yet often experience spurious correlations, known as shortcut learning, leading to poor generalization in new environments. Particularly in medical imaging, where multiple spurious correlations can coexist, misclassifications can have severe consequences. We propose MIMM-X, a framework that disentangles causal features from multiple spurious correlations by minimizing their mutual information. It enables predictions based on true underlying causal relationships rather than dataset-specific shortcuts. We evaluate MIMM-X on three datasets (UK Biobank, NAKO, CheXpert) across two imaging modalities (MRI and X-ray). Results demonstrate that MIMM-X effectively mitigates shortcut learning of multiple spurious correlations.
