CROCODILE: Causality aids RObustness via COntrastive DIsentangled LEarning
Gianluca Carloni, Sotirios A Tsaftaris, Sara Colantonio
TL;DR
This paper tackles domain shift in medical image analysis by proposing CROCODILE, a framework that fuses causal feature disentanglement, dual disease- and domain-prediction branches, contrastive learning, and a prior-knowledge injection to improve robustness across unseen domains. It introduces a Transformer-based architecture that yields disentangled causal ($Q^{ca}$) and spurious ($Q^{sp}$) embeddings, and employs latent causal intervention via backdoor adjustment alongside a Relational Scorer to align cross-branch representations. A task-prior mechanism leverages a causality map over chest X-ray findings to inject medical knowledge and further stabilize learning. Empirically, CROCODILE provides strong OOD generalization and fairness improvements on multi-dataset chest X-ray classification, albeit with some trade-off on in-domain performance, suggesting that causality-driven disentanglement can yield safer, more generalizable medical AI systems. The approach offers a general bias-m mitigation strategy that can extend to other CAD tasks and cross-domain medical imaging applications.
Abstract
Due to domain shift, deep learning image classifiers perform poorly when applied to a domain different from the training one. For instance, a classifier trained on chest X-ray (CXR) images from one hospital may not generalize to images from another hospital due to variations in scanner settings or patient characteristics. In this paper, we introduce our CROCODILE framework, showing how tools from causality can foster a model's robustness to domain shift via feature disentanglement, contrastive learning losses, and the injection of prior knowledge. This way, the model relies less on spurious correlations, learns the mechanism bringing from images to prediction better, and outperforms baselines on out-of-distribution (OOD) data. We apply our method to multi-label lung disease classification from CXRs, utilizing over 750000 images from four datasets. Our bias-mitigation method improves domain generalization and fairness, broadening the applicability and reliability of deep learning models for a safer medical image analysis. Find our code at: https://github.com/gianlucarloni/crocodile.
