The Boundaries of Fair AI in Medical Image Prognosis: A Causal Perspective
Thai-Hoang Pham, Jiayuan Chen, Seungyeon Lee, Yuanlong Wang, Sayoko Moroi, Xueru Zhang, Ping Zhang
TL;DR
This work tackles fairness in medical-image prognosis by introducing FairTTE, a causal framework for analyzing sources of bias in time-to-event predictions. It presents a unified SCM-based setup, decomposes bias into identifiable components, and benchmarks three TTE models across three public datasets with multiple sensitive attributes. Across extensive experiments on over 20,000 models and five fairness methods, it documents pervasive bias and limited mitigation, with fairness tied to distribution shifts and invariant pathway considerations. The findings underscore the need for robust, holistic fairness approaches in prognostic imaging, beyond transferring insights from diagnostic tasks.
Abstract
As machine learning (ML) algorithms are increasingly used in medical image analysis, concerns have emerged about their potential biases against certain social groups. Although many approaches have been proposed to ensure the fairness of ML models, most existing works focus only on medical image diagnosis tasks, such as image classification and segmentation, and overlooked prognosis scenarios, which involve predicting the likely outcome or progression of a medical condition over time. To address this gap, we introduce FairTTE, the first comprehensive framework for assessing fairness in time-to-event (TTE) prediction in medical imaging. FairTTE encompasses a diverse range of imaging modalities and TTE outcomes, integrating cutting-edge TTE prediction and fairness algorithms to enable systematic and fine-grained analysis of fairness in medical image prognosis. Leveraging causal analysis techniques, FairTTE uncovers and quantifies distinct sources of bias embedded within medical imaging datasets. Our large-scale evaluation reveals that bias is pervasive across different imaging modalities and that current fairness methods offer limited mitigation. We further demonstrate a strong association between underlying bias sources and model disparities, emphasizing the need for holistic approaches that target all forms of bias. Notably, we find that fairness becomes increasingly difficult to maintain under distribution shifts, underscoring the limitations of existing solutions and the pressing need for more robust, equitable prognostic models.
