Addressing Fairness Issues in Deep Learning-Based Medical Image Analysis: A Systematic Review
Zikang Xu, Jun Li, Qingsong Yao, Han Li, Mingyue Zhao, S. Kevin Zhou
TL;DR
This paper surveys fairness in deep learning–based medical image analysis, outlining group fairness concepts and key metrics such as Demographic Parity, Accuracy Parity, Equalized Odds, and Equal Opportunity, with formal definitions provided for clarity. It shows that fairness research in MedIA bifurcates into fairness evaluation and unfairness mitigation, synthesizing methods across pre-, in-, and post-processing, and catalogs relevant datasets used for benchmarking. The review highlights widespread subgroup disparities across modalities like brain MRI, dermatology, and chest X-ray, driven by attributes such as sex, age, race, and skin tone, and discusses the tension between mathematical fairness and clinical equity, including challenges posed by foundation models. Finally, it calls for cross-disciplinary collaboration among AI researchers, clinicians, ethicists, and policymakers to develop robust, governance-backed strategies that advance equitable MedIA practice.
Abstract
Deep learning algorithms have demonstrated remarkable efficacy in various medical image analysis (MedIA) applications. However, recent research highlights a performance disparity in these algorithms when applied to specific subgroups, such as exhibiting poorer predictive performance in elderly females. Addressing this fairness issue has become a collaborative effort involving AI scientists and clinicians seeking to understand its origins and develop solutions for mitigation within MedIA. In this survey, we thoroughly examine the current advancements in addressing fairness issues in MedIA, focusing on methodological approaches. We introduce the basics of group fairness and subsequently categorize studies on fair MedIA into fairness evaluation and unfairness mitigation. Detailed methods employed in these studies are presented too. Our survey concludes with a discussion of existing challenges and opportunities in establishing a fair MedIA and healthcare system. By offering this comprehensive review, we aim to foster a shared understanding of fairness among AI researchers and clinicians, enhance the development of unfairness mitigation methods, and contribute to the creation of an equitable MedIA society.
