Table of Contents
Fetching ...

Open Challenges on Fairness of Artificial Intelligence in Medical Imaging Applications

Enzo Ferrante, Rodrigo Echeveste

TL;DR

This chapter surveys open challenges in ensuring fairness for AI in medical imaging, highlighting bias sources across data, models, and people. It discusses how biased metrics, leveling-down risks, varying task difficulty, unseen populations, and non-demographic confounders complicate fairness audits, emphasizing that there are no definitive solutions yet. The authors review potential directions such as adversarially reweighted learning, epistemic-uncertainty weighting, counterfactual explanations, and unsupervised audits, while cautioning about their limitations. The work underscores the need for robust, intersectional, and globally informed fairness practices to support reliable, ethically sound deployment of medical imaging AI tools.

Abstract

Recently, the research community of computerized medical imaging has started to discuss and address potential fairness issues that may emerge when developing and deploying AI systems for medical image analysis. This chapter covers some of the pressing challenges encountered when doing research in this area, and it is intended to raise questions and provide food for thought for those aiming to enter this research field. The chapter first discusses various sources of bias, including data collection, model training, and clinical deployment, and their impact on the fairness of machine learning algorithms in medical image computing. We then turn to discussing open challenges that we believe require attention from researchers and practitioners, as well as potential pitfalls of naive application of common methods in the field. We cover a variety of topics including the impact of biased metrics when auditing for fairness, the leveling down effect, task difficulty variations among subgroups, discovering biases in unseen populations, and explaining biases beyond standard demographic attributes.

Open Challenges on Fairness of Artificial Intelligence in Medical Imaging Applications

TL;DR

This chapter surveys open challenges in ensuring fairness for AI in medical imaging, highlighting bias sources across data, models, and people. It discusses how biased metrics, leveling-down risks, varying task difficulty, unseen populations, and non-demographic confounders complicate fairness audits, emphasizing that there are no definitive solutions yet. The authors review potential directions such as adversarially reweighted learning, epistemic-uncertainty weighting, counterfactual explanations, and unsupervised audits, while cautioning about their limitations. The work underscores the need for robust, intersectional, and globally informed fairness practices to support reliable, ethically sound deployment of medical imaging AI tools.

Abstract

Recently, the research community of computerized medical imaging has started to discuss and address potential fairness issues that may emerge when developing and deploying AI systems for medical image analysis. This chapter covers some of the pressing challenges encountered when doing research in this area, and it is intended to raise questions and provide food for thought for those aiming to enter this research field. The chapter first discusses various sources of bias, including data collection, model training, and clinical deployment, and their impact on the fairness of machine learning algorithms in medical image computing. We then turn to discussing open challenges that we believe require attention from researchers and practitioners, as well as potential pitfalls of naive application of common methods in the field. We cover a variety of topics including the impact of biased metrics when auditing for fairness, the leveling down effect, task difficulty variations among subgroups, discovering biases in unseen populations, and explaining biases beyond standard demographic attributes.
Paper Structure (12 sections, 1 figure)

This paper contains 12 sections, 1 figure.

Figures (1)

  • Figure 1: Main potential sources of bias in AI systems for MIC. The data being fed to the system during training (1), design choices for the model (2), and the people who develop those systems (3), may all contribute to biases in AI systems for MIC. From Ref. ricci2022addressing with permission