Understanding-informed Bias Mitigation for Fair CMR Segmentation
Tiarna Lee, Esther Puyol-Antón, Bram Ruijsink, Pier-Giorgio Masci, Louise Keehn, Phil Chowienczyk, Emily Haseler, Miaojing Shi, Andrew P. King
TL;DR
This study addresses ethnicity bias in AI-based cine CMR segmentation by evaluating multiple bias-mitigation strategies (oversampling, reweighing, Group DRO) and introducing root-cause–informed cropping to remove non-heart features driving bias. It demonstrates that oversampling effectively reduces group disparities and that cropping, especially a cascaded approach, boosts overall accuracy while diminishing bias; combining cropping with oversampling yields further gains. The work includes extensive internal and external validation, showing strong segmentation performance and minimal bias on an external clinical dataset, and argues that fairness-accuracy trade-offs can be avoided with a bias-understanding approach. The findings have practical implications for translating bias-mitigation strategies to clinical CMR analysis, enabling more equitable biomarker assessment and treatment planning.
Abstract
Artificial intelligence (AI) is increasingly being used for medical imaging tasks. However, there can be biases in AI models, particularly when they are trained using imbalanced training datasets. One such example has been the strong ethnicity bias effect in cardiac magnetic resonance (CMR) image segmentation models. Although this phenomenon has been reported in a number of publications, little is known about the effectiveness of bias mitigation algorithms in this domain. We aim to investigate the impact of common bias mitigation methods to address bias between Black and White subjects in AI-based CMR segmentation models. Specifically, we use oversampling, importance reweighing and Group DRO as well as combinations of these techniques to mitigate the ethnicity bias. Second, motivated by recent findings on the root causes of AI-based CMR segmentation bias, we evaluate the same methods using models trained and evaluated on cropped CMR images. We find that bias can be mitigated using oversampling, significantly improving performance for the underrepresented Black subjects whilst not significantly reducing the majority White subjects' performance. Using cropped images increases performance for both ethnicities and reduces the bias, whilst adding oversampling as a bias mitigation technique with cropped images reduces the bias further. When testing the models on an external clinical validation set, we find high segmentation performance and no statistically significant bias.
