Post-hoc Orthogonalization for Mitigation of Protected Feature Bias in CXR Embeddings
Tobias Weber, Michael Ingrisch, Bernd Bischl, David Rügamer
TL;DR
The paper investigates bias in chest X-ray embeddings by protected features (age, sex, race) and introduces a post-hoc orthogonalization method to remove their influence. By projecting embeddings onto the space orthogonal to protected attributes, the authors produce $ ilde{E}$ that no longer encodes information about $X$, and they demonstrate this on MIMIC and CheXpert using three pre-trained embeddings. Results show that protected features significantly affect pathology predictions without orthogonalization, but after orthogonalization the protected-information predictability drops to near random and subgroup disparities diminish, with minimal to moderate impact on downstream pathology performance. This approach is model-agnostic and post-hoc, offering a practical fairness tool for medical imaging embeddings, while highlighting limitations such as linearity assumptions and residual disparities that motivate future work with nonlinear extensions and semi-structured models.
Abstract
Purpose: To analyze and remove protected feature effects in chest radiograph embeddings of deep learning models. Methods: An orthogonalization is utilized to remove the influence of protected features (e.g., age, sex, race) in CXR embeddings, ensuring feature-independent results. To validate the efficacy of the approach, we retrospectively study the MIMIC and CheXpert datasets using three pre-trained models, namely a supervised contrastive, a self-supervised contrastive, and a baseline classifier model. Our statistical analysis involves comparing the original versus the orthogonalized embeddings by estimating protected feature influences and evaluating the ability to predict race, age, or sex using the two types of embeddings. Results: Our experiments reveal a significant influence of protected features on predictions of pathologies. Applying orthogonalization removes these feature effects. Apart from removing any influence on pathology classification, while maintaining competitive predictive performance, orthogonalized embeddings further make it infeasible to directly predict protected attributes and mitigate subgroup disparities. Conclusion: The presented work demonstrates the successful application and evaluation of the orthogonalization technique in the domain of chest X-ray image classification.
