Towards Fair Medical AI: Adversarial Debiasing of 3D CT Foundation Embeddings

Guangyao Zheng; Michael A. Jacobs; Vladimir Braverman; Vishwa S. Parekh

Towards Fair Medical AI: Adversarial Debiasing of 3D CT Foundation Embeddings

Guangyao Zheng, Michael A. Jacobs, Vladimir Braverman, Vishwa S. Parekh

TL;DR

The paper tackles demographic leakage in self-supervised 3D CT foundation embeddings (dimension $d=1408$) by proposing a VAE-based adversarial debiasing framework that maps embeddings to a demographic-free latent space of dimension $k=500$, preserving downstream lung cancer risk prediction. It introduces an encoder–decoder VAE with an adversarial head for age and sex, trained to minimize reconstruction and KL loss while thwarting demographic inference. Empirical results show the demographic signals become largely unrecoverable (e.g., sex predictor accuracy and AUC drop substantially) with negligible loss to 1-year and 2-year cancer prediction performance, and with improved fairness via lower EOD. The approach also demonstrates robustness to data-poisoning attacks targeting demographic groups, underscoring its potential for fairer and more secure deployment of 3D CT foundation models in clinical decision-making.$

Abstract

Self-supervised learning has revolutionized medical imaging by enabling efficient and generalizable feature extraction from large-scale unlabeled datasets. Recently, self-supervised foundation models have been extended to three-dimensional (3D) computed tomography (CT) data, generating compact, information-rich embeddings with 1408 features that achieve state-of-the-art performance on downstream tasks such as intracranial hemorrhage detection and lung cancer risk forecasting. However, these embeddings have been shown to encode demographic information, such as age, sex, and race, which poses a significant risk to the fairness of clinical applications. In this work, we propose a Variation Autoencoder (VAE) based adversarial debiasing framework to transform these embeddings into a new latent space where demographic information is no longer encoded, while maintaining the performance of critical downstream tasks. We validated our approach on the NLST lung cancer screening dataset, demonstrating that the debiased embeddings effectively eliminate multiple encoded demographic information and improve fairness without compromising predictive accuracy for lung cancer risk at 1-year and 2-year intervals. Additionally, our approach ensures the embeddings are robust against adversarial bias attacks. These results highlight the potential of adversarial debiasing techniques to ensure fairness and equity in clinical applications of self-supervised 3D CT embeddings, paving the way for their broader adoption in unbiased medical decision-making.

Towards Fair Medical AI: Adversarial Debiasing of 3D CT Foundation Embeddings

TL;DR

Abstract

Towards Fair Medical AI: Adversarial Debiasing of 3D CT Foundation Embeddings

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)