Mitigating Gender Bias in Depression Detection via Counterfactual Inference
Mingxuan Hu, Hongbo Ma, Xinlan Wu, Ziqi Liu, Jiaqi Liu, Yangbin Chen
TL;DR
The paper tackles gender bias in audio-based depression detection caused by imbalanced data. It introduces a counterfactual debiasing framework grounded in causal inference to isolate and remove the direct effect of gender on predictions, leveraging two neural components for gender and acoustic fusion and using counterfactual inference at test time. Empirical results on DAIC-WOZ with two acoustic backbones show reduced bias (lower EA and DI) and improved overall performance, including substantial gains for male depression detection. This approach offers a principled, interpretable method to disentangle gender from true pathological vocal cues, with implications for fairer automated mental health assessment. The authors also outline future multimodal extensions to further mitigate bias across data modalities.
Abstract
Audio-based depression detection models have demonstrated promising performance but often suffer from gender bias due to imbalanced training data. Epidemiological statistics show a higher prevalence of depression in females, leading models to learn spurious correlations between gender and depression. Consequently, models tend to over-diagnose female patients while underperforming on male patients, raising significant fairness concerns. To address this, we propose a novel Counterfactual Debiasing Framework grounded in causal inference. We construct a causal graph to model the decision-making process and identify gender bias as the direct causal effect of gender on the prediction. During inference, we employ counterfactual inference to estimate and subtract this direct effect, ensuring the model relies primarily on authentic acoustic pathological features. Extensive experiments on the DAIC-WOZ dataset using two advanced acoustic backbones demonstrate that our framework not only significantly reduces gender bias but also improves overall detection performance compared to existing debiasing strategies.
