Small but Fair! Fairness for Multimodal Human-Human and Robot-Human Mental Wellbeing Coaching
Jiaee Cheong, Micol Spitale, Hatice Gunes
TL;DR
The paper tackles fairness in ML for small multimodal datasets in wellbeing coaching, extending fairness analysis to human-human and robot-human interactions. It introduces MixFeat, a Mixup-based pre-processing data augmentation method, to counteract biases across gender and race while leveraging high-level, multimodal features. Through comprehensive experiments on AFAR-BSFT, AFAR-RC22, and AFAR-RC23, the authors show that high-level features and multimodal fusion generally improve both predictive accuracy and fairness, and that MixFeat can outperform standard oversampling baselines. The work provides practical recommendations for researchers in affective computing and HRI to adopt fairness-aware ML practices, especially when data are scarce, and emphasizes ethical considerations for deploying robot-assisted wellbeing coaching.
Abstract
In recent years, the affective computing (AC) and human-robot interaction (HRI) research communities have put fairness at the centre of their research agenda. However, none of the existing work has addressed the problem of machine learning (ML) bias in HRI settings. In addition, many of the current datasets for AC and HRI are "small", making ML bias and debias analysis challenging. This paper presents the first work to explore ML bias analysis and mitigation of three small multimodal datasets collected within both a human-human and robot-human wellbeing coaching settings. The contributions of this work includes: i) being the first to explore the problem of ML bias and fairness within HRI settings; and ii) providing a multimodal analysis evaluated via modelling performance and fairness metrics across both high and low-level features and proposing a simple and effective data augmentation strategy (MixFeat) to debias the small datasets presented within this paper; and iii) conducting extensive experimentation and analyses to reveal ML fairness insights unique to AC and HRI research in order to distill a set of recommendations to aid AC and HRI researchers to be more engaged with fairness-aware ML-based research.
