Fairness-Aware Multi-Group Target Detection in Online Discussion
Soumyajit Gupta, Maria De-Arteaga, Matthew Lease
TL;DR
This work addresses target-group detection in online discourse as a multi-label task where posts can target multiple demographic groups, and fairness across groups is crucial. It introduces Accuracy Parity (AP) as the fairness objective and proposes GAP and its multi-target extension GAP_{multi} to directly optimize balanced group performance while preserving overall accuracy. Theoretical results show EO and AP cannot be satisfied simultaneously under unequal base rates, motivating AP as a more appropriate fairness criterion for this setting. Empirically, GAP_{multi} yields improved fairness (smaller pairwise BA disparities and higher macro BA) with competitive utility across two large datasets (MHS and HateXplain), and the approach scales with parallel GPU computation. The work provides a practical, reproducible framework for fair target-group detection in toxic content analysis and related moderation tasks, with ethical considerations and deployment guidance.
Abstract
Target-group detection is the task of detecting which group(s) a piece of content is ``directed at or about''. Applications include targeted marketing, content recommendation, and group-specific content assessment. Key challenges include: 1) that a single post may target multiple groups; and 2) ensuring consistent detection accuracy across groups for fairness. In this work, we investigate fairness implications of target-group detection in the context of toxicity detection, where the perceived harm of a social media post often depends on which group(s) it targets. Because toxicity is highly contextual, language that appears benign in general can be harmful when targeting specific demographic groups. We show our {\em fairness-aware multi-group target detection} approach both reduces bias across groups and shows strong predictive performance, surpassing existing fairness-aware baselines. To enable reproducibility and spur future work, we share our code online.
