Disentangling Dialect from Social Bias via Multitask Learning to Improve Fairness
Maximilian Spliethöver, Sai Nikhil Menon, Henning Wachsmuth
TL;DR
This work tackles dialect-induced unfairness in social bias detection by introducing a dialect-aware multitask framework. A shared encoder jointly learns five bias aspects and an auxiliary dialect task, trained with a round-robin cross-entropy objective over $k+1$ tasks and augmented by dialect labels attached to SBIC via a dialect classifier. Empirical results show state-of-the-art bias-detection performance and reduced dialect-based disparities, particularly for African-American English, with notable gains in equalized odds and predictive parity. The combination of multitask learning and dialect modeling improves both accuracy and fairness, and the approach can be extended to additional dialects and data-augmentation strategies to broaden applicability. Overall, the study provides evidence that encoding dialect patterns into models enhances fairness and robustness in bias detection across dialectal text.
Abstract
Dialects introduce syntactic and lexical variations in language that occur in regional or social groups. Most NLP methods are not sensitive to such variations. This may lead to unfair behavior of the methods, conveying negative bias towards dialect speakers. While previous work has studied dialect-related fairness for aspects like hate speech, other aspects of biased language, such as lewdness, remain fully unexplored. To fill this gap, we investigate performance disparities between dialects in the detection of five aspects of biased language and how to mitigate them. To alleviate bias, we present a multitask learning approach that models dialect language as an auxiliary task to incorporate syntactic and lexical variations. In our experiments with African-American English dialect, we provide empirical evidence that complementing common learning approaches with dialect modeling improves their fairness. Furthermore, the results suggest that multitask learning achieves state-of-the-art performance and helps to detect properties of biased language more reliably.
