Challenging Negative Gender Stereotypes: A Study on the Effectiveness of Automated Counter-Stereotypes
Isar Nejadgholi, Kathleen C. Fraser, Anna Kerkhof, Svetlana Kiritchenko
TL;DR
Gender stereotypes persist in online discourse and can constrain opportunities and perceptions. The authors generate counter-stereotypes automatically using eleven strategies via ChatGPT and evaluate them with a 75-participant annotation study, yielding 185 usable counter-statements across ten strategies. The study identifies counter-facts and broadening universals as the most robust approaches, while humour and several other strategies tend to be ineffective or offensive, with variation more tied to stereotype target than rater gender. This work provides a publicly available dataset of ratings to guide future design of AI-assisted counter-speech, highlighting the need for audience-aware deployment, ethical considerations, and ongoing evaluation in online interventions.
Abstract
Gender stereotypes are pervasive beliefs about individuals based on their gender that play a significant role in shaping societal attitudes, behaviours, and even opportunities. Recognizing the negative implications of gender stereotypes, particularly in online communications, this study investigates eleven strategies to automatically counter-act and challenge these views. We present AI-generated gender-based counter-stereotypes to (self-identified) male and female study participants and ask them to assess their offensiveness, plausibility, and potential effectiveness. The strategies of counter-facts and broadening universals (i.e., stating that anyone can have a trait regardless of group membership) emerged as the most robust approaches, while humour, perspective-taking, counter-examples, and empathy for the speaker were perceived as less effective. Also, the differences in ratings were more pronounced for stereotypes about the different targets than between the genders of the raters. Alarmingly, many AI-generated counter-stereotypes were perceived as offensive and/or implausible. Our analysis and the collected dataset offer foundational insight into counter-stereotype generation, guiding future efforts to develop strategies that effectively challenge gender stereotypes in online interactions.
