Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures
Akhila Yerukola, Saadia Gabriel, Nanyun Peng, Maarten Sap
TL;DR
This work introduces MC-SIGNS, a cross-cultural gesture dataset with 288 gesture-country pairs across 25 gestures and 85 countries, annotated for offensiveness, cultural significance, and context. It systematically evaluates AI systems across text-to-image, language, and vision-language modalities, revealing pervasive US-centric biases, over-flagging by LLMs, and context-sensitive safety gaps that worsen with scene descriptions. Implicit-meaning analyses show models default to US interpretations even for universal concepts, underscoring the need for culturally aware safety frameworks and regionally informed data. The authors release MC-SIGNS and accompanying code to propel research on inclusive, culturally safe AI deployment in global applications.
Abstract
Gestures are an integral part of non-verbal communication, with meanings that vary across cultures, and misinterpretations that can have serious social and diplomatic consequences. As AI systems become more integrated into global applications, ensuring they do not inadvertently perpetuate cultural offenses is critical. To this end, we introduce Multi-Cultural Set of Inappropriate Gestures and Nonverbal Signs (MC-SIGNS), a dataset of 288 gesture-country pairs annotated for offensiveness, cultural significance, and contextual factors across 25 gestures and 85 countries. Through systematic evaluation using MC-SIGNS, we uncover critical limitations: text-to-image (T2I) systems exhibit strong US-centric biases, performing better at detecting offensive gestures in US contexts than in non-US ones; large language models (LLMs) tend to over-flag gestures as offensive; and vision-language models (VLMs) default to US-based interpretations when responding to universal concepts like wishing someone luck, frequently suggesting culturally inappropriate gestures. These findings highlight the urgent need for culturally-aware AI safety mechanisms to ensure equitable global deployment of AI technologies.
