Cultural Learning-Based Culture Adaptation of Language Models
Chen Cecilia Liu, Anna Korhonen, Iryna Gurevych
TL;DR
This work introduces CLCA, a culture-learning-based framework that aligns LLMs with diverse cultural values by simulating culture-adapted social interactions. It combines imitative (dialogue) and instructed (intent) learning to form a multi-task training objective and evaluates alignment using World Values Survey data across five cultures. Results show that CLCA improves culture-level alignment and individual-level accuracy across multiple model families, with social interaction data and intent understanding playing complementary roles. Multilingual transfer indicates the approach generalizes beyond English, though teacher-model quality influences data generation. The study highlights the potential of cultural learning to create more globally inclusive NLP systems while acknowledging biases in synthetic data and the need for real-world human data for further validation.
Abstract
Adapting large language models (LLMs) to diverse cultural values is a challenging task, as existing LLMs often reflect the values of specific groups by default, and potentially causing harm to others. In this paper, we present CLCA, a novel framework for enhancing LLM alignment with cultural values based on cultural learning. The framework leverages simulated social interactions to generate conversations in which LLMs engage in role-playing within culturally adapted social scenarios, capturing implicit cultural norms for model fine-tuning. CLCA improves cultural value alignment across various model architectures measured using World Value Survey data, demonstrating the effectiveness of our proposed approach. Our results provide early evidence that understanding intent and social interactions can enhance cultural value adaptation in LLMs, highlighting the promise of training approaches based on cultural learning.
