CulFiT: A Fine-grained Cultural-aware LLM Training Paradigm via Multilingual Critique Data Synthesis
Ruixiang Feng, Shen Gao, Xiuying Chen, Lisi Chen, Shuo Shang
TL;DR
CulFiT addresses cultural bias in LLMs by introducing a target-aware, multilingual training paradigm that leverages fine-grained critique data synthesis and a robust reward system. The approach decomposes cultural knowledge into verifiable units, generates multilingual critiques, and uses a two-stage SFT+DPO pipeline to align model outputs with diverse cultural norms. A new multilingual benchmark, GlobalCultureQA, alongside existing cultural benchmarks, demonstrates that CulFiT achieves state-of-the-art open-source performance in cultural alignment and general reasoning, while Hofstede-based analyses show improved cross-cultural value alignment. The work highlights the importance of multilingual data, interpretable evaluation metrics, and targeted feedback in building more inclusive and reliable AI systems, while noting computational costs and data coverage as ongoing challenges.
Abstract
Large Language Models (LLMs) have demonstrated remarkable capabilities across various tasks, yet they often exhibit a specific cultural biases, neglecting the values and linguistic diversity of low-resource regions. This cultural bias not only undermines universal equality, but also risks reinforcing stereotypes and perpetuating discrimination. To address this, we propose CulFiT, a novel culturally-aware training paradigm that leverages multilingual data and fine-grained reward modeling to enhance cultural sensitivity and inclusivity. Our approach synthesizes diverse cultural-related questions, constructs critique data in culturally relevant languages, and employs fine-grained rewards to decompose cultural texts into verifiable knowledge units for interpretable evaluation. We also introduce GlobalCultureQA, a multilingual open-ended question-answering dataset designed to evaluate culturally-aware responses in a global context. Extensive experiments on three existing benchmarks and our GlobalCultureQA demonstrate that CulFiT achieves state-of-the-art open-source model performance in cultural alignment and general reasoning.
