Which Identities Are Mobilized: Towards an automated detection of social group appeals in political texts
Felicia Riethmüller, Julian Dehne, Denise Al-Gaddooa
TL;DR
Riethmüller et al. tackle how political actors mobilize social identities and how to detect these cues at scale in European manifestos. They introduce a hybrid detection pipeline—dictionary seeds, BERT-based expansion, and a Mistral-7B with Embedding Space-based Filtering—to identify social group mentions across the Manifesto Corpus in 15 countries from 1980 to 2021. The LLM-ESF approach provides the strongest coverage and a scalable workflow, enabling cross-national mapping of party group images and comparisons between Radical Right and mainstream parties. Their empirical tests find no robust contagion: Radical Right strength or mainstream vote losses do not consistently drive convergence in group appeals, although patterns vary by country; the method offers a transferable tool for studying group-based rhetoric in political texts and related CSS contexts.
Abstract
This paper proposes a computational text classification strategy to identify references to social groups in European party manifestos and beyond. Our methodology uses machine learning techniques, including BERT and large language models, to capture group-based appeals in texts. We propose to combine automated identification of social groups using the Mistral-7B-v0.1 Large Language Model with Embedding Space-based filtering to extend a sample of core social groups to all social groups mentioned in party manifestos. By applying this approach to RRP's and mainstream parties' group images in manifestos, we explore whether electoral dynamics explain similarities in group appeals and potential convergence or divergence in party strategies. Contrary to expectations, increasing RRP support or mainstream parties' vote loss does not necessarily lead to convergence in group appeals. Nonetheless, our methodology enables mapping similarities in group appeals across time and space in 15 European countries from 1980 to 2021 and can be transferred to other use cases as well.
