Mapping the political landscape from data traces: multidimensional opinions of users, politicians and media outlets on X
Antoine Vendeville, Jimena Royo-Letelier, Duncan Cassells, Jean-Philippe Cointet, Maxime Crépel, Tim Faverjon, Théophile Lenoir, Béatrice Mazoyer, Benjamin Ooghe-Tabanou, Armin Pournaki, Hiroki Yamashita, Pedro Ramaciotti
TL;DR
This work creates the first public, multidimensional, continuous opinion dataset for almost $N=978{,}933$ X users, 883 French MPs, and hundreds of media domains, calibrated across 16 CHES dimensions. It derives latent positions from the MP–follower follow network via Correspondence Analysis and maps them onto CHES axes using an affine transformation with Ridge regularization, yielding interpretable ideological and issue coordinates $\phi \in \mathbb{R}^P$ and $P$ up to 883. Validation combines bios-based LLM and human annotations to demonstrate monotone concentration and clear separation along CHES dimensions, and cross-wave consistency between CHES 2019 and 2023. The dataset enables nuanced analyses of polarization, media influence, and political space dynamics beyond traditional Left-Right, supporting cross-country and cross-platform extensions while acknowledging platform-specific biases and sampling limitations. Overall, the resource provides a robust, scalable framework for studying multidimensional online political ecosystems with practical implications for understanding online debates and media effects.
Abstract
Studying political activity on social media often requires defining and measuring political stances of users or content. Relevant examples include the study of opinion polarization, or the study of political diversity in online content diets. While many research designs rely on operationalizations best suited for the US setting, few allow addressing more general political systems, in which users and media outlets might exhibit stances on multiple ideology and issue dimensions, going beyond traditional Liberal-Conservative or Left-Right scales. To advance the study of more general online ecosystems, we present a dataset pertaining to a population of X/Twitter users, parliamentarians, and media outlets embedded in a political space spanned by dimensions measuring attitudes towards immigration, the EU, liberal values, elites and institutions, nationalism and the environment, in addition to left-right and liberal-conservative scales. We include indicators of individual activity and popularity: mean number of posts per day, number of followers, and number of followees. We provide several benchmarks validating the positions of these entities and discuss several applications for this dataset.
