The Human Flourishing Geographic Index: A County-Level Dataset for the United States, 2013--2023
Stefano M. Iacus, Devika Jain, Andrea Nasuto, Giuseppe Porro, Marcello Carammia, Andrea Vezzulli
TL;DR
The paper introduces the Human Flourishing Geographic Index (HFGI), a high-resolution, county- and state-level dataset of flourishing-related expressions derived from 2.6 billion geolocated U.S. tweets from 2013–2023. It uses fine-tuned LLMs (Llama 3.2 3B) to map tweets to 46–48 flourishing indicators aligned with the Global Flourishing Study, producing monthly and yearly indicators with salience measures. The authors validate HFGI against external data (TSGI, CDC mental health metrics, CPI) and explore relationships with climate risk, revealing meaningful spatial patterns, rural–urban differences, and an interpretation framework distinguishing expression propensity from prevalence. They provide comprehensive usage notes, data availability, and a codebook, enabling cross-disciplinary analyses of well-being, inequality, and social dynamics at an unprecedented scale and resolution.
Abstract
Quantifying human flourishing, a multidimensional construct including happiness, health, purpose, virtue, relationships, and financial stability, is critical for understanding societal well-being beyond economic indicators. Existing measures often lack fine spatial and temporal resolution. Here we introduce the Human Flourishing Geographic Index (HFGI), derived from analyzing approximately 2.6 billion geolocated U.S. tweets (2013-2023) using fine-tuned large language models to classify expressions across 48 indicators aligned with Harvard's Global Flourishing Study framework plus attitudes towards migration and perception of corruption. The dataset offers monthly and yearly county- and state-level indicators of flourishing-related discourse, validated to confirm that the measures accurately represent the underlying constructs and show expected correlations with established indicators. This resource enables multidisciplinary analyses of well-being, inequality, and social change at unprecedented resolution, offering insights into the dynamics of human flourishing as reflected in social media discourse across the United States over the past decade.
