Generative AI Voting: Fair Collective Choice is Resilient to LLM Biases and Inconsistencies
Srijoni Majumdar, Edith Elkind, Evangelos Pournaras
TL;DR
This work investigates how Generative AI voting with large language models can scale deliberative participation while exposing biases and inconsistencies in AI representations of voters. Using a factorial design across real-world datasets and more than 50,000 AI voting personas in 306 elections, the authors quantify inconsistencies such as under-representation, AI approximation errors, and AI intransitivity across ballot formats. They demonstrate that fair collective choice methods like equal shares and Phragmén's rule yield greater resilience to AI biases, and that AI representations of abstaining voters can meaningfully recover lost consistency under these fair rules. The study also links abstainer traits to specific cognitive biases and uses explainable AI tools to map how these traits drive AI decisions, offering guidance for safeguarding democracy as AI-assisted voting scales up in participatory budgeting and elections.
Abstract
Scaling up deliberative and voting participation is a longstanding endeavor -- a cornerstone for direct democracy and legitimate collective choice. Recent breakthroughs in generative artificial intelligence (AI) and large language models (LLMs) unravel new capabilities for AI personal assistants to overcome cognitive bandwidth limitations of humans, providing decision support or even direct representation of human voters at large scale. However, the quality of this representation and what underlying biases manifest when delegating collective decision-making to LLMs is an alarming and timely challenge to tackle. By rigorously emulating with high realism more than >50K LLM voting personas in 306 real-world voting elections, we disentangle the nature of different biases in LLMS (GPT 3, GPT 3.5, and Llama2). Complex preferential ballot formats exhibit significant inconsistencies compared to simpler majoritarian elections that show higher consistency. Strikingly though, by demonstrating for the first time in real-world a proportional representation of voters in direct democracy, we are also able to show that fair ballot aggregation methods, such as equal shares, prove to be a win-win: fairer voting outcomes for humans with fairer AI representation, especially for voters who are likely to abstain. This novel underlying relationship proves paramount for democratic resilience in progressives scenarios with low voters turnout and voter fatigue supported by AI representatives: abstained voters are mitigated by recovering highly representative voting outcomes that are fairer. These interdisciplinary insights provide remarkable foundations for science, policymakers, and citizens to develop safeguards and resilience for AI risks in democratic innovations.
