Do Voters Get the Information They Want? Understanding Authentic Voter FAQs in the US and How to Improve for Informed Electoral Participation
Vipula Rawte, Deja N Scott, Gaurav Kumar, Aishneet Juneja, Bharat Sowrya Yaddanapalli, Biplav Srivastava
TL;DR
The paper addresses the lack of a national, comprehensive voter FAQ resource by constructing the first 50-state Voter FAQ dataset and introducing FAQ Information Quality Scores (FIQS) to quantify readability, summarization, sentiment, and topic coverage. It applies standardized NLP techniques (readability metrics, extractive/abstractive summarization, LDA topic modeling, and VADER sentiment) to analyze FAQs from SECs and LWV, revealing state-level leaders and laggards and proposing practical guidelines for improving the information ecosystem. The authors also explore state specificity of questions, promptability of LLMs (via fine-tuned Llama-3.1-8B), and provide a structured blueprint for how states can enhance accessibility, accuracy, and neutrality of election information. The work offers a foundation for developing decision-support tools and encourages broader data sourcing to strengthen informed electoral participation in the United States.
Abstract
Accurate information is crucial for democracy as it empowers voters to make informed decisions about their representatives and keeping them accountable. In the US, state election commissions (SECs), often required by law, are the primary providers of Frequently Asked Questions (FAQs) to voters, and secondary sources like non-profits such as League of Women Voters (LWV) try to complement their information shortfall. However, surprisingly, to the best of our knowledge, there is neither a single source with comprehensive FAQs nor a study analyzing the data at national level to identify current practices and ways to improve the status quo. This paper addresses it by providing the {\bf first dataset on Voter FAQs covering all the US states}. Second, we introduce metrics for FAQ information quality (FIQ) with respect to questions, answers, and answers to corresponding questions. Third, we use FIQs to analyze US FAQs to identify leading, mainstream and lagging content practices and corresponding states. Finally, we identify what states across the spectrum can do to improve FAQ quality and thus, the overall information ecosystem. Across all 50 U.S. states, 12% were identified as leaders and 8% as laggards for FIQS\textsubscript{voter}, while 14% were leaders and 12% laggards for FIQS\textsubscript{developer}.
