Building Understandable Messaging for Policy and Evidence Review (BUMPER) with AI
Katherine A. Rosenfeld, Maike Sonnewald, Sonia J. Jindal, Kevin A. McCarthy, Joshua L. Proctor
TL;DR
The paper addresses the challenge of translating diverse scientific evidence into policy while mitigating the trust and accountability issues that accompany large language models, by introducing the Building Understandable Messaging for Policy and Evidence Review (BUMPER) framework. It formalizes a scientists-driven knowledge base $K$, actions $A=(a_0,...,a_J)$, guidelines $G$, and topics $T$ to drive a six-step evidence-synthesis loop, and introduces a novel compliance score $S$ derived from token probabilities $P_T(G)$ to separate in-scope conformity from factual correctness, keeping scientists in the loop. The contributions include formalizing ownership, establishing a transparent, scope-limited evaluation workflow, and prototyping code with rugby and measles health-policy case studies to demonstrate practical translation. The framework aims to scale to independent data sources without fine-tuning, improving accessibility and confidence in evidence-informed policy while highlighting the need for validation and human collaboration in high-stakes settings.
Abstract
We introduce a framework for the use of large language models (LLMs) in Building Understandable Messaging for Policy and Evidence Review (BUMPER). LLMs are proving capable of providing interfaces for understanding and synthesizing large databases of diverse media. This presents an exciting opportunity to supercharge the translation of scientific evidence into policy and action, thereby improving livelihoods around the world. However, these models also pose challenges related to access, trust-worthiness, and accountability. The BUMPER framework is built atop a scientific knowledge base (e.g., documentation, code, survey data) by the same scientists (e.g., individual contributor, lab, consortium). We focus on a solution that builds trustworthiness through transparency, scope-limiting, explicit-checks, and uncertainty measures. LLMs are rapidly being adopted and consequences are poorly understood. The framework addresses open questions regarding the reliability of LLMs and their use in high-stakes applications. We provide a worked example in health policy for a model designed to inform measles control programs. We argue that this framework can facilitate accessibility of and confidence in scientific evidence for policymakers, drive a focus on policy-relevance and translatability for researchers, and ultimately increase and accelerate the impact of scientific knowledge used for policy decisions.
