Table of Contents
Fetching ...

Prompts to Proxies: Emulating Human Preferences via a Compact LLM Ensemble

Bingchen Wang, Zi-Yu Khoo, Jingtan Wang

TL;DR

Prompts to Proxies introduces preference reconstruction theory to align LLM proxies with target populations by constructing a functional basis of proxy agents and weighting them to reproduce observed survey responses. The two-stage P2P pipeline builds a diverse agent pool via entropy-guided prompts and selects a compact ensemble via L1-regularized regression to match observed distributions without demographic data or fine-tuning. Empirical results across 14 ATP waves and the World Values Survey show improved distributional fidelity and favorable cost relative to prompting baselines, with robust cross-locale generalization and a stress test against SFT baselines under topic shift. The work advances pluralistic alignment for social science simulations and points to future extensions to freeform outputs, non-stationary preferences, and model steerability benchmarks.

Abstract

Large language models are increasingly used as proxies for human subjects in social science research, yet external validity requires that synthetic agents faithfully reflect the preferences of target human populations. We introduce *preference reconstruction theory*, a framework that formalizes preference alignment as a representation learning problem: constructing a functional basis of proxy agents and recovering population preferences through weighted aggregation. We implement this via *Prompts to Proxies* ($\texttt{P2P}$), a modular two-stage system. Stage 1 uses structured prompting with entropy-based adaptive sampling to construct a diverse agent pool spanning the latent preference space. Stage 2 employs L1-regularized regression to select a compact ensemble whose aggregate response distributions align with observed data from the target population. $\texttt{P2P}$ requires no finetuning and no access to sensitive demographic data, incurring only API inference costs. We validate the approach on 14 waves of the American Trends Panel, achieving an average test MSE of 0.014 across diverse topics at approximately 0.8 USD per survey. We additionally test it on the World Values Survey, demonstrating its potential to generalize across locales. When stress-tested against an SFT-aligned baseline, $\texttt{P2P}$ achieves competitive performance using less than 3% of the training data.

Prompts to Proxies: Emulating Human Preferences via a Compact LLM Ensemble

TL;DR

Prompts to Proxies introduces preference reconstruction theory to align LLM proxies with target populations by constructing a functional basis of proxy agents and weighting them to reproduce observed survey responses. The two-stage P2P pipeline builds a diverse agent pool via entropy-guided prompts and selects a compact ensemble via L1-regularized regression to match observed distributions without demographic data or fine-tuning. Empirical results across 14 ATP waves and the World Values Survey show improved distributional fidelity and favorable cost relative to prompting baselines, with robust cross-locale generalization and a stress test against SFT baselines under topic shift. The work advances pluralistic alignment for social science simulations and points to future extensions to freeform outputs, non-stationary preferences, and model steerability benchmarks.

Abstract

Large language models are increasingly used as proxies for human subjects in social science research, yet external validity requires that synthetic agents faithfully reflect the preferences of target human populations. We introduce *preference reconstruction theory*, a framework that formalizes preference alignment as a representation learning problem: constructing a functional basis of proxy agents and recovering population preferences through weighted aggregation. We implement this via *Prompts to Proxies* (), a modular two-stage system. Stage 1 uses structured prompting with entropy-based adaptive sampling to construct a diverse agent pool spanning the latent preference space. Stage 2 employs L1-regularized regression to select a compact ensemble whose aggregate response distributions align with observed data from the target population. requires no finetuning and no access to sensitive demographic data, incurring only API inference costs. We validate the approach on 14 waves of the American Trends Panel, achieving an average test MSE of 0.014 across diverse topics at approximately 0.8 USD per survey. We additionally test it on the World Values Survey, demonstrating its potential to generalize across locales. When stress-tested against an SFT-aligned baseline, achieves competitive performance using less than 3% of the training data.

Paper Structure

This paper contains 70 sections, 2 theorems, 24 equations, 19 figures, 15 tables, 2 algorithms.

Key Result

Proposition 1.0

Let $\widetilde{\mathcal{P}}$ be a latent preference space (regardless of human or model), with an individual preference denoted by $\tilde{p} \in \widetilde{\mathcal{P}}$. Let $g$ be the map from $\widetilde{\mathcal{P}}$ to the observed response space $\widetilde{\mathcal{R}} \subset \mathbb{R}^n$

Figures (19)

  • Figure 1: Top: Finding a functional basis to recover latent ground-truth preference. Bottom: Identification of latent preference through multiple observations.
  • Figure 2: System Overview of the Prompts to Proxies (P2P) Framework.P2P operationalizes preference alignment as a two-stage process inspired by revealed preference theory. In Stage 1, active endowment generation uses a structured attribute bank and entropy-guided adaptive sampling strategies to construct diverse agent personas, called endowments. In Stage 2, regression-based aggregation assigns weights to reconstruct population-level preference patterns. An external validity check accesses how well the weighted ensemble predicts aggregate responses on held-out questions, assessing out-of-sample generalization.
  • Figure 3: Error Analysis of Hong Kong WVS Results. We compare UK-native and HK-native with HK-transfer (US endowments applied to HK via P2P Stage 2). (A) Ground truth divergence between populations ($r=0.848$). (B) Transfer outperforms native in overall performance, corroborating the functional basis perspective. (C) Error gap scales with cultural divergence, suggesting learnability limits (Definition \ref{['def:learnability_pop']}). (D) Despite higher test MSE, HK-native outperforms transfer on culturally divergent questions, indicating that localization helps where cultures genuinely differ.
  • Figure 4: Attribute Bank and Endowment Generation Workflow. The preset attribute bank is organized hierarchically by template (core, thematic, theoretical), mode (e.g., economics, politics, maslow, big five), and attribute (e.g., race, gender, ethnicity, urban/rural residency, risk aversion). Endowment is generated by a specialized LLM (EndowmentModel), which takes drawn attributes as input and outputs a profile with instantiation of these attributes.
  • Figure 5: Left: Entropy change across update steps for each question during active endowment generation for ATP W42. Each column represents a question. Green indicates an increase in entropy from the previous step, while red indicates a decline. Right: Entropy trajectories by question (line chart). Active endowment generation progressively increases response diversity. Trends are classified based on slope and volatility of the entropy trajectory: “rising” if the slope exceeds 0.1 and standard deviation is above 0.02; “falling” if the slope is below –0.1 with sufficient volatility; otherwise labeled “stable”.
  • ...and 14 more figures

Theorems & Definitions (15)

  • Definition 2.0
  • Definition 2.0
  • Proposition 1.0: Existence of a representative agent
  • proof
  • Remark 1.1: Sufficient condition for representative agent
  • proof
  • Definition 1.1
  • Definition 1.2
  • Definition 1.2
  • Definition 1.3
  • ...and 5 more