Prompt Optimization Across Multiple Agents for Representing Diverse Human Populations

Manh Hung Nguyen; Sebastian Tschiatschek; Adish Singla

TL;DR

A single LLM often produces homogeneous outputs that cannot represent a diverse human population. The paper introduces a framework for constructing a representative ensemble of LLM agents, each steered by $K$ human demonstrations through in-context learning, and formulates the selection of these agents (that is, of their prompts) as a submodular optimization problem. Three scalable methods (RepPop_demo, RepPop_mapped_1, RepPop_mapped_2) offer different trade-offs between computation and performance, come with theoretical guarantees, and are validated in education and crowdsourcing domains. Empirically, the ensemble captures population-wide behavior better than baselines and generalizes to unseen tasks, enabling more faithful simulations of diverse human perspectives for research and evaluation.

Abstract

The difficulty and expense of obtaining large-scale human responses make Large Language Models (LLMs) an attractive and promising proxy for human behavior. However, prior work shows that LLMs often produce homogeneous outputs that fail to capture the rich diversity of human perspectives and behaviors. Rather than trying to capture this diversity with a single LLM agent, we propose a novel framework to construct a set of agents that collectively capture the diversity of a given human population. Each agent is an LLM whose behavior is steered by conditioning on a small set of human demonstrations (task-response pairs) through in-context learning. The central challenge is therefore to select a representative set of LLM agents from the exponentially large space of possible agents. We tackle this selection problem through the lens of submodular optimization. In particular, we develop methods that offer different trade-offs between time complexity and performance guarantees. Extensive experiments in crowdsourcing and educational domains demonstrate that our approach constructs agents that represent human populations more effectively than baselines. Moreover, behavioral analyses on new tasks show that these agents reproduce the behavior patterns and perspectives of the students and annotators they are designed to represent.
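To make the submodular-selection idea concrete, the following is a minimal sketch of standard greedy maximization on a facility-location objective. It is an illustration under stated assumptions, not the paper's RepPop methods: the objective, the similarity matrix `sim` (where `sim[i][j]` scores how well candidate agent `i` matches human `j`), and the function names `coverage` and `greedy_select` are all hypothetical, since the abstract does not specify them. For a monotone submodular objective such as this one (with non-negative scores), greedy selection is known to achieve at least a $(1 - 1/e)$ fraction of the optimal value.

```python
from typing import List, Sequence


def coverage(sim: Sequence[Sequence[float]], selected: Sequence[int]) -> float:
    """Facility-location value: each human is scored by their best-matching selected agent."""
    if not selected:
        return 0.0
    n_humans = len(sim[0])
    return sum(max(sim[i][j] for i in selected) for j in range(n_humans))


def greedy_select(sim: Sequence[Sequence[float]], budget: int) -> List[int]:
    """Pick up to `budget` agents, each time adding the one with the largest marginal gain."""
    selected: List[int] = []
    # Best score achieved so far for each human (0.0 before any agent is chosen).
    best_per_human = [0.0] * len(sim[0])
    for _ in range(min(budget, len(sim))):
        best_gain, best_agent = 0.0, None
        for i in range(len(sim)):
            if i in selected:
                continue
            # Marginal gain of agent i: total improvement over the current best scores.
            gain = sum(max(s - b, 0.0) for s, b in zip(sim[i], best_per_human))
            if gain > best_gain:
                best_gain, best_agent = gain, i
        if best_agent is None:  # no remaining agent improves the objective
            break
        selected.append(best_agent)
        best_per_human = [max(s, b) for s, b in zip(sim[best_agent], best_per_human)]
    return selected


if __name__ == "__main__":
    # Hypothetical scores: 3 candidate agents (rows) against 4 humans (columns).
    sim = [
        [0.9, 0.8, 0.1, 0.0],
        [0.1, 0.2, 0.9, 0.7],
        [0.4, 0.4, 0.4, 0.4],
    ]
    chosen = greedy_select(sim, budget=2)
    print(chosen, coverage(sim, chosen))
```

In this toy example the two specialized agents together serve the population better than the "average" agent does, which mirrors the motivation for using an ensemble rather than a single LLM agent. In the paper's setting each candidate agent corresponds to a prompt of demonstrations, so the candidate space is exponentially large and cannot be enumerated as it is here.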
