Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity

Jiayi Zhang; Simon Yu; Derek Chong; Anthony Sicilia; Michael R. Tomz; Christopher D. Manning; Weiyan Shi

TL;DR

The paper identifies typicality bias in human preference data as a fundamental driver of the mode collapse observed after post-training alignment. It formalizes the effect within a reward- and KL-regularized optimization framework and introduces Verbalized Sampling (VS), a training-free prompting strategy that asks the model for a set of responses together with their probabilities, in order to recover the base model's diversity. Across creative writing, dialogue simulation, open-ended QA, and synthetic data generation, VS significantly increases output diversity while preserving factual accuracy and safety, and larger, more capable models benefit more from the approach. The work offers a data-centric view of alignment and a practical inference-time remedy that unlocks LLM diversity without additional training.

Abstract

Post-training alignment often reduces LLM diversity, leading to a phenomenon known as mode collapse. Unlike prior work that attributes this effect to algorithmic limitations, we identify a fundamental, pervasive data-level driver: typicality bias in preference data, whereby annotators systematically favor familiar text, a tendency consistent with well-established findings in cognitive psychology. We formalize this bias theoretically, verify it empirically on preference datasets, and show that it plays a central role in mode collapse. Motivated by this analysis, we introduce Verbalized Sampling (VS), a simple, training-free prompting strategy to circumvent mode collapse. VS prompts the model to verbalize a probability distribution over a set of responses (e.g., "Generate 5 jokes about coffee and their corresponding probabilities"). Comprehensive experiments show that VS significantly improves performance across creative writing (poems, stories, jokes), dialogue simulation, open-ended QA, and synthetic data generation, without sacrificing factual accuracy or safety. For instance, in creative writing, VS increases diversity by 1.6-2.1x over direct prompting. We further observe an emergent trend that more capable models benefit more from VS. In sum, our work provides a new data-centric perspective on mode collapse and a practical inference-time remedy that helps unlock pre-trained generative diversity.
