Auxiliary-Hyperparameter-Free Sampling: Entropy Equilibrium for Text Generation
Xiaodong Cai, Hai Lin, Shaoxiong Zhan, Weiqi Luo, Hong-Gee Kim, Hongyan Hao, Yu Yang, Hai-Tao Zheng
TL;DR
The paper tackles the sensitivity and deployment burden of hyperparameter tuning in token sampling for large language models. It introduces Entropy Equilibrium Sampling (EES), an auxiliary-hyperparameter-free method that selects a dynamic candidate set by enforcing an entropy-mass equilibrium, and provides theoretical guarantees of existence and uniqueness for the equilibrium threshold. Through extensive experiments across multiple models and tasks, the authors demonstrate that EES achieves competitive accuracy and coherence while maintaining robust performance across temperature settings and eliminating hyperparameter dependence. The work offers a practical sampling alternative with favorable deployment properties and analyzes its impact on diversity, creative writing benchmarks, and human/LLM evaluations.
Abstract
Token sampling strategies critically influence text generation quality in large language models (LLMs). However, existing methods introduce additional hyperparameters, requiring extensive tuning and complicating deployment. We present Entropy Equilibrium Sampling (EES), an auxiliary hyperparameter-free approach inspired by information theory that can dynamically adjust candidate sets by balancing normalized entropy with probability mass. We evaluate EES on both reasoning and generation tasks across a range of model architectures. Our results show that EES consistently performs well across temperature settings, delivering competitive accuracy and coherence while maintaining diversity. By eliminating the need for hyperparameter tuning, EES greatly simplifies deployment while improving performance. Code is available at https://github.com/shuanncai/EES
