Language Model Prompt Selection via Simulation Optimization
Haoting Zhang, Jinghai He, Rhonda Righter, Zeyu Zheng
TL;DR
This paper introduces a two-stage framework that uses simulation optimization to select prompts for pre-trained language models without additional fine-tuning. It first builds a finite, informative soft-prompt set via a text autoencoder and PCA, then uses a Bayesian surrogate with an acquisition function to sequentially evaluate prompts, with a refinement step via Projection Stochastic Kriging to improve latent mappings. The authors demonstrate that a Bayesian neural network surrogate with M-UCB or PR-M-UCB acquisition outperforms direct latent-space search under finite budgets and provide consistency proofs and practical guidance, including hyperparameter tuning through stochastic kriging. The approach is practical for small organizations seeking efficient, model-agnostic prompt optimization and offers broad applicability to other LM-based management tasks. Key contributions include a rigorously framed two-stage framework, consistency guarantees, and a thorough empirical comparison of surrogate models and acquisition strategies. $v(z)$, $h( ilde{y},y)$, and acquisition terms such as $ ext{M-UCB}$ are central to the optimization process, and latent-to-text mappings are enabled by a text autoencoder coupled with PCA.
Abstract
With the advancement in generative language models, the selection of prompts has gained significant attention in recent years. A prompt is an instruction or description provided by the user, serving as a guide for the generative language model in content generation. Despite existing methods for prompt selection that are based on human labor, we consider facilitating this selection through simulation optimization, aiming to maximize a pre-defined score for the selected prompt. Specifically, we propose a two-stage framework. In the first stage, we determine a feasible set of prompts in sufficient numbers, where each prompt is represented by a moderate-dimensional vector. In the subsequent stage for evaluation and selection, we construct a surrogate model of the score regarding the moderate-dimensional vectors that represent the prompts. We propose sequentially selecting the prompt for evaluation based on this constructed surrogate model. We prove the consistency of the sequential evaluation procedure in our framework. We also conduct numerical experiments to demonstrate the efficacy of our proposed framework, providing practical instructions for implementation.
