Plum: Prompt Learning using Metaheuristic
Rui Pan, Shuo Xing, Shizhe Diao, Wenhe Sun, Xiang Liu, Kashun Shum, Renjie Pi, Jipeng Zhang, Tong Zhang
TL;DR
The paper frames prompt learning for LLMs as a discrete, gradient-free black-box optimization problem and introduces Plum, a general metaheuristic-based framework. Plum defines a neighborhood, a chosen metaheuristic, and scoring functions to automatically search for effective prompts, enabling automatic, discrete, interpretable optimization. Through implementations of six algorithms (Hill Climbing, Simulated Annealing, Genetic Algorithm variants, Tabu Search, and Harmony Search), Plum demonstrates competitive performance in white-box and black-box settings and reveals human-understandable prompt patterns for reasoning and image generation. The work also explores prompt discovery, efficiency considerations, and potential integration with LLM-driven mutators, highlighting Plum as a flexible, extensible template for prompt optimization across modalities and tasks.
Abstract
Since the emergence of large language models, prompt learning has become a popular method for optimizing and customizing these models. Special prompts, such as Chain-of-Thought, have even revealed previously unknown reasoning capabilities within these models. However, the progress of discovering effective prompts has been slow, driving a desire for general prompt optimization methods. Unfortunately, few existing prompt learning methods satisfy the criteria of being truly "general", i.e., automatic, discrete, black-box, gradient-free, and interpretable all at once. In this paper, we introduce metaheuristics, a branch of discrete non-convex optimization methods with over 100 options, as a promising approach to prompt learning. Within our paradigm, we test six typical methods: hill climbing, simulated annealing, genetic algorithms with/without crossover, tabu search, and harmony search, demonstrating their effectiveness in white-box and black-box prompt learning. Furthermore, we show that these methods can be used to discover more human-understandable prompts that were previously unknown in both reasoning and image generation tasks, opening the door to a cornucopia of possibilities in prompt optimization. We release all the codes in \url{https://github.com/research4pan/Plum}.
