Minstrel: Structural Prompt Generation with Multi-Agents Coordination for Non-AI Experts
Ming Wang, Yuanzhong Liu, Xiaoyu Liang, Yijie Huang, Daling Wang, Xiaocui Yang, Sijia Shen, Shi Feng, Xiaoming Zhang, Chaofeng Guan, Yifei Zhang
TL;DR
This work introduces LangGPT, a programming-language–inspired structural prompt framework with a dual-layer design of modules and elements to improve generalization and reuse of prompts for non-AI experts. It further presents Minstrel, a multi-agent system with reflection that coordinates three groups—Analysis, Design, and Test—to automatically generate and refine LangGPT prompts. Empirical results show that structural prompts generated by LangGPT or manually crafted outperform baselines, with Minstrel prompts approaching or surpassing human-written prompts across a range of benchmarks and LLMs; a user study supports high ease of use. However, gains are reduced for lower-capacity LLMs, motivating future work to optimize prompts for weaker models and broaden usability.
Abstract
LLMs have demonstrated commendable performance across diverse domains. Nevertheless, formulating high-quality prompts to assist them in their work poses a challenge for non-AI experts. Existing research in prompt engineering suggests somewhat scattered optimization principles and designs empirically dependent prompt optimizers. Unfortunately, these endeavors lack a structural design, incurring high learning costs and it is not conducive to the iterative updating of prompts, especially for non-AI experts. Inspired by structured reusable programming languages, we propose LangGPT, a structural prompt design framework. Furthermore, we introduce Minstrel, a multi-generative agent system with reflection to automate the generation of structural prompts. Experiments and the case study illustrate that structural prompts generated by Minstrel or written manually significantly enhance the performance of LLMs. Furthermore, we analyze the ease of use of structural prompts through a user survey in our online community.
