Learning to Rewrite Prompts for Bootstrapping LLMs on Downstream Tasks
Qinhao Zhou, Xiang Xiang, Kun He, John E. Hopcroft
TL;DR
This work tackles the problem that prompt prompts for LLMs often rely on fixed instructions, but for many NLG tasks the input content drives performance. It introduces ROI, a lightweight pipeline that rewrites the input component using a small rewriting model learned via back-translation, supplemented by a similarity-based filter to remove noisy rewrites, all without updating the LLM. The approach demonstrates consistent improvements across multi-domain machine translation, summarization, and GLUE benchmarks, highlighting gains in BLEU, RougeL, Edit Rate, accuracy, and F1 while reducing the need for expensive LLM fine-tuning. The methods offer a practical, training-efficient route to better prompt alignment with LLM preferences, with potential applicability to a broad range of downstream tasks beyond translation and sentiment analysis.
Abstract
In recent years, the growing interest in Large Language Models (LLMs) has significantly advanced prompt engineering, transitioning from manual design to model-based optimization. Prompts for LLMs generally comprise two components: the \textit{instruction}, which defines the task or objective, and the \textit{input}, which is tailored to the instruction type. In natural language generation (NLG) tasks such as machine translation, the \textit{input} component is particularly critical, while the \textit{instruction} component tends to be concise. Existing prompt engineering methods primarily focus on optimizing the \textit{instruction} component for general tasks, often requiring large-parameter LLMs as auxiliary tools. However, these approaches exhibit limited applicability for tasks like machine translation, where the \textit{input} component plays a more pivotal role. To address this limitation, this paper introduces a novel prompt optimization method specifically designed for machine translation tasks. The proposed approach employs a small-parameter model trained using a back-translation-based strategy, significantly reducing training overhead for single-task optimization while delivering highly effective performance. With certain adaptations, this method can also be extended to other downstream tasks.
