CPR: Mitigating Large Language Model Hallucinations with Curative Prompt Refinement
Jung-Woo Shim, Yeong-Joon Ju, Ji-Hoon Park, Seong-Whan Lee
TL;DR
The paper addresses the problem of hallucinations in large language models caused by ill-formed prompts. It introduces Curative Prompt Refinement (CPR), a plug-and-play pipeline that uses a fine-tuned small language model (via LoRA) to clean prompts and generate informative task descriptions, followed by perplexity-based reranking to assemble a final, well-formed prompt. The authors provide a dataset and training regime for prompt refinement, demonstrate improved output quality and reduced hallucinations across multiple LLMs, and show competitive performance against SelfCheckGPT, especially on highly ill-formed prompts. The work emphasizes a lightweight, model-agnostic preprocessing approach that can be adopted across diverse inference settings to enhance reliability without requiring external knowledge resources. Overall, CPR advances practical LLM reliability by focusing on input quality and informative prompt enrichment rather than solely on model internals or post-hoc corrections.
Abstract
Recent advancements in large language models (LLMs) highlight their fluency in generating responses to diverse prompts. However, these models sometimes generate plausible yet incorrect ``hallucinated" facts, undermining trust. A frequent but often overlooked cause of such errors is the use of poorly structured or vague prompts by users, leading LLMs to base responses on assumed rather than actual intentions. To mitigate hallucinations induced by these ill-formed prompts, we introduce Curative Prompt Refinement (CPR), a plug-and-play framework for curative prompt refinement that 1) cleans ill-formed prompts, and 2) generates additional informative task descriptions to align the intention of the user and the prompt using a fine-tuned small language model. When applied to language models, we discover that CPR significantly increases the quality of generation while also mitigating hallucination. Empirical studies show that prompts with CPR applied achieves over a 90\% win rate over the original prompts without any external knowledge.
