HalluClean: A Unified Framework to Combat Hallucinations in LLMs
Yaxin Zhao, Yu Zhang
TL;DR
Hallucinations in LLM outputs undermine factual reliability across many tasks. HalluClean presents a zero-shot, task-agnostic framework that uses structured reasoning and a plan-and-solve style to detect and revise hallucinations without external knowledge sources. The method demonstrates strong, cross-task performance on QA, dialogue, summarization, math word problems, and self-contradiction detection, with domain robustness and compatibility with open-source models. The work provides interpretable, modular components and releases prompts and resources to support practical deployment in privacy-sensitive and resource-constrained settings.
Abstract
Large language models (LLMs) have achieved impressive performance across a wide range of natural language processing tasks, yet they often produce hallucinated content that undermines factual reliability. To address this challenge, we introduce HalluClean, a lightweight and task-agnostic framework for detecting and correcting hallucinations in LLM-generated text. HalluClean adopts a reasoning-enhanced paradigm, explicitly decomposing the process into planning, execution, and revision stages to identify and refine unsupported claims. It employs minimal task-routing prompts to enable zero-shot generalization across diverse domains, without relying on external knowledge sources or supervised detectors. We conduct extensive evaluations on five representative tasks-question answering, dialogue, summarization, math word problems, and contradiction detection. Experimental results show that HalluClean significantly improves factual consistency and outperforms competitive baselines, demonstrating its potential to enhance the trustworthiness of LLM outputs in real-world applications.
