Hallucination-aware Optimization for Large Language Model-empowered Communications
Yinqiu Liu, Guangyuan Liu, Ruichen Zhang, Dusit Niyato, Zehui Xiong, Dong In Kim, Kaibin Huang, Hongyang Du
TL;DR
This paper addresses the reliability challenge of hallucinations in large language models applied to telecommunications. It offers a lifecycle-based analysis of hallucination causes, surveys model- and system-based mitigation strategies, and evaluates existing LLM-empowered communication schemes. The authors present a telecom-oriented case study that combines Direct Preference Optimization fine-tuning with a mobile-edge mixture-of-experts to reduce hallucinations and optimize quality of experience, achieving a 20.6% improvement in correctness and substantial QoE gains. The work provides a public telecom hallucination dataset (TeleQnA) and advocates a hybrid approach to enable trustworthy LLM-enabled telecom services, outlining future security, customization, and reasoning directions.
Abstract
Large Language Models (LLMs) have significantly advanced communications fields, such as Telecom Q\&A, mathematical modeling, and coding. However, LLMs encounter an inherent issue known as hallucination, i.e., generating fact-conflicting or irrelevant content. This problem critically undermines the applicability of LLMs in communication systems yet has not been systematically explored. Hence, this paper provides a comprehensive review of LLM applications in communications, with a particular emphasis on hallucination mitigation. Specifically, we analyze hallucination causes and summarize hallucination mitigation strategies from both model- and system-based perspectives. Afterward, we review representative LLM-empowered communication schemes, detailing potential hallucination scenarios and comparing the mitigation strategies they adopted. Finally, we present a case study of a Telecom-oriented LLM that utilizes a novel hybrid approach to enhance the hallucination-aware service experience. On the model side, we publish a Telecom hallucination dataset and apply direct preference optimization to fine-tune LLMs, resulting in a 20.6\% correct rate improvement. Moreover, we construct a mobile-edge mixture-of-experts architecture for optimal LLM expert activation. Our research aims to propel the field of LLM-empowered communications forward by detecting and minimizing hallucination impacts.
