Retrieval-augmented Generation for GenAI-enabled Semantic Communications
Shunpu Tang, Ruichen Zhang, Yuxuan Yan, Qianqian Yang, Dusit Niyato, Xianbin Wang, Shiwen Mao
TL;DR
The paper addresses semantic inconsistency, limited adaptability, and lack of knowledge accumulation in GenSemCom. It proposes a RAG-enabled GenSemCom architecture that injects external knowledge via a knowledge base, an intelligent retriever, and a knowledge-aware encoder/decoder to guide semantic encoding and decoding. A case study on image transmission using a diffusion-based GenSemCom system with multi-modal prompts demonstrates improved semantic fidelity and image reconstruction across varying $BER$ levels, quantified by metrics such as $CLIP$ similarity, $LPIPS$, $PIEAPP$, and $MS-SSIM$. The findings suggest that RAG significantly improves robustness and efficiency of GenSemCom, with future directions including adaptive retrieval, knowledge-base synchronization, and privacy/security considerations.
Abstract
Semantic communication (SemCom) is an emerging paradigm aiming at transmitting only task-relevant semantic information to the receiver, which can significantly improve communication efficiency. Recent advancements in generative artificial intelligence (GenAI) have empowered GenAI-enabled SemCom (GenSemCom) to further expand its potential in various applications. However, current GenSemCom systems still face challenges such as semantic inconsistency, limited adaptability to diverse tasks and dynamic environments, and the inability to leverage insights from past transmission. Motivated by the success of retrieval-augmented generation (RAG) in the domain of GenAI, this paper explores the integration of RAG in GenSemCom systems. Specifically, we first provide a comprehensive review of existing GenSemCom systems and the fundamentals of RAG techniques. We then discuss how RAG can be integrated into GenSemCom. Following this, we conduct a case study on semantic image transmission using an RAG-enabled diffusion-based SemCom system, demonstrating the effectiveness of the proposed integration. Finally, we outline future directions for advancing RAG-enabled GenSemCom systems.
