Table of Contents
Fetching ...

A Contemporary Survey on Semantic Communications:Theory of Mind, Generative AI, and Deep Joint Source-Channel Coding

Loc X. Nguyen, Avi Deb Raha, Pyae Sone Aung, Dusit Niyato, Zhu Han, Choong Seon Hong

TL;DR

Semantic communications aim to reduce transmission overhead by conveying meaning rather than exact data, addressing latency and reliability in next-generation networks. The paper surveys three leading directions—Theory of Mind (ToM), Generative AI (GAI), and Deep Joint Source-Channel Coding (DJSCC)—covering foundational theory, representative works, and open challenges for each. It highlights cross-direction interactions, practical deployments (including edge-cloud collaborations and VR/vehicular contexts), and future opportunities such as quantum-enabled semantic networks. The findings suggest that while each direction offers distinct advantages in efficiency, reasoning, and robustness, standardization and scalable, secure implementations will shape the path toward practical semantic communication in 6G and beyond.

Abstract

Semantic communication is emerging as the next pillar in wireless communication technology due to its transformative capabilities in reducing communication overhead, enhancing robustness, and enabling intelligent information exchange. The most significant obstacle lies in the lack of standardization across various research directions, leading to inconsistencies in interpretation, objectives, and evaluation. In this survey, we provide an in-depth overview of three leading directions in semantic communication, namely Theory of Mind-based semantic communication, Generative AI-driven semantic communication, and Deep Joint Source-Channel Coding (DJSCC)-based semantic communication. These directions have been extensively studied and developed by research institutes worldwide, and their effectiveness continues to improve alongside advances in communication and computing technologies. The ToM-based semantic communication enables communication agents to interact intelligently, infer each other's intentions, and gradually form a shared understanding. The GAI-based semantic communication leverages generative models to create and interpret content beyond traditional compression, allowing flexible semantic encoding and decoding tailored to specific tasks. The DJSCC-based semantic communication direction integrates DL models to jointly optimize the source and channel coding processes for efficient semantic information transfer. Next, we present a detailed survey of existing works under each direction and open research problems in semantic communication. Furthermore, we identify and analyze critical challenges, such as scalability and adaptability, that currently hinder the deployment of semantic communication systems. Finally, we discuss potential research opportunities and future directions such as quantum computing to further enhance the capabilities of semantic communication.

A Contemporary Survey on Semantic Communications:Theory of Mind, Generative AI, and Deep Joint Source-Channel Coding

TL;DR

Semantic communications aim to reduce transmission overhead by conveying meaning rather than exact data, addressing latency and reliability in next-generation networks. The paper surveys three leading directions—Theory of Mind (ToM), Generative AI (GAI), and Deep Joint Source-Channel Coding (DJSCC)—covering foundational theory, representative works, and open challenges for each. It highlights cross-direction interactions, practical deployments (including edge-cloud collaborations and VR/vehicular contexts), and future opportunities such as quantum-enabled semantic networks. The findings suggest that while each direction offers distinct advantages in efficiency, reasoning, and robustness, standardization and scalable, secure implementations will shape the path toward practical semantic communication in 6G and beyond.

Abstract

Semantic communication is emerging as the next pillar in wireless communication technology due to its transformative capabilities in reducing communication overhead, enhancing robustness, and enabling intelligent information exchange. The most significant obstacle lies in the lack of standardization across various research directions, leading to inconsistencies in interpretation, objectives, and evaluation. In this survey, we provide an in-depth overview of three leading directions in semantic communication, namely Theory of Mind-based semantic communication, Generative AI-driven semantic communication, and Deep Joint Source-Channel Coding (DJSCC)-based semantic communication. These directions have been extensively studied and developed by research institutes worldwide, and their effectiveness continues to improve alongside advances in communication and computing technologies. The ToM-based semantic communication enables communication agents to interact intelligently, infer each other's intentions, and gradually form a shared understanding. The GAI-based semantic communication leverages generative models to create and interpret content beyond traditional compression, allowing flexible semantic encoding and decoding tailored to specific tasks. The DJSCC-based semantic communication direction integrates DL models to jointly optimize the source and channel coding processes for efficient semantic information transfer. Next, we present a detailed survey of existing works under each direction and open research problems in semantic communication. Furthermore, we identify and analyze critical challenges, such as scalability and adaptability, that currently hinder the deployment of semantic communication systems. Finally, we discuss potential research opportunities and future directions such as quantum computing to further enhance the capabilities of semantic communication.

Paper Structure

This paper contains 46 sections, 9 figures, 8 tables.

Figures (9)

  • Figure 1: The structure and scope of the survey
  • Figure 2: The visualization of different directions for semantic communication.
  • Figure 3: The visualization of a general Theory of Mind-based Semantic Communication.
  • Figure 4: The visualization of different knowledge representations.
  • Figure 5: The visualization of the architecture of Generative AI-based Semantic Communication Direction.
  • ...and 4 more figures