Semantic Communication based on Generative AI: A New Approach to Image Compression and Edge Optimization
Francesco Pezone
TL;DR
This work advances semantic communication by integrating generative AI with edge-network optimization to compress images semantically rather than bit-for-bit. It introduces modular SPIC and SQ-GAN frameworks that couple semantic maps with coarse reconstructions and utilize diffusion and VQ-based generative models to preserve meaning at low bitrates. A GoAL resource-allocation scheme based on Information Bottleneck and stochastic optimization demonstrates how to adapt compression and computation at the edge under latency and performance constraints. Extensive evaluation on semantic metrics (e.g., mIoU, ACC) and classic metrics on Cityscapes-like data shows superior semantic preservation at lower bitrates compared to traditional codecs, highlighting practical impact for autonomous systems and real-time edge inference. The work also provides a unified framework combining IB-based task relevance with diffusion/GAN-based reconstruction, enabling adaptive, real-time optimization in data-driven, resource-constrained networks.
Abstract
As digital technologies advance, communication networks face challenges in handling the vast data generated by intelligent devices. Autonomous vehicles, smart sensors, and IoT systems necessitate new paradigms. This thesis addresses these challenges by integrating semantic communication and generative models for optimized image compression and edge network resource allocation. Unlike bit-centric systems, semantic communication prioritizes transmitting meaningful data specifically selected to convey the meaning rather than obtain a faithful representation of the original data. The communication infrastructure can benefit to significant improvements in bandwidth efficiency and latency reduction. Central to this work is the design of semantic-preserving image compression using Generative Adversarial Networks and Denoising Diffusion Probabilistic Models. These models compress images by encoding only semantically relevant features, allowing for high-quality reconstruction with minimal transmission. Additionally, a Goal-Oriented edge network optimization framework is introduced, leveraging the Information Bottleneck principle and stochastic optimization to dynamically allocate resources and enhance efficiency. By integrating semantic communication into edge networks, this approach balances computational efficiency and communication effectiveness, making it suitable for real-time applications. The thesis compares semantic-aware models with conventional image compression techniques using classical and semantic evaluation metrics. Results demonstrate the potential of combining generative AI and semantic communication to create more efficient semantic-goal-oriented communication networks that meet the demands of modern data-driven applications.
