Deep Generative Modeling Reshapes Compression and Transmission: From Efficiency to Resiliency
Jincheng Dai, Xiaoqi Qin, Sixian Wang, Lexi Xu, Kai Niu, Ping Zhang
TL;DR
This work frames deep probabilistic generative modeling as a bridge between Shannon information theory's data compression and distortion correction for reliable transmission, proposing end-to-end generative approaches to improve both efficiency and resiliency. It analyzes neural compression across continuous (infinite) and discrete (finite) quantization regimes, introduces nonlinear transform coding (NTC), and delineates four paradigms—lossless, lossy, perceptual, and semantic—within rate–distortion trade-offs ($R$-$D$) and rate–distortion–perception trade-offs ($R$-$D$-$P$). It advances joint source–channel coding by presenting strongly-coupled JSCC with end-to-end learned mappings and weakly-coupled JSCC that relies on latent-packet loss concealment via masked-transformer priors, including nonlinear transform source–channel coding (NTSCC) and latent conditioning for posterior sampling. The findings demonstrate that generative priors enable graceful degradation and improved resilience under adverse channels, while token-based semantic compression and MT-based latent modeling push toward robust, semantically meaningful end-to-end communication. Overall, the paper gives a unified perspective linking foundation generative models with both source and channel coding to enable efficient, resilient, and potentially semantically aware communications, and it frames future work at the intersection of compression, transmission, and higher-level cognitive capabilities.
Abstract
Information theory and machine learning are inextricably linked and have even been referred to as "two sides of the same coin". One particularly elegant connection is the essential equivalence between probabilistic generative modeling and data compression or transmission. In this article, we reveal the dual-functionality of deep generative models that reshapes both data compression for efficiency and transmission error concealment for resiliency. We present how the contextual predictive capabilities of powerful generative models can be well positioned to be strong compressors and estimators. In this sense, we advocate for viewing the deep generative modeling problem through the lens of end-to-end communications, and evaluate the compression and error restoration capabilities of foundation generative models. We show that the kernel of many large generative models is powerful predictor that can capture complex relationships among semantic latent variables, and the communication viewpoints provide novel insights into semantic feature tokenization, contextual learning, and usage of deep generative models. In summary, our article highlights the essential connections of generative AI to source and channel coding techniques, and motivates researchers to make further explorations in this emerging topic.
