Fundamental Limitation of Semantic Communications: Neural Estimation for Rate-Distortion
Dongxu Li, Jianhao Huang, Chuan Huang, Xiaoqi Qin, Han Zhang, Ping Zhang
TL;DR
This work analyzes the fundamental limits of semantic communications via the semantic rate-distortion function (SRDF) for a source pair $(X,S)$ over a discrete memoryless channel. It introduces NESRD, a neural estimator that learns a generative model to approximate the SRDF when the semantic source distribution is unknown, and also presents a cascade network variant for deterministic $S=h(X)$ and a Blahut-Arimoto algorithm for the perfectly known distribution case. The results demonstrate the neural estimator's strong consistency and provide practical SRDF computation methods, validated on joint Gaussian sources and datasets such as MNIST and SVHN. Overall, the paper advances theoretical understanding and practical estimation tools for rate-distortion tradeoffs in semantic communications, enabling more efficient joint source-channel design under distribution uncertainty.
Abstract
This paper studies the fundamental limit of semantic communications over the discrete memoryless channel. We consider the scenario to send a semantic source consisting of an observation state and its corresponding semantic state, both of which are recovered at the receiver. To derive the performance limitation, we adopt the semantic rate-distortion function (SRDF) to study the relationship among the minimum compression rate, observation distortion, semantic distortion, and channel capacity. For the case with unknown semantic source distribution, while only a set of the source samples is available, we propose a neural-network-based method by leveraging the generative networks to learn the semantic source distribution. Furthermore, for a special case where the semantic state is a deterministic function of the observation, we design a cascade neural network to estimate the SRDF. For the case with perfectly known semantic source distribution, we propose a general Blahut-Arimoto algorithm to effectively compute the SRDF. Finally, experimental results validate our proposed algorithms for the scenarios with ideal Gaussian semantic source and some practical datasets.
