Semantic Arithmetic Coding using Synonymous Mappings
Zijian Liang, Kai Niu, Jin Xu, Ping Zhang
TL;DR
This work introduces semantic arithmetic coding (SAC), a semantic-source-coding framework that uses carefully designed synonymous mappings to partition semantic information into synonymous sets and perform arithmetic coding over those sets. By tying semantic content to synonymous structures, SAC achieves compression near the semantic entropy $H_s(\tilde{\mathcal{U}})$ while preserving meaning, demonstrated through an edge texture map experiment where semantic integrity is maintained and compression improves over conventional arithmetic coding. The authors extend the code-length theorem to the semantic domain, showing that the average code length can approach $H_s(\tilde{\mathcal{U}})$ as the sequence length grows, and provide empirical evidence of lossless semantic reconstruction with meaningful reductions in bits. Overall, SAC offers a principled approach to semantic lossless compression with theoretically grounded limits and practical effectiveness for semantics-rich sources.
Abstract
Recent semantic communication methods explore effective ways to expand the communication paradigm and improve the system performance of the communication systems. Nonetheless, the common problem of these methods is that the essence of semantics is not explicitly pointed out and directly utilized. A new epistemology suggests that synonymy, which is revealed as the fundamental feature of semantics, guides the establishment of the semantic information theory from a novel viewpoint. Building on this theoretical basis, this paper proposes a semantic arithmetic coding (SAC) method for semantic lossless compression using intuitive semantic synonymy. By constructing reasonable synonymous mappings and performing arithmetic coding procedures over synonymous sets, SAC can achieve higher compression efficiency for meaning-contained source sequences at the semantic level and thereby approximate the semantic entropy limits. Experimental results on edge texture map compression show an evident improvement in coding efficiency using SAC without semantic losses, compared to traditional arithmetic coding, which demonstrates its effectiveness.
