Table of Contents
Fetching ...

Computation and Critical Transitions of Rate-Distortion-Perception Functions With Wasserstein Barycenter

Chunhui Chen, Xueyan Niu, Wenhao Ye, Hao Wu, Bo Bai

TL;DR

A reverse data hiding scheme that imperceptibly embeds a secret message into an image, ensuring perceptual fidelity and achieving a significant improvement in the perceptual quality of the stego image compared to traditional methods under the same embedding rate is proposed.

Abstract

The information rate-distortion-perception (RDP) function characterizes the three-way trade-off between description rate, average distortion, and perceptual quality measured by discrepancy between probability distributions and has been applied to emerging areas in communications empowered by generative modeling. We study several variants of the RDP functions through the lens of optimal transport to characterize their critical transitions. By transforming the information RDP function into a Wasserstein Barycenter problem, we identify the critical transitions when one of the constraints becomes inactive. Further, the non-strictly convexity brought by the perceptual constraint can be regularized by an entropy regularization term. We prove that the entropy regularized model converges to the original problem and propose an alternating iteration method based on the Sinkhorn algorithm to numerically solve the regularized optimization problem. In many practical scenarios, the computation of the Distortion-Rate-Perception (DRP) function offers a solution to minimize distortion and perceptual discrepancy under rate constraints. However, the interchange of the rate objective and the distortion constraint significantly amplifies the complexity. The proposed method effectively addresses this complexity, providing an efficient solution for DRP functions. Using our numerical method, we propose a reverse data hiding scheme that imperceptibly embeds a secret message into an image, ensuring perceptual fidelity and achieving a significant improvement in the perceptual quality of the stego image compared to traditional methods under the same embedding rate. Our theoretical results and numerical method lay an attractive foundation for steganographic communications with perceptual quality constraints.

Computation and Critical Transitions of Rate-Distortion-Perception Functions With Wasserstein Barycenter

TL;DR

A reverse data hiding scheme that imperceptibly embeds a secret message into an image, ensuring perceptual fidelity and achieving a significant improvement in the perceptual quality of the stego image compared to traditional methods under the same embedding rate is proposed.

Abstract

The information rate-distortion-perception (RDP) function characterizes the three-way trade-off between description rate, average distortion, and perceptual quality measured by discrepancy between probability distributions and has been applied to emerging areas in communications empowered by generative modeling. We study several variants of the RDP functions through the lens of optimal transport to characterize their critical transitions. By transforming the information RDP function into a Wasserstein Barycenter problem, we identify the critical transitions when one of the constraints becomes inactive. Further, the non-strictly convexity brought by the perceptual constraint can be regularized by an entropy regularization term. We prove that the entropy regularized model converges to the original problem and propose an alternating iteration method based on the Sinkhorn algorithm to numerically solve the regularized optimization problem. In many practical scenarios, the computation of the Distortion-Rate-Perception (DRP) function offers a solution to minimize distortion and perceptual discrepancy under rate constraints. However, the interchange of the rate objective and the distortion constraint significantly amplifies the complexity. The proposed method effectively addresses this complexity, providing an efficient solution for DRP functions. Using our numerical method, we propose a reverse data hiding scheme that imperceptibly embeds a secret message into an image, ensuring perceptual fidelity and achieving a significant improvement in the perceptual quality of the stego image compared to traditional methods under the same embedding rate. Our theoretical results and numerical method lay an attractive foundation for steganographic communications with perceptual quality constraints.
Paper Structure (36 sections, 17 theorems, 102 equations, 10 figures, 1 table, 2 algorithms)

This paper contains 36 sections, 17 theorems, 102 equations, 10 figures, 1 table, 2 algorithms.

Key Result

Theorem 1

The optimal solution to the RDP function, as defined by equation eq0_0, coincides with the optimal solution of the subsequent optimization problem when the distance metric $d(\cdot,\cdot)$ in equation eq0_0_d is specified as the Wasserstein metric. Furthermore, the pair $(\bm{w},\bm{r})$, which repr

Figures (10)

  • Figure 1: The schematic diagram of distortion-perception cross-sections and the corresponding transition curves where one of the constraints is removed.
  • Figure 2: The rate-distortion-perception functions obtained by our method and a) the binary source with TV distance and its analytical solution; b) the binary source with KL divergence; c) the Gaussian source with Wasserstein-2 metric and its analytical solution; d) the Gaussian source with TV distance.
  • Figure 3: The rate-distortion-perception functions obtained by our method. a) the binary source with TV distance. b) the binary source with KL divergence. c) the Gaussian source with Wasserstein-2 metric d) the Gaussian source with TV distance.
  • Figure 4: The convergent trajectories of $r$ from our method and a) the binary source with different distortion parameters; b) the Gaussian source with different distortion parameters.
  • Figure 5: Distortion-perception cross-sections under different rates obtained by our method and a) the binary source with TV distance and its analytical critical transition curve; b) the binary source with KL divergence; c) the Gaussian source with Wasserstein-2 metric and its analytical critical transition curve; d) the Gaussian source with TV distance. Red cross points are the critical transition points obtained by our method.
  • ...and 5 more figures

Theorems & Definitions (49)

  • Definition 1: information RDP function blau2019rethinking
  • Theorem 1
  • Definition 2
  • Definition 3
  • Proposition 1: blau2019rethinking
  • Definition 4
  • Theorem 2
  • Remark 1
  • Remark 2
  • Remark 3
  • ...and 39 more