StegOT: Trade-offs in Steganography via Optimal Transport
Chengde Lin, Xuezhu Gong, Shuxue Ding, Mingzhe Yang, Xijun Lu, Chengjun Mo
TL;DR
StegOT addresses the information balance challenge in image steganography by integrating an autoencoder with a Multi-Channel Optimal Transport module to transform a multi‑peak latent distribution into a single-peak form. This redistribution enables simultaneous preservation of cover information and recovery of the secret, improving both stego and recovered image quality. Empirical results on DIV2K, COCO, and ImageNet show superior PSNR and lower perceptual distance (LPIPS) compared with prior methods, along with robustness under distortions. The approach provides a theoretically motivated and practically effective framework for more reliable steganography with higher fidelity and resilience.
Abstract
Image hiding is often referred to as steganography, which aims to hide a secret image in a cover image of the same resolution. Many steganography models are based on genera-tive adversarial networks (GANs) and variational autoencoders (VAEs). However, most existing models suffer from mode collapse. Mode collapse will lead to an information imbalance between the cover and secret images in the stego image and further affect the subsequent extraction. To address these challenges, this paper proposes StegOT, an autoencoder-based steganography model incorporating optimal transport theory. We designed the multiple channel optimal transport (MCOT) module to transform the feature distribution, which exhibits multiple peaks, into a single peak to achieve the trade-off of information. Experiments demonstrate that we not only achieve a trade-off between the cover and secret images but also enhance the quality of both the stego and recovery images. The source code will be released on https://github.com/Rss1124/StegOT.
