Provably Secure Covert Messaging Using Image-based Diffusion Processes
Luke A. Bauer, Wenxuan Bao, Vincent Bindschaedler
TL;DR
This work tackles covert messaging through diffusion-model outputs by embedding ciphertext in the initial latent space without distorting latent distributions, achieving provable latent-space indistinguishability. It introduces a four-part construction (setup, cryptographic record, embedding, retrieval) that uses a shared key and a thresholded, redundant latent-sign flip scheme, augmented by EDICT dual latents for improved inversion accuracy and error correction. The authors formalize latent-space security via an indistinguishability game and prove distribution preservation under symmetric latent priors, while empirically showing strong security against latent-space detectors and robust covertness under common image transformations. They also analyze performance through capacity-reliability tradeoffs, demonstrate practical processing times, and illustrate robustness improvements over alternative embedding strategies. The approach enables a secure, transformation-resistant covert channel compatible with off-the-shelf latent diffusion models, with potential applications in deniable storage and secure low-capacity communication.
Abstract
We consider the problem of securely and robustly embedding covert messages into an image-based diffusion model's output. The sender and receiver want to exchange the maximum amount of information possible per diffusion sampled image while remaining undetected. The adversary wants to detect that such communication is taking place by identifying those diffusion samples that contain covert messages. To maximize robustness to transformations of the diffusion sample, a strategy is for the sender and the receiver to embed the message in the initial latents. We first show that prior work that attempted this is easily broken because their embedding technique alters the latents' distribution. We then propose a straightforward method to embed covert messages in the initial latent {\em without} altering the distribution. We prove that our construction achieves indistinguishability to any probabilistic polynomial time adversary. Finally, we discuss and analyze empirically the tradeoffs between embedding capacity, message recovery rates, and robustness. We find that optimizing the inversion method for error correction is crucial for reliability.
