PSyDUCK: Training-Free Steganography for Latent Diffusion
Aqib Mahfuz, Georgia Channing, Mark van der Wilk, Philip Torr, Fabio Pizzati, Christian Schroeder de Witt
TL;DR
PSyDUCK tackles secure, high-capacity steganography with latent diffusion models by providing a training-free, model-agnostic framework that leverages controlled divergence and local mixing in the denoising process. It extends to latent-space video diffusion, delivering superior encoding capacity and robustness over pixel-space baselines and prior latent approaches, without retraining. The work offers theoretical guarantees of bounded security error and indistinguishable-noise security, complemented by extensive image and video experiments showing high recovery accuracy and low detectability. Collectively, PSyDUCK enables practical, scalable generative steganography for real-world applications using latent diffusion models.
Abstract
Recent advances in generative AI have opened promising avenues for steganography, which can securely protect sensitive information for individuals operating in hostile environments, such as journalists, activists, and whistleblowers. However, existing methods for generative steganography have significant limitations, particularly in scalability and their dependence on retraining diffusion models. We introduce PSyDUCK, a training-free, model-agnostic steganography framework specifically designed for latent diffusion models. PSyDUCK leverages controlled divergence and local mixing within the latent denoising process, enabling high-capacity, secure message embedding without compromising visual fidelity. Our method dynamically adapts embedding strength to balance accuracy and detectability, significantly improving upon existing pixel-space approaches. Crucially, PSyDUCK extends generative steganography to latent-space video diffusion models, surpassing previous methods in both encoding capacity and robustness. Extensive experiments demonstrate PSyDUCK's superiority over state-of-the-art techniques, achieving higher transmission accuracy and lower detectability rates across diverse image and video datasets. By overcoming the key challenges associated with latent diffusion model architectures, PSyDUCK sets a new standard for generative steganography, paving the way for scalable, real-world steganographic applications.
