Hiding Images in Diffusion Models by Editing Learned Score Functions
Haoyu Chen, Yunqiao Yang, Nan Zhong, Kede Ma
TL;DR
This work addresses the challenge of securely hiding images within diffusion models by editing the learned score function at a single, privately chosen timestep. By introducing a secret-key–guided, one-step extraction mechanism and a hybrid parameter-efficient fine-tuning strategy, the approach substantially improves extraction fidelity and embedding efficiency while preserving the diffusion model’s original behavior. Empirical results demonstrate high PSNRs (e.g., $52.90$ for $32\times32$ and $39.33$ for $256\times256$), near-original FID values, and embedding times orders of magnitude faster than prior methods, with support for multi-recipient scenarios via independent keys. The method offers practical diffusion-based neural steganography with strong efficiency and robustness, and points toward future work in adaptive hiding strategies, theoretical bounds on detectability, and robustness to perturbations.
Abstract
Hiding data using neural networks (i.e., neural steganography) has achieved remarkable success across both discriminative classifiers and generative adversarial networks. However, the potential of data hiding in diffusion models remains relatively unexplored. Current methods exhibit limitations in achieving high extraction accuracy, model fidelity, and hiding efficiency due primarily to the entanglement of the hiding and extraction processes with multiple denoising diffusion steps. To address these, we describe a simple yet effective approach that embeds images at specific timesteps in the reverse diffusion process by editing the learned score functions. Additionally, we introduce a parameter-efficient fine-tuning method that combines gradient-based parameter selection with low-rank adaptation to enhance model fidelity and hiding efficiency. Comprehensive experiments demonstrate that our method extracts high-quality images at human-indistinguishable levels, replicates the original model behaviors at both sample and population levels, and embeds images orders of magnitude faster than prior methods. Besides, our method naturally supports multi-recipient scenarios through independent extraction channels.
