Provable Secure Steganography Based on Adaptive Dynamic Sampling
Kaiyi Pang, Minhao Bai
TL;DR
This work tackles the limitation of Provably Secure Steganography (PSS) methods that require explicit model distributions by introducing Adaptive Dynamic Sampling (ADS), a seed-based sampling scheme that preserves the model's distribution without direct access. ADS encodes secret bits via a dynamic collision-set mechanism that maps N-bit prefixes to sampled tokens, expanding the set as needed to maintain capacity while ensuring correct decoding. The authors provide a formal security proof under standard PRG-based indistinguishability and demonstrate through extensive experiments on three LLMs and three datasets that ADS achieves high embedding capacity and entropy utilization with competitive efficiency and deterministic decoding. The approach significantly broadens the practicality of PSS by enabling secure embedding through API-like access, with strong theoretical guarantees and real-world performance comparable to distribution-dependent methods. Overall, ADS offers a scalable, provably secure steganographic framework for covert communication over modern language-model APIs.
Abstract
The security of private communication is increasingly at risk due to widespread surveillance. Steganography, a technique for embedding secret messages within innocuous carriers, enables covert communication over monitored channels. Provably Secure Steganography (PSS), which ensures computational indistinguishability between the normal model output and steganography output, is the state-of-the-art in this field. However, current PSS methods often require obtaining the explicit distributions of the model. In this paper, we propose a provably secure steganography scheme that only requires a model API that accepts a seed as input. Our core mechanism involves sampling a candidate set of tokens and constructing a map from possible message bit strings to these tokens. The output token is selected by applying this mapping to the real secret message, which provably preserves the original model's distribution. To ensure correct decoding, we address collision cases, where multiple candidate messages map to the same token, by maintaining and strategically expanding a dynamic collision set within a bounded size range. Extensive evaluations of three real-world datasets and three large language models demonstrate that our sampling-based method is comparable with existing PSS methods in efficiency and capacity.
