Implicit Steganography Beyond the Constraints of Modality
Sojeong Song, Seoyun Yang, Chang D. Yoo, Junmo Kim
TL;DR
The paper addresses cross-modal steganography by proposing INRSteg, which encodes secret data as Implicit Neural Representations (INRs) and distributes them into a stego INR using layer-wise permutation guided by a private key, enabling multi-secret embedding across image, audio, video, and 3D shapes without training new models. It demonstrates state-of-the-art performance in both cross-modal and intra-modal tasks, achieves high capacity with a compact model (~0.4 million parameters), and shows robustness under quantization while remaining hard to detect by steganalysis tools. The approach unifies multimodal data into a single representation, avoids domain adaptation issues, and offers practical efficiency benefits, paving the way for secure and scalable cross-modal steganography. The key innovations include secret-INR allocation, diagonal-block weight updates, and the cryptographic layer-wise permutation with a 128-bit private key, which together provide strong security and flexibility across diverse data types.
Abstract
Cross-modal steganography is committed to hiding secret information of one modality in another modality. Despite the advancement in the field of steganography by the introduction of deep learning, cross-modal steganography still remains to be a challenge to the field. The incompatibility between different modalities not only complicate the hiding process but also results in increased vulnerability to detection. To rectify these limitations, we present INRSteg, an innovative cross-modal steganography framework based on Implicit Neural Representations (INRs). We introduce a novel network allocating framework with a masked parameter update which facilitates hiding multiple data and enables cross modality across image, audio, video and 3D shape. Moreover, we eliminate the necessity of training a deep neural network and therefore substantially reduce the memory and computational cost and avoid domain adaptation issues. To the best of our knowledge, in the field of steganography, this is the first to introduce diverse modalities to both the secret and cover data. Detailed experiments in extreme modality settings demonstrate the flexibility, security, and robustness of INRSteg.
