SWIFT: Semantic Watermarking for Image Forgery Thwarting
Gautier Evennou, Vivien Chappelier, Ewa Kijak, Teddy Furon
TL;DR
SWIFT addresses image integrity under benign and malicious edits by embedding a semantic caption as a high-dimensional unit-norm vector $X \in \mathbb{R}^D$ with $D=256$ into the image using a modified HiDDeN-based encoder/decoder (Hide-$\mathbb{R}$). It decouples watermarking from encoding and uses a variable-rate TCCSK modulation layer to transmit a compressed caption, coupled with a confidence metric $\rho$ linked to the Message Recovery Rate for reliable authenticity decisions. The main contributions include the Hide-$\mathbb{R}$ watermarking layer, caption compression via arithmetic coding guided by an LLM (LLMZip), and a probabilistic decoding framework that provides actionable trust signals under transforms. Practically, SWIFT delivers robust semantic integrity verification suitable for moderation pipelines and watermark-enabled provenance, with demonstrated improvements over state-of-the-art methods across a range of benign and malignant edits.
Abstract
This paper proposes a novel approach towards image authentication and tampering detection by using watermarking as a communication channel for semantic information. We modify the HiDDeN deep-learning watermarking architecture to embed and extract high-dimensional real vectors representing image captions. Our method improves significantly robustness on both malign and benign edits. We also introduce a local confidence metric correlated with Message Recovery Rate, enhancing the method's practical applicability. This approach bridges the gap between traditional watermarking and passive forensic methods, offering a robust solution for image integrity verification.
