SWIFT: Semantic Watermarking for Image Forgery Thwarting

Gautier Evennou; Vivien Chappelier; Ewa Kijak; Teddy Furon

SWIFT: Semantic Watermarking for Image Forgery Thwarting

Gautier Evennou, Vivien Chappelier, Ewa Kijak, Teddy Furon

TL;DR

SWIFT addresses image integrity under benign and malicious edits by embedding a semantic caption as a high-dimensional unit-norm vector $X \in \mathbb{R}^D$ with $D=256$ into the image using a modified HiDDeN-based encoder/decoder (Hide-$\mathbb{R}$). It decouples watermarking from encoding and uses a variable-rate TCCSK modulation layer to transmit a compressed caption, coupled with a confidence metric $\rho$ linked to the Message Recovery Rate for reliable authenticity decisions. The main contributions include the Hide-$\mathbb{R}$ watermarking layer, caption compression via arithmetic coding guided by an LLM (LLMZip), and a probabilistic decoding framework that provides actionable trust signals under transforms. Practically, SWIFT delivers robust semantic integrity verification suitable for moderation pipelines and watermark-enabled provenance, with demonstrated improvements over state-of-the-art methods across a range of benign and malignant edits.

Abstract

This paper proposes a novel approach towards image authentication and tampering detection by using watermarking as a communication channel for semantic information. We modify the HiDDeN deep-learning watermarking architecture to embed and extract high-dimensional real vectors representing image captions. Our method improves significantly robustness on both malign and benign edits. We also introduce a local confidence metric correlated with Message Recovery Rate, enhancing the method's practical applicability. This approach bridges the gap between traditional watermarking and passive forensic methods, offering a robust solution for image integrity verification.

SWIFT: Semantic Watermarking for Image Forgery Thwarting

TL;DR

SWIFT addresses image integrity under benign and malicious edits by embedding a semantic caption as a high-dimensional unit-norm vector

with

into the image using a modified HiDDeN-based encoder/decoder (Hide-

). It decouples watermarking from encoding and uses a variable-rate TCCSK modulation layer to transmit a compressed caption, coupled with a confidence metric

linked to the Message Recovery Rate for reliable authenticity decisions. The main contributions include the Hide-

watermarking layer, caption compression via arithmetic coding guided by an LLM (LLMZip), and a probabilistic decoding framework that provides actionable trust signals under transforms. Practically, SWIFT delivers robust semantic integrity verification suitable for moderation pipelines and watermark-enabled provenance, with demonstrated improvements over state-of-the-art methods across a range of benign and malignant edits.

Abstract

Paper Structure (18 sections, 8 equations, 6 figures, 2 tables)

This paper contains 18 sections, 8 equations, 6 figures, 2 tables.

Introduction
Related Work
Image forensics
Image watermarking
Method
The message layer
The watermarking layer
The modulation layer
Multi-bit watermarking with confidence
Modulating variable-rate signals
Security
Experiments & Results
Evaluation
Comparison with state of the art
Confidence metric
...and 3 more sections

Figures (6)

Figure 1: Overview of SWIFT. To ensure the integrity of an image, Alice first leverages the message layer to tailor a semantic representation of the image, in our case a caption compressed in a lossless fashion. The resulting bit stream is fed to the TCCSK modulation layer thus enabling security and confidence (see \ref{['sec:decoding_confidence']}), and then to the watermarking layer based on our Hide-$\mathbb{R}$ encoder-decoder neural network. At reception, Bob executes the inverse process with a secret key and obtains both the caption and the p-value $\rho$. If $\rho$ is low enough, Bob can entrust the decoded caption and compare the image he received with the caption, enabling comparison between a proxy of the original image and the received one.
Figure 2: Length distribution of BLIP2 captions encoded by OPT-125m version. We show that the finetuned version leads to entropy reduction and thus is more efficient to encode captions.
Figure 3: Hide-$\mathbb{R}$ architecture. We use a L2 norm to control the watermark power to enforce a target PSNR on both the watermark signal in the embedder and on the decoded vector in the extractor.
Figure 4: Performance of different encoding schemes under additive white Gaussian noise. $S_{256}^{\sqrt{3}}$ is the spherical lattice from spreadingvectors, capable of encoding $22108160 \simeq 2^{24.4}$ messages. TCCSK is most suited for our use case as it can cope with variable length messages.
Figure 5: Representation of the message space after modulation. The circles $\mathcal{V}_{M},\mathcal{V}_{\hat{M}}$ illustrate Voronoï cells mapping to different binary messages on the surface of the hyper-sphere. $X$ is associated to the message to be hidden $M$. The extraction retrieves $Y$, which is decoded into $\hat{M}$, while $\hat{X}$ results from the modulation of $\hat{M}$, accounting for the perfect representation of $\hat{M}$. The similarity between $Y$ and $\hat{X}$ is given by $C$. This provides a confidence score for the decoding as explained in \ref{['sec:decoding_confidence']}. The given example illustrates a failure case with a wrong decoded message and a low confidence reflected by a high value of $\rho$.
...and 1 more figures

SWIFT: Semantic Watermarking for Image Forgery Thwarting

TL;DR

Abstract

SWIFT: Semantic Watermarking for Image Forgery Thwarting

Authors

TL;DR

Abstract

Table of Contents

Figures (6)