Dual JPEG Compatibility: a Reliable and Explainable Tool for Image Forensics
Etienne Levecque, Jan Butora, Patrick Bas
TL;DR
This work tackles JPEG forgery detection by exploiting block-level compatibility under JPEG pipelines, introducing a dual extension of antecedent search that can handle arbitrary composed pipelines. It formalizes the forward and backward JPEG processes, derives theoretical bounds for constrained local search, and shows that modified blocks produce three mismatch types (grid, quantization, pipeline) that can be detected to localize forgeries at the JPEG-block level. In experiments, the approach outperforms three state-of-the-art deep learning baselines in idealized settings and achieves high block-level localization accuracy, especially for unmodified images or when second compressions strengthen the original quality factor. As a proof-of-concept, the method emphasizes explainability and deterministic detection, while noting limitations related to exact pipeline knowledge and computational cost, and outlining directions for robustness to unknown pipelines and faster search.
Abstract
Given a JPEG pipeline (compression or decompression), this paper demonstrates how to find the antecedent of an 8x8 block. If it exists, the block is considered compatible with the pipeline. For unaltered images, all blocks remain compatible with the original pipeline; however, for manipulated images, this is not necessarily true. This article provides a first demonstration of the potential of compatibility-based approaches for JPEG image forensics. It introduces a method to address the key challenge of finding a block antecedent in a high-dimensional space, relying on a local search algorithm with restrictions on the search space. We show that inpainting, copy-move, and splicing, when applied after JPEG compression, result in three distinct mismatch problems that can be detected. In particular, if the image is re-compressed after modification, the manipulation can be detected when the quality factor of the second compression is higher than that of the first. Through extensive experiments, we highlight the potential of this compatibility attack under varying degrees of assumptions. While our approach shows promising results-outperforming three state-of-the-art deep learning models in an idealized setting-it remains a proof of concept rather than an off-the-shelf forensic tool. Notably, with a perfect knowledge of the JPEG pipeline, our method guarantees zero false alarms in block-by-block localization, given sufficient computational power.
