Evaluating Durability: Benchmark Insights into Multimodal Watermarking
Jielin Qiu, William Han, Xuandong Zhao, Shangbang Long, Christos Faloutsos, Lei Li
TL;DR
This work addresses the robustness of multimodal watermarks embedded in content generated by image and text models. It introduces a cross-modal benchmark thatinjects 4 image watermarks and 4 text watermarks into 5,000 image-caption and 5,000 caption-generated images across 100 image perturbations and 63 text perturbations, evaluated over 16 diverse benchmark models. The study finds that watermark robustness is highly sensitive to distribution shifts, with perturbation type driving performance (e.g., Zoom Blur and OCR errors being particularly disruptive) and model choice shaping outcomes (e.g., SDXL-Lightning excels for image robustness, while LLaVA fares better for text). By systematically contrasting watermarking strategies and perturbation categories, the work provides actionable guidance for developing more robust multimodal watermarking techniques and offers a public codebase and benchmark for future research.
Abstract
With the development of large models, watermarks are increasingly employed to assert copyright, verify authenticity, or monitor content distribution. As applications become more multimodal, the utility of watermarking techniques becomes even more critical. The effectiveness and reliability of these watermarks largely depend on their robustness to various disturbances. However, the robustness of these watermarks in real-world scenarios, particularly under perturbations and corruption, is not well understood. To highlight the significance of robustness in watermarking techniques, our study evaluated the robustness of watermarked content generated by image and text generation models against common real-world image corruptions and text perturbations. Our results could pave the way for the development of more robust watermarking techniques in the future. Our project website can be found at \url{https://mmwatermark-robustness.github.io/}.
