Harmfully Manipulated Images Matter in Multimodal Misinformation Detection

Bing Wang; Shengsheng Wang; Changchun Li; Renchu Guan; Ximing Li

Harmfully Manipulated Images Matter in Multimodal Misinformation Detection

Bing Wang, Shengsheng Wang, Changchun Li, Renchu Guan, Ximing Li

TL;DR

This work addresses multimodal misinformation detection by leveraging manipulation traces in images and the underlying manipulation intentions (harmful vs harmless). It introduces Hami-m$^3$d, a three-task model that learns manipulation features $\mathbf{e}^M$ and intention features $\mathbf{e}^E$ through a four-encoder architecture and a multi-head attention fusion, supervised by a veracity predictor plus auxiliary manipulation and intention classifiers. To overcome the lack of ground-truth labels for manipulation and intention, the method uses two weakly supervised signals: a manipulation teacher trained on external image manipulation data with PU adaptation and a PU-based objective for intention, along with a reliability-based pruning mechanism. Extensive experiments on GossipCop, Weibo, and Twitter show consistent improvements over strong baselines, with ablations confirming the value of both the manipulation/intention features and the PU-based supervision. The approach offers a scalable, weakly supervised pathway to incorporate manipulation cues into practical MMD systems, potentially improving resilience to multimodal misinformation.

Abstract

Nowadays, misinformation is widely spreading over various social media platforms and causes extremely negative impacts on society. To combat this issue, automatically identifying misinformation, especially those containing multimodal content, has attracted growing attention from the academic and industrial communities, and induced an active research topic named Multimodal Misinformation Detection (MMD). Typically, existing MMD methods capture the semantic correlation and inconsistency between multiple modalities, but neglect some potential clues in multimodal content. Recent studies suggest that manipulated traces of the images in articles are non-trivial clues for detecting misinformation. Meanwhile, we find that the underlying intentions behind the manipulation, e.g., harmful and harmless, also matter in MMD. Accordingly, in this work, we propose to detect misinformation by learning manipulation features that indicate whether the image has been manipulated, as well as intention features regarding the harmful and harmless intentions of the manipulation. Unfortunately, the manipulation and intention labels that make these features discriminative are unknown. To overcome the problem, we propose two weakly supervised signals as alternatives by introducing additional datasets on image manipulation detection and formulating two classification tasks as positive and unlabeled learning problems. Based on these ideas, we propose a novel MMD method, namely Harmfully Manipulated Images Matter in MMD (HAMI-M3D). Extensive experiments across three benchmark datasets can demonstrate that HAMI-M3D can consistently improve the performance of any MMD baselines.

Harmfully Manipulated Images Matter in Multimodal Misinformation Detection

TL;DR

This work addresses multimodal misinformation detection by leveraging manipulation traces in images and the underlying manipulation intentions (harmful vs harmless). It introduces Hami-m

d, a three-task model that learns manipulation features

and intention features

through a four-encoder architecture and a multi-head attention fusion, supervised by a veracity predictor plus auxiliary manipulation and intention classifiers. To overcome the lack of ground-truth labels for manipulation and intention, the method uses two weakly supervised signals: a manipulation teacher trained on external image manipulation data with PU adaptation and a PU-based objective for intention, along with a reliability-based pruning mechanism. Extensive experiments on GossipCop, Weibo, and Twitter show consistent improvements over strong baselines, with ablations confirming the value of both the manipulation/intention features and the PU-based supervision. The approach offers a scalable, weakly supervised pathway to incorporate manipulation cues into practical MMD systems, potentially improving resilience to multimodal misinformation.

Abstract

Paper Structure (16 sections, 11 equations, 5 figures, 4 tables, 1 algorithm)

This paper contains 16 sections, 11 equations, 5 figures, 4 tables, 1 algorithm.

Introduction
Related Works
Multimodal Misinformation Detection
Positive-Unlabeled Learning
Proposed Hami-m$^3$d Method
Overview of Hami-m$^3$d
Manipulation Classification
Intention Classification
Experiments
Experimental Settings
Main Results
Ablative Study
Sensitivity Analysis
Visualization Analysis
Case Study
...and 1 more sections

Figures (5)

Figure 1: The statistics on an MMD dataset Twitter illustrate the quantitative relationship between image manipulation and veracity labels. We use a pre-trained image manipulation detector to discriminate whether the image has been manipulated. We also provide several examples of images manipulated with harmful and harmless intentions.
Figure 2: The overall framework of Hami-m$^3$d. Given text content $\mathbf{x}_i^T$ and an image $\mathbf{x}_i^I$, we use four encoders including text encoder, image encoder, manipulation encoder, and intention encoder to extract their corresponding features. These features are then input into a feature fusion network to obtain a fused feature. Finally, we propose three predictors to achieve three different tasks: veracity classification, manipulation classification, and intention classification.
Figure 3: Sensitivity analysis of the parameters $\alpha$ and $\beta$.
Figure 4: Visualization analysis of features $\mathbf{z}$, $\mathbf{e}^M$ and $\mathbf{e}^E$ with the T-SNE method.
Figure 5: We illustrate three representative examples for the case study.

Harmfully Manipulated Images Matter in Multimodal Misinformation Detection

TL;DR

Abstract

Harmfully Manipulated Images Matter in Multimodal Misinformation Detection

Authors

TL;DR

Abstract

Table of Contents

Figures (5)