Exposing AI-generated Videos: A Benchmark Dataset and a Local-and-Global Temporal Defect Based Detection Method

Peisong He; Leyao Zhu; Jiaxing Li; Shiqi Wang; Haoliang Li

TL;DR

This work addresses the security risks of AI-generated video by (1) building a diverse dataset of diffusion-generated videos, including samples degraded by realistic lossy operations, and (2) proposing a detector that exploits both local motion prediction errors and global appearance variation, fused via channel attention. The approach is reported to generalize well across generators and to remain robust to lossy video operations, surpassing the baselines. Together, the dataset and method offer a practical benchmark for evaluating and advancing AI-generated video forensics under varied generation and transmission conditions.

Abstract

Generative models have made significant advances in creating realistic videos, which raises security concerns. However, this emerging risk has not been adequately addressed because no benchmark dataset for AI-generated videos exists. In this paper, we first construct a video dataset using advanced diffusion-based video generation algorithms with diverse semantic content. In addition, typical lossy video operations applied during network transmission are used to generate degraded samples. Then, by analyzing local and global temporal defects of current AI-generated videos, we construct a novel detection framework that adaptively learns local motion information and global appearance variation to expose fake videos. Finally, experiments evaluate the generalization and robustness of different spatial- and temporal-domain detection methods; the results can serve as a baseline and illustrate the research challenges for future studies.
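
To make the two cues named in the TL;DR more concrete, the following is a minimal Python sketch, not the authors' implementation. It assumes a naive "previous frame" predictor as a stand-in for the paper's frame predictor, the spread of per-frame mean intensity as a stand-in for global appearance variation, and a squeeze-and-excitation style sigmoid gate as a stand-in for the channel attention fusion. All function names, the frame layout (flat lists of intensities), and the gate parameters are hypothetical.

```python
import math


def frame_prediction_errors(frames):
    """Local cue (assumed form): mean absolute error of a naive predictor
    that guesses each frame to be identical to the previous one.

    frames: list of frames, each a flat list of pixel intensities.
    Returns one error value per consecutive frame pair.
    """
    errors = []
    for prev, cur in zip(frames, frames[1:]):
        errors.append(sum(abs(c - p) for p, c in zip(prev, cur)) / len(cur))
    return errors


def global_appearance_variation(frames):
    """Global cue (assumed form): standard deviation of the per-frame
    mean intensity over the whole clip."""
    means = [sum(f) / len(f) for f in frames]
    mu = sum(means) / len(means)
    return math.sqrt(sum((m - mu) ** 2 for m in means) / len(means))


def channel_attention_fuse(channels, weights, biases):
    """Squeeze-and-excitation style gating (an assumed stand-in).

    channels: list of feature vectors, one per channel.
    weights:  one row of weights per channel, applied to the squeezed vector.
    biases:   one bias per channel.
    Returns (reweighted channels, gates).
    """
    # Squeeze: summarize each channel by its average value.
    squeezed = [sum(c) / len(c) for c in channels]
    # Excite: one linear layer followed by a sigmoid gives a gate per channel.
    gates = []
    for row, b in zip(weights, biases):
        z = sum(w * s for w, s in zip(row, squeezed)) + b
        gates.append(1.0 / (1.0 + math.exp(-z)))
    # Scale every channel by its gate.
    fused = [[g * v for v in c] for g, c in zip(gates, channels)]
    return fused, gates


# Toy clip: three 4-pixel frames with one abrupt change.
clip = [[0, 0, 0, 0], [1, 1, 1, 1], [1, 1, 1, 1]]
local = frame_prediction_errors(clip)
global_var = global_appearance_variation(clip)
# Untrained (all-zero) gate parameters are used purely for illustration.
fused, gates = channel_attention_fuse(
    [local, [global_var] * len(local)],
    weights=[[0.0, 0.0], [0.0, 0.0]],
    biases=[0.0, 0.0],
)
```

In a learned detector the gate parameters would be trained so that the more discriminative cue receives a larger weight; the all-zero parameters here only demonstrate the data flow.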

Paper Structure (23 sections, 3 equations, 2 figures, 3 tables)

Introduction
Related Works
Video Generation Algorithm
AIGC Multimedia Forensics Dataset
AI-generated Video Dataset
Data Collection
Feature Analysis
Various Contents
Various Video Generators
Video Lossy Operations
Detection method
Representation Learning of Local Motion Information
Extraction of Frame Prediction Error
Temporal Aggregation
Representation Learning of Global Appearance Variation
...and 8 more sections

Figures (2)

Figure 1: Examples of AI-generated videos and feature analysis of our dataset.
Figure 2: The proposed AI-generated video detection framework based on local and global temporal defects.
