LATTE: Latent Trajectory Embedding for Diffusion-Generated Image Detection

Ana Vasilcoiu; Ivona Najdenkoska; Zeno Geradts; Marcel Worring

LATTE: Latent Trajectory Embedding for Diffusion-Generated Image Detection

Ana Vasilcoiu, Ivona Najdenkoska, Zeno Geradts, Marcel Worring

TL;DR

LATTE introduces latent trajectory embedding to detect diffusion-generated images by modeling the evolution of latent representations across multiple denoising steps. It extracts one-step-denoised latent states at selected timesteps, fuses them with visual features via transformer decoders, and aggregates them into a discriminative representation that is classified with a lightweight oracle. Across GenImage, Chameleon, and Diffusion Forensics, LATTE achieves state-of-the-art performance, with strong cross-generator and cross-domain robustness, including large gains on challenging subsets. The approach highlights latent trajectory modeling as a powerful direction for forensic detection of synthetic media with practical implications for digital trust and media verification.

Abstract

The rapid advancement of diffusion-based image generators has made it increasingly difficult to distinguish generated from real images. This erodes trust in digital media, making it critical to develop generated image detectors that remain reliable across different generators. While recent approaches leverage diffusion denoising cues, they typically rely on single-step reconstruction errors and overlook the sequential nature of the denoising process. In this work, we propose LATTE - LATent Trajectory Embedding - a novel approach that models the evolution of latent embeddings across multiple denoising steps. Instead of treating each denoising step in isolation, LATTE captures the trajectory of these representations, revealing subtle and discriminative patterns that distinguish real from generated images. Experiments on several benchmarks, such as GenImage, Chameleon, and Diffusion Forensics, show that LATTE achieves superior performance, especially in challenging cross-generator and cross-dataset scenarios, highlighting the potential of latent trajectory modeling. The code is available on the following link: https://github.com/AnaMVasilcoiu/LATTE-Diffusion-Detector.

LATTE: Latent Trajectory Embedding for Diffusion-Generated Image Detection

TL;DR

Abstract

LATTE: Latent Trajectory Embedding for Diffusion-Generated Image Detection

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)