Anomalies by Synthesis: Anomaly Detection using Generative Diffusion Models for Off-Road Navigation

Siddharth Ancha; Sunshine Jiang; Travis Manderson; Laura Brandt; Yilun Du; Philip R. Osteen; Nicholas Roy

Anomalies by Synthesis: Anomaly Detection using Generative Diffusion Models for Off-Road Navigation

Siddharth Ancha, Sunshine Jiang, Travis Manderson, Laura Brandt, Yilun Du, Philip R. Osteen, Nicholas Roy

TL;DR

This work tackles robust anomaly detection for off-road navigation by reframing it as a post-hoc analysis-by-synthesis problem. It uses a diffusion model trained on in-distribution data to edit the input image, removing anomalies while preserving non-OOd content, and then detects anomalous regions by comparing the input and edited images in a semantically rich feature space. A principled guided-diffusion mechanism based on an ideal and a tractable approximation of the guidance gradient enables edit-focused sampling without retraining. The pipeline combines MaskCLIP/FeatUp and SAM to produce accurate, pixel-wise anomaly maps, and demonstrates strong gains on RUGD and REllis-3D datasets, with qualitative underwater results, illustrating practical impact for autonomous off-road systems.

Abstract

In order to navigate safely and reliably in off-road and unstructured environments, robots must detect anomalies that are out-of-distribution (OOD) with respect to the training data. We present an analysis-by-synthesis approach for pixel-wise anomaly detection without making any assumptions about the nature of OOD data. Given an input image, we use a generative diffusion model to synthesize an edited image that removes anomalies while keeping the remaining image unchanged. Then, we formulate anomaly detection as analyzing which image segments were modified by the diffusion model. We propose a novel inference approach for guided diffusion by analyzing the ideal guidance gradient and deriving a principled approximation that bootstraps the diffusion model to predict guidance gradients. Our editing technique is purely test-time that can be integrated into existing workflows without the need for retraining or fine-tuning. Finally, we use a combination of vision-language foundation models to compare pixels in a learned feature space and detect semantically meaningful edits, enabling accurate anomaly detection for off-road navigation. Project website: https://siddancha.github.io/anomalies-by-diffusion-synthesis/

Anomalies by Synthesis: Anomaly Detection using Generative Diffusion Models for Off-Road Navigation

TL;DR

Abstract

Anomalies by Synthesis: Anomaly Detection using Generative Diffusion Models for Off-Road Navigation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)