Predict and Resist: Long-Term Accident Anticipation under Sensor Noise

Xingcheng Liu; Bin Rao; Yanchen Guan; Chengyue Wang; Haicheng Liao; Jiaxun Zhang; Chengyu Lin; Meixin Zhu; Zhenning Li

Predict and Resist: Long-Term Accident Anticipation under Sensor Noise

Xingcheng Liu, Bin Rao, Yanchen Guan, Chengyue Wang, Haicheng Liao, Jiaxun Zhang, Chengyu Lin, Meixin Zhu, Zhenning Li

TL;DR

Addresses the challenge of accident anticipation under noisy sensor inputs while requiring timely warnings. Proposes a unified framework that fuses diffusion-based denoising with a time-aware actor-critic for long-horizon risk forecasting, enhanced by state-history processing and dual-level feature refinement. Key contributions include framing accident anticipation as a long-horizon credit assignment problem, image- and object-level diffusion modules for noise-robust features, a time-weighted anticipation loss and actor-critic objective, and state-of-the-art AP and mean Time-to-Accident ($\text{mTTA}$) on DAD, CCD, and A3D under Gaussian and impulse noise. Qualitative analyses show earlier, more stable, human-aligned predictions in routine and complex traffic, supporting strong potential for real-world safety deployment. Overall, the work advances proactive autonomous driving by enabling robust, early warnings in degraded sensing environments.

Abstract

Accident anticipation is essential for proactive and safe autonomous driving, where even a brief advance warning can enable critical evasive actions. However, two key challenges hinder real-world deployment: (1) noisy or degraded sensory inputs from weather, motion blur, or hardware limitations, and (2) the need to issue timely yet reliable predictions that balance early alerts with false-alarm suppression. We propose a unified framework that integrates diffusion-based denoising with a time-aware actor-critic model to address these challenges. The diffusion module reconstructs noise-resilient image and object features through iterative refinement, preserving critical motion and interaction cues under sensor degradation. In parallel, the actor-critic architecture leverages long-horizon temporal reasoning and time-weighted rewards to determine the optimal moment to raise an alert, aligning early detection with reliability. Experiments on three benchmark datasets (DAD, CCD, A3D) demonstrate state-of-the-art accuracy and significant gains in mean time-to-accident, while maintaining robust performance under Gaussian and impulse noise. Qualitative analyses further show that our model produces earlier, more stable, and human-aligned predictions in both routine and highly complex traffic scenarios, highlighting its potential for real-world, safety-critical deployment.

Predict and Resist: Long-Term Accident Anticipation under Sensor Noise

TL;DR

Abstract

Predict and Resist: Long-Term Accident Anticipation under Sensor Noise

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)