uTRAND: Unsupervised Anomaly Detection in Traffic Trajectories
Giacomo D'Amicantonio, Egor Bondarau, Peter H. N. de With
TL;DR
The paper tackles unsupervised anomaly detection in traffic trajectories by shifting from pixel-based analysis to a semantic-topological representation of intersections. It constructs a patch-based graph over semantically segmented BEV images (using ITF maps) and learns normal inter-node transitions to detect anomalous trajectories, enabling human-readable classification rules. The methodology combines automated camera calibration (GNN-based homographies), agent detection/tracking (YOLOv8 with BEV fusion), and a graph-based anomaly detector with attributes $S_{node_i}$, $T_{avg}$, and $A$, all operating without labeled anomaly data. On real-world multi-camera data, uTRAND outperforms state-of-the-art unsupervised methods in $F_1$ while providing interpretable explanations for detected anomalies.
Abstract
Deep learning-based approaches have achieved significant improvements on public video anomaly datasets, but often do not perform well in real-world applications. This paper addresses two issues: the lack of labeled data and the difficulty of explaining the predictions of a neural network. To this end, we present a framework called uTRAND, that shifts the problem of anomalous trajectory prediction from the pixel space to a semantic-topological domain. The framework detects and tracks all types of traffic agents in bird's-eye-view videos of traffic cameras mounted at an intersection. By conceptualizing the intersection as a patch-based graph, it is shown that the framework learns and models the normal behaviour of traffic agents without costly manual labeling. Furthermore, uTRAND allows to formulate simple rules to classify anomalous trajectories in a way suited for human interpretation. We show that uTRAND outperforms other state-of-the-art approaches on a dataset of anomalous trajectories collected in a real-world setting, while producing explainable detection results.
