Lightning Fast Video Anomaly Detection via Adversarial Knowledge Distillation
Florinel-Alin Croitoru, Nicolae-Catalin Ristea, Dana Dascalescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah
TL;DR
This work tackles real-time video anomaly detection by training a lightweight frame-level convolutional-transformer student to imitate two strong object-centric teachers through standard distillation ($\mathcal{L}_{\mathrm{KD}}$) and novel adversarial distillation ($\mathcal{L}_{\mathrm{AKD}}$). The approach uses a two-phase training regime, replacing heavy detectors with a downsampling, multi-head transformer that outputs multi-scale anomaly maps, and combining teacher guidance from two sources to improve generalization. Empirical results on Avenue, ShanghaiTech, and UCSD Ped2 show the method achieves an exceptional speed-accuracy balance, up to $1480$ FPS and competitive micro/macro AUC scores, while reducing GFLOPs and memory usage. The work demonstrates the practicality of deploying surveillance-scale anomaly detection with minimal hardware demands, and discusses avenues for strengthening accuracy via additional teachers and tasks.
Abstract
We propose a very fast frame-level model for anomaly detection in video, which learns to detect anomalies by distilling knowledge from multiple highly accurate object-level teacher models. To improve the fidelity of our student, we distill the low-resolution anomaly maps of the teachers by jointly applying standard and adversarial distillation, introducing an adversarial discriminator for each teacher to distinguish between target and generated anomaly maps. We conduct experiments on three benchmarks (Avenue, ShanghaiTech, UCSD Ped2), showing that our method is over 7 times faster than the fastest competing method, and between 28 and 62 times faster than object-centric models, while obtaining comparable results to recent methods. Our evaluation also indicates that our model achieves the best trade-off between speed and accuracy, due to its previously unheard-of speed of 1480 FPS. In addition, we carry out a comprehensive ablation study to justify our architectural design choices. Our code is freely available at: https://github.com/ristea/fast-aed.
