Skeletal Video Anomaly Detection using Deep Learning: Survey, Challenges and Future Directions
Pratik K. Mishra, Alex Mihailidis, Shehroz S. Khan
TL;DR
This paper addresses privacy concerns in video anomaly detection by advocating skeleton-based representations that obviate facial identity and appearance details. It surveys deep learning approaches organized into four learning paradigms—reconstruction, prediction, their combinations, and other methods—across 29 papers, highlighting graph-based and sequence-based models that operate on spatio-temporal skeleton graphs. The review identifies datasets, evaluation metrics, and practical challenges such as pose estimation accuracy, multi-person tracking, and thresholding, and discusses future directions like privacy-preserving multimodal fusion and federated learning. The work aims to catalyze the adoption of privacy-protecting anomaly detection in homes, care facilities, and other sensitive environments, ultimately improving safety and quality of life while maintaining individual privacy.
Abstract
The existing methods for video anomaly detection mostly utilize videos containing identifiable facial and appearance-based features. The use of videos with identifiable faces raises privacy concerns, especially when used in a hospital or community-based setting. Appearance-based features can also be sensitive to pixel-based noise, straining the anomaly detection methods to model the changes in the background and making it difficult to focus on the actions of humans in the foreground. Structural information in the form of skeletons describing the human motion in the videos is privacy-protecting and can overcome some of the problems posed by appearance-based features. In this paper, we present a survey of privacy-protecting deep learning anomaly detection methods using skeletons extracted from videos. We present a novel taxonomy of algorithms based on the various learning approaches. We conclude that skeleton-based approaches for anomaly detection can be a plausible privacy-protecting alternative for video anomaly detection. Lastly, we identify major open research questions and provide guidelines to address them.
