Pedestrian intention prediction in Adverse Weather Conditions with Spiking Neural Networks and Dynamic Vision Sensors
Mustafa Sakhai, Szymon Mazurek, Jakub Caputa, Jan K. Argasiński, Maciej Wielgosz
TL;DR
The paper tackles robust pedestrian detection and intention prediction under adverse weather by integrating Spiking Neural Networks (SNNs) with Dynamic Vision Sensors (DVS) and evaluating them against CNN baselines using a CARLA-generated dataset. It introduces PLIF-based SNNs with surrogate gradient training, augmented by ResNet adaptations and Temporally Effective Batch Normalization (TEBN), and analyzes two tasks: clip-level detection and horizon-based prediction, across weather-variant scenes. Key findings show that SNNs with DVS achieve superior energy efficiency and competitive or superior accuracy in difficult conditions, especially as temporal windows widen, while RGB-based CNNs remain stronger in clear weather. The work suggests a hybrid deployment strategy that leverages DVS+SNN for adverse conditions and RGB+CNN for normal conditions, potentially enhancing safety and efficiency in autonomous-vehicle perception systems. The study provides publicly available code and datasets to foster reproducibility and future exploration of DVS-enabled neuromorphic perception in intelligent transportation systems.
Abstract
This study examines the effectiveness of Spiking Neural Networks (SNNs) paired with Dynamic Vision Sensors (DVS) to improve pedestrian detection in adverse weather, a significant challenge for autonomous vehicles. Utilizing the high temporal resolution and low latency of DVS, which excels in dynamic, low-light, and high-contrast environments, we assess the efficiency of SNNs compared to traditional Convolutional Neural Networks (CNNs). Our experiments involved testing across diverse weather scenarios using a custom dataset from the CARLA simulator, mirroring real-world variability. SNN models, enhanced with Temporally Effective Batch Normalization, were trained and benchmarked against state-of-the-art CNNs to demonstrate superior accuracy and computational efficiency in complex conditions such as rain and fog. The results indicate that SNNs, integrated with DVS, significantly reduce computational overhead and improve detection accuracy in challenging conditions compared to CNNs. This highlights the potential of DVS combined with bio-inspired SNN processing to enhance autonomous vehicle perception and decision-making systems, advancing intelligent transportation systems' safety features in varying operational environments. Additionally, our research indicates that SNNs perform more efficiently in handling long perception windows and prediction tasks, rather than simple pedestrian detection.
