ShapeAug: Occlusion Augmentation for Event Camera Data
Katharina Bendig, René Schuster, Didier Stricker
TL;DR
This work tackles occlusion in event-camera data by introducing ShapeAug, an occlusion-aware augmentation that simulates moving foreground shapes to generate both occlusion and the associated events across temporal slices $T$. By sampling $N\in[1,5]$ shapes with random start positions, sizes up to $s_{max}$, speed $v$, and movement angle, ShapeAug preserves temporal coherence while injecting realistic occlusion dynamics. Evaluations on multiple DVS datasets and the Gen1 automotive dataset show consistent improvements in top-1 accuracy (up to $6.5\%$) and pedestrian AP (over $5\%$), demonstrating robustness to occlusion and compatibility with other augmentations. The results highlight ShapeAug’s practical value for robust event-based classification and detection in dynamic driving scenarios, while pointing to future work in more complex shapes and motion patterns to further bridge the gap to real-world scenes.
Abstract
Recently, Dynamic Vision Sensors (DVSs) sparked a lot of interest due to their inherent advantages over conventional RGB cameras. These advantages include a low latency, a high dynamic range and a low energy consumption. Nevertheless, the processing of DVS data using Deep Learning (DL) methods remains a challenge, particularly since the availability of event training data is still limited. This leads to a need for event data augmentation techniques in order to improve accuracy as well as to avoid over-fitting on the training data. Another challenge especially in real world automotive applications is occlusion, meaning one object is hindering the view onto the object behind it. In this paper, we present a novel event data augmentation approach, which addresses this problem by introducing synthetic events for randomly moving objects in a scene. We test our method on multiple DVS classification datasets, resulting in an relative improvement of up to 6.5 % in top1-accuracy. Moreover, we apply our augmentation technique on the real world Gen1 Automotive Event Dataset for object detection, where we especially improve the detection of pedestrians by up to 5 %.
