Event-aided Semantic Scene Completion

Shangwei Guo; Hao Shi; Song Wang; Xiaoting Yin; Kailun Yang; Kaiwei Wang

Event-aided Semantic Scene Completion

Shangwei Guo, Hao Shi, Song Wang, Xiaoting Yin, Kailun Yang, Kaiwei Wang

TL;DR

This work tackles robust 3D scene understanding for autonomous driving by augmenting Semantic Scene Completion with event-camera data. It introduces DSEC-SSC, the first real-world event-enabled SSC dataset with a deployable 4D labeling pipeline, and EvSSC, an RGB-Event fusion framework built around an Event-aided Lifting Module (ELM) that bridges 2D features to 3D occupancy. EvSSC demonstrates consistent gains across transformer- and LSS-based SSC models, achieving up to $52.5\%$ relative improvement in $mIoU$ on corrupted data and enhanced performance under motion blur and adverse weather, while maintaining modest latency and memory overhead. The publicly released dataset and codebase support broader adoption and further exploration of event-based semantic scene understanding for safer, more reliable autonomous perception.

Abstract

Autonomous driving systems rely on robust 3D scene understanding. Recent advances in Semantic Scene Completion (SSC) for autonomous driving underscore the limitations of RGB-based approaches, which struggle under motion blur, poor lighting, and adverse weather. Event cameras, offering high dynamic range and low latency, address these challenges by providing asynchronous data that complements RGB inputs. We present DSEC-SSC, the first real-world benchmark specifically designed for event-aided SSC, which includes a novel 4D labeling pipeline for generating dense, visibility-aware labels that adapt dynamically to object motion. Our proposed RGB-Event fusion framework, EvSSC, introduces an Event-aided Lifting Module (ELM) that effectively bridges 2D RGB-Event features to 3D space, enhancing view transformation and the robustness of 3D volume construction across SSC models. Extensive experiments on DSEC-SSC and simulated SemanticKITTI-E demonstrate that EvSSC is adaptable to both transformer-based and LSS-based SSC architectures. Notably, evaluations on SemanticKITTI-C demonstrate that EvSSC achieves consistently improved prediction accuracy across five degradation modes and both In-domain and Out-of-domain settings, achieving up to a 52.5% relative improvement in mIoU when the image sensor partially fails. Additionally, we quantitatively and qualitatively validate the superiority of EvSSC under motion blur and extreme weather conditions, where autonomous driving is challenged. The established datasets and our codebase will be made publicly at https://github.com/Pandapan01/EvSSC.

Event-aided Semantic Scene Completion

TL;DR

Abstract

Event-aided Semantic Scene Completion

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)