E-RGB-D: Real-Time Event-Based Perception with Structured Light
Seyed Ehsan Marjani Bajestani, Giovanni Beltrame
TL;DR
Monochrome event cameras struggle to capture color and rapid RGB-D information in dynamic scenes. The authors integrate a DLP-based Adaptive Structured Light system with an event camera to achieve per-event color and depth without frame aggregation, enabling frameless, high-speed RGB-D sensing. Key contributions include per-event disparity lookups, flexible pattern design with CP/FR trade-offs, and real-time color-depth reconstruction achieving up to 1.4 kHz color and 4 kHz depth, demonstrated on a 640×480 Prophesee sensor with a TI LightCrafter 4500 projector. The work advances rapid, low-latency 3D perception for robotics and real-time 3D reconstruction, with open-source ROS-based tooling available at the project repository.
Abstract
Event-based cameras (ECs) have emerged as bio-inspired sensors that report pixel brightness changes asynchronously, offering unmatched speed and efficiency in vision sensing. Despite their high dynamic range, temporal resolution, low power consumption, and computational simplicity, traditional monochrome ECs face limitations in detecting static or slowly moving objects and lack color information essential for certain applications. To address these challenges, we present a novel approach that integrates a Digital Light Processing (DLP) projector, forming Active Structured Light (ASL) for RGB-D sensing. By combining the benefits of ECs and projection-based techniques, our method enables the detection of color and the depth of each pixel separately. Dynamic projection adjustments optimize bandwidth, ensuring selective color data acquisition and yielding colorful point clouds without sacrificing spatial resolution. This integration, facilitated by a commercial TI LightCrafter 4500 projector and a monocular monochrome EC, not only enables frameless RGB-D sensing applications but also achieves remarkable performance milestones. With our approach, we achieved a color detection speed equivalent to 1400 fps and 4 kHz of pixel depth detection, significantly advancing the realm of computer vision across diverse fields from robotics to 3D reconstruction methods. Our code is publicly available: https://github.com/MISTLab/event_based_rgbd_ros
