IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera

Jian Huang; Chengrui Dong; Xuanhua Chen; Peidong Liu

TL;DR

IncEventGS is a pose-free, incremental 3D Gaussian Splatting system trained solely on data from a single event camera. It processes the event stream in chunks and jointly optimizes the 3D Gaussian representation and a continuous $SE(3)$ camera trajectory using an event-based loss with an SSIM term, achieving high-quality novel view synthesis and accurate motion estimation without ground-truth poses. The method bootstraps from random Gaussians with depth-based reinitialization, grows the map incrementally through visibility-guided expansion, and performs dense bundle adjustment in a sliding window. On the Replica and TUM-VIE datasets it compares favorably with state-of-the-art event-based NeRF methods and visual odometry (VO) baselines. It also extends to color events and fast-motion scenarios and runs substantially faster than frame-based NeRF approaches, which points to practical use in real-time event-based 3D reconstruction.
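The event-based loss mentioned above can be illustrated with the standard event generation model, in which each event signals a log-brightness change of one contrast threshold at a pixel. The following is a minimal sketch under that assumption, not the authors' implementation: the function names, the contrast value `0.2`, the `eps` stabilizer, and the plain mean-squared error are all illustrative choices, and the SSIM term is omitted.

```python
import math

def accumulate_events(events, height, width, contrast=0.2):
    """Sum signed polarities per pixel, scaled by an assumed contrast threshold.

    Each event is a tuple (x, y, timestamp, polarity) with polarity +1 or -1.
    """
    image = [[0.0] * width for _ in range(height)]
    for x, y, _t, polarity in events:
        image[y][x] += contrast * polarity
    return image

def predicted_log_change(render_start, render_end, eps=1e-5):
    """Log-brightness difference between two rendered intensity images."""
    return [
        [math.log(b + eps) - math.log(a + eps) for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(render_start, render_end)
    ]

def event_loss(measured, predicted):
    """Mean squared error between measured and predicted brightness change."""
    count = sum(len(row) for row in measured)
    total = sum(
        (m - p) ** 2
        for row_m, row_p in zip(measured, predicted)
        for m, p in zip(row_m, row_p)
    )
    return total / count

# Toy example: a 1x2 image, two positive events at pixel 0, one negative at pixel 1.
events = [(0, 0, 0.00, 1), (0, 0, 0.01, 1), (1, 0, 0.02, -1)]
measured = accumulate_events(events, height=1, width=2)
render_start = [[1.0, 1.0]]
render_end = [[math.exp(0.4), math.exp(-0.2)]]
loss = event_loss(measured, predicted_log_change(render_start, render_end))
```

In a pose-free setting, `render_start` and `render_end` would come from rendering the Gaussians at the two poses bounding an event chunk, so the gradient of this loss reaches both the scene and the trajectory parameters.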

Abstract

Implicit neural representations and explicit 3D Gaussian Splatting (3D-GS) for novel view synthesis have recently achieved remarkable progress with frame-based cameras (e.g. RGB and RGB-D cameras). Compared to a frame-based camera, the event camera, a novel type of bio-inspired visual sensor, offers high temporal resolution, high dynamic range, low power consumption and low latency. Because of its asynchronous and irregular data capture process, little work has applied neural representations or 3D Gaussian Splatting to event cameras. In this work, we present IncEventGS, an incremental 3D Gaussian Splatting reconstruction algorithm for a single event camera. To recover the 3D scene representation incrementally, IncEventGS adopts the tracking and mapping paradigm of conventional SLAM pipelines. Given the incoming event stream, the tracker first estimates an initial camera motion based on the previously reconstructed 3D-GS scene representation. The mapper then jointly refines both the 3D scene representation and the camera motion, starting from the trajectory estimated by the tracker. Experimental results demonstrate that IncEventGS delivers superior performance compared to prior NeRF-based methods and other related baselines, even without ground-truth camera poses. Furthermore, our method also outperforms state-of-the-art event-based visual odometry methods in camera motion estimation. Code is publicly available at: https://github.com/wu-cvgl/IncEventGS.
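The tracking and mapping loop described in the abstract can be outlined as a short control-flow sketch. This is a hypothetical skeleton only: `track` and `refine` are caller-supplied stand-ins for the tracker and mapper, fixed-size chunking by event count is an assumption (the actual chunking criterion is not stated here), and the toy stubs at the bottom exist purely to exercise the loop.

```python
def chunk_events(events, chunk_size):
    """Split a time-ordered event list into consecutive fixed-size chunks."""
    return [events[i:i + chunk_size] for i in range(0, len(events), chunk_size)]

def run_incremental(events, chunk_size, window_size, track, refine, scene, pose):
    """Alternate tracking and mapping over successive event chunks.

    track(scene, chunk, pose) -> pose
        Estimates the motion for a new chunk with the scene held fixed.
    refine(scene, chunks, poses) -> (scene, poses)
        Jointly updates the scene and the poses inside the sliding window.
    """
    chunks = chunk_events(events, chunk_size)
    poses = []
    for i, chunk in enumerate(chunks):
        # Tracking: initialize the new pose from the current scene estimate.
        pose = track(scene, chunk, pose)
        poses.append(pose)
        # Mapping: refine the scene and the most recent window of poses together.
        start = max(0, i + 1 - window_size)
        scene, refined = refine(scene, chunks[start:i + 1], poses[start:])
        poses[start:] = refined
        pose = poses[-1]
    return scene, poses

# Toy stand-ins: the "pose" is a counter and the "scene" counts refined chunks.
def toy_track(scene, chunk, pose):
    return pose + 1

def toy_refine(scene, chunks, poses):
    return scene + len(chunks), list(poses)

final_scene, trajectory = run_incremental(
    list(range(10)), chunk_size=4, window_size=2,
    track=toy_track, refine=toy_refine, scene=0, pose=0,
)
```

Keeping the tracker and mapper as separate callables mirrors the split in the abstract: the tracker only supplies an initial motion estimate, while the mapper is free to revise both the scene and the poses it receives.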
