CRISTAL: Real-time Camera Registration in Static LiDAR Scans using Neural Rendering
Joni Vanherck, Steven Moonen, Brent Zoomers, Kobe Werner, Jeroen Put, Lode Jorissen, Nick Michiels
TL;DR
CRISTAL addresses drift and scale ambiguity in camera localization by localizing within a pre-captured colored LiDAR map. It introduces a neural point cloud renderer to synthesize photorealistic views from the LiDAR scan and two real-time pipelines: Online Render & Match for immediate relocalization and Prebuild & Localize for offline drift-free mapping compatible with standard SLAM. Evaluations on ScanNet++ and custom datasets demonstrate improved pose accuracy and photometric alignment over traditional SLAM, with the P&L approach enabling drift-free tracking in a global LiDAR frame. The work enables robust, real-time AR and robotics localization in large-scale environments using a single static LiDAR scan, and lays groundwork for dynamic map updates and lighting-variation experiments in future work.
Abstract
Accurate camera localization is crucial for robotics and Extended Reality (XR), enabling reliable navigation and alignment of virtual and real content. Existing visual methods often suffer from drift, scale ambiguity, and depend on fiducials or loop closure. This work introduces a real-time method for localizing a camera within a pre-captured, highly accurate colored LiDAR point cloud. By rendering synthetic views from this cloud, 2D-3D correspondences are established between live frames and the point cloud. A neural rendering technique narrows the domain gap between synthetic and real images, reducing occlusion and background artifacts to improve feature matching. The result is drift-free camera tracking with correct metric scale in the global LiDAR coordinate system. Two real-time variants are presented: Online Render and Match, and Prebuild and Localize. We demonstrate improved results on the ScanNet++ dataset and outperform existing SLAM pipelines.
