Been There, Scanned That: Nostalgia-Driven LiDAR Compression for Self-Driving Cars

Ali Khalid; Jaiaid Mobin; Sumanth Rao Appala; Avinash Maurya; Stephany Berrio Perez; M. Mustafa Rafique; Fawad Ahmad

Been There, Scanned That: Nostalgia-Driven LiDAR Compression for Self-Driving Cars

Ali Khalid, Jaiaid Mobin, Sumanth Rao Appala, Avinash Maurya, Stephany Berrio Perez, M. Mustafa Rafique, Fawad Ahmad

TL;DR

DejaView tackles the challenge of storing and transferring terabytes of autonomous-vehicle LiDAR data by exploiting cross-day redundancy through a cascaded diff against a past reference cloud and a 3D map. The method combines a two-stage diff with a one-way diff, a hybrid coarse/fine nearest-neighbor search, and delta-encoded indices, achieving up to $210\times$ compression with a reconstruction error of $15\text{ cm}$ on two months of data. Key contributions include the cascaded diff framework, efficient diffusion via a hybrid search, and an open-source end-to-end implementation, demonstrated to preserve localization, object detection, and 3D semantic segmentation performance at high compression. This work enables practical offline storage and forensic replay for AV data, reducing network and cloud costs while maintaining actionable fidelity in downstream perception tasks.

Abstract

An autonomous vehicle can generate several terabytes of sensor data per day. A significant portion of this data consists of 3D point clouds produced by depth sensors such as LiDARs. This data must be transferred to cloud storage, where it is utilized for training machine learning models or conducting analyses, such as forensic investigations in the event of an accident. To reduce network and storage costs, this paper introduces DejaView. Although prior work uses interframe redundancies to compress data, DejaView searches for and uses redundancies on larger temporal scales (days and months) for more effective compression. We designed DejaView with the insight that the operating area of autonomous vehicles is limited and that vehicles mostly traverse the same routes daily. Consequently, the 3D data they collect daily is likely similar to the data they have captured in the past. To capture this, the core of DejaView is a diff operation that compactly represents point clouds as delta w.r.t. 3D data from the past. Using two months of LiDAR data, an end-to-end implementation of DejaView can compress point clouds by a factor of 210 at a reconstruction error of only 15 cm.

Been There, Scanned That: Nostalgia-Driven LiDAR Compression for Self-Driving Cars

TL;DR

Abstract

Been There, Scanned That: Nostalgia-Driven LiDAR Compression for Self-Driving Cars

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (20)