V2VLoc: Robust GNSS-Free Collaborative Perception via LiDAR Localization

Wenkai Lin; Qiming Xia; Wen Li; Xun Huang; Chenglu Wen

V2VLoc: Robust GNSS-Free Collaborative Perception via LiDAR Localization

Wenkai Lin, Qiming Xia, Wen Li, Xun Huang, Chenglu Wen

TL;DR

This work tackles the challenge of collaborative perception without GNSS signals by leveraging LiDAR-based localization to align multi-agent observations. It introduces a two-module pipeline—PGC for pose and confidence estimation and PASTAT for confidence-aware spatio-temporal alignment—together with the V2VLoc dataset for regression-based localization and collaborative detection. The approach achieves state-of-the-art performance under GNSS-denied conditions on V2VLoc and demonstrates generalization to real-world data (V2V4Real). Key contributions include first application of LiDAR localization to GNSS-free feature alignment in collaboration, a dedicated dataset for GNSS-free collaboration, and ablations validating the effectiveness of PGC and PASTAT. The results suggest robust, bandwidth-efficient collaborative perception in challenging environments.

Abstract

Multi-agents rely on accurate poses to share and align observations, enabling a collaborative perception of the environment. However, traditional GNSS-based localization often fails in GNSS-denied environments, making consistent feature alignment difficult in collaboration. To tackle this challenge, we propose a robust GNSS-free collaborative perception framework based on LiDAR localization. Specifically, we propose a lightweight Pose Generator with Confidence (PGC) to estimate compact pose and confidence representations. To alleviate the effects of localization errors, we further develop the Pose-Aware Spatio-Temporal Alignment Transformer (PASTAT), which performs confidence-aware spatial alignment while capturing essential temporal context. Additionally, we present a new simulation dataset, V2VLoc, which can be adapted for both LiDAR localization and collaborative detection tasks. V2VLoc comprises three subsets: Town1Loc, Town4Loc, and V2VDet. Town1Loc and Town4Loc offer multi-traversal sequences for training in localization tasks, whereas V2VDet is specifically intended for the collaborative detection task. Extensive experiments conducted on the V2VLoc dataset demonstrate that our approach achieves state-of-the-art performance under GNSS-denied conditions. We further conduct extended experiments on the real-world V2V4Real dataset to validate the effectiveness and generalizability of PASTAT.

V2VLoc: Robust GNSS-Free Collaborative Perception via LiDAR Localization

TL;DR

Abstract

V2VLoc: Robust GNSS-Free Collaborative Perception via LiDAR Localization

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)