A Spatial Calibration Method for Robust Cooperative Perception

Zhiying Song; Tenghui Xie; Hailiang Zhang; Jiaxin Liu; Fuxi Wen; Jun Li

TL;DR

The paper tackles robust spatial calibration for cooperative perception under pose and perception noise. It proposes context-based matching (CBM), a lightweight, bounding-box–only inter-agent object association framework that builds intra-agent context, performs coarse matching with global consensus, and estimates the relative transform to fuse multi-view detections. CBM achieves decimeter-level relative pose accuracy and shows strong resilience to non-co-visible objects and measurement noise, outperforming prior methods on real-world (SIND) and simulated (OPV2V) datasets. The approach enables reliable V2X perception with minimal feature extraction and communication, suitable for scalable deployment in intelligent transportation systems. The results indicate significant improvements in transform accuracy (RRE, RTE) and perception quality (mAP) under varied localization errors.
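The intra-agent context idea can be illustrated with a minimal sketch. It assumes each detection is reduced to a 2D center and heading `[x, y, yaw]`, and that an object's context is the set of other objects' positions expressed in that object's own frame. The function names `build_context` and `apply_pose` and the toy boxes are hypothetical; the paper's exact context definition may differ. The point of the sketch is that such a context does not change when the whole scene is viewed from a different agent pose, which is what makes it usable for matching without an accurate relative pose.

```python
import numpy as np

def build_context(boxes):
    """boxes: (N, 3) array of [x, y, yaw] in one agent's frame (assumed layout).

    Returns a list where entry i holds the (N-1, 2) positions of all other
    boxes, expressed in box i's local frame (origin at its center, x-axis
    along its heading).
    """
    contexts = []
    for i, (x, y, yaw) in enumerate(boxes):
        offsets = np.delete(boxes[:, :2], i, axis=0) - np.array([x, y])
        c, s = np.cos(yaw), np.sin(yaw)
        rot = np.array([[c, -s], [s, c]])  # R(yaw): box frame -> agent frame
        # Row-vector form of R(yaw)^T @ offset, i.e. agent frame -> box frame
        contexts.append(offsets @ rot)
    return contexts

def apply_pose(boxes, theta, t):
    """Re-express the same boxes in a frame rotated by theta and shifted by t."""
    c, s = np.cos(theta), np.sin(theta)
    rot = np.array([[c, -s], [s, c]])
    moved = boxes.copy()
    moved[:, :2] = boxes[:, :2] @ rot.T + np.asarray(t)
    moved[:, 2] = boxes[:, 2] + theta
    return moved

# Toy scene seen from two different (unknown) agent poses
boxes_a = np.array([[0.0, 0.0, 0.0], [2.0, 1.0, 0.3], [-1.0, 3.0, 1.2]])
boxes_b = apply_pose(boxes_a, theta=0.7, t=[5.0, -2.0])
ctx_a = build_context(boxes_a)
ctx_b = build_context(boxes_b)
```

In this noise-free toy case the contexts of the same object agree across the two views, so comparing contexts gives a pose-independent similarity score; with real detections the agreement would only be approximate.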

Abstract

Cooperative perception is a promising technique for intelligent and connected vehicles through vehicle-to-everything (V2X) cooperation, provided that accurate pose information and relative pose transforms are available. Nevertheless, obtaining precise positioning information often entails high costs associated with navigation systems. Hence, the relative pose information must be calibrated for multi-agent cooperative perception. This paper proposes a simple but effective object association approach named context-based matching (CBM), which identifies inter-agent object correspondences using intra-agent geometrical context. In detail, this method constructs contexts using the relative positions of the detected bounding boxes, followed by local context matching and global consensus maximization. The optimal relative pose transform is estimated based on the matched correspondences, followed by cooperative perception fusion. Extensive experiments are conducted on both simulated and real-world datasets. Even with large inter-agent localization errors, high object association precision and decimeter-level relative pose calibration accuracy are achieved among the cooperating agents.
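The step of estimating a relative pose from matched correspondences can be sketched as follows. This is not taken from the paper: it assumes the matched objects are reduced to 2D box centers and uses the standard SVD-based least-squares rigid alignment (Kabsch method) as a stand-in estimator. The function name `estimate_rigid_2d` and the sample points are hypothetical.

```python
import numpy as np

def estimate_rigid_2d(src, dst):
    """Least-squares rotation R (2x2) and translation t (2,) such that
    dst is approximately src @ R.T + t, given matched (N, 2) point sets."""
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    src_c = src.mean(axis=0)
    dst_c = dst.mean(axis=0)
    # 2x2 cross-covariance of the centered correspondences
    H = (src - src_c).T @ (dst - dst_c)
    U, _, Vt = np.linalg.svd(H)
    # Force a proper rotation (determinant +1), not a reflection
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0
    R = Vt.T @ np.diag([1.0, d]) @ U.T
    t = dst_c - R @ src_c
    return R, t

# Matched box centers: the same objects seen by two agents (synthetic data)
theta_true = 0.4
R_true = np.array([[np.cos(theta_true), -np.sin(theta_true)],
                   [np.sin(theta_true),  np.cos(theta_true)]])
t_true = np.array([3.0, -1.0])
centers_cav = np.array([[0.0, 0.0], [2.0, 1.0], [-1.0, 3.0], [4.0, -2.0]])
centers_ego = centers_cav @ R_true.T + t_true

R_est, t_est = estimate_rigid_2d(centers_cav, centers_ego)
yaw_est = np.arctan2(R_est[1, 0], R_est[0, 0])
```

With exact correspondences, as in this synthetic case, the estimate recovers the true rotation and translation; with noisy detections or wrong matches the result degrades, which is why correct association matters before this step.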

Paper Structure (14 sections, 20 equations, 7 figures, 1 table, 1 algorithm)

Introduction
Related work
Method
Problem formulation
Context-based inter-agent object association
Intra-agent context construction
Context similarity-based coarse matching
Global consensus maximization
Transform estimation and perception fusion
Validation on real-world dataset
Experiments setting
Evaluation of inter-agent association performance
Evaluation of transform estimation and perception
Conclusions

Figures (7)

Figure 1: Effect of spatial calibration on cooperative perception.
Figure 2: Illustration of the object sets.
Figure 3: Framework of the proposed method CBM. Object sets $\mathcal{I}_{\mathcal{X}}$ and $\mathcal{I}_{\mathcal{Y}}$ are detected by the onboard perception system of the Ego vehicle and CAV, respectively. Subsequently, the Ego establishes context based on $\mathcal{I}_{\mathcal{X}}$ and $\mathcal{I}_{\mathcal{Y}}$. Preliminary correspondences between objects in $\mathcal{I}_{\mathcal{X}}$ and $\mathcal{I}_{\mathcal{Y}}$ are identified via the local matching module, and subsequently refined through the global consensus module. Finally, the relative pose between the Ego and CAV is estimated.
Figure 4: Context of object $X_1$ in the detections of the Ego and CAV, respectively.
Figure 5: Quantitative results and qualitative demonstration on SIND.
...and 2 more figures
