Self-Localized Collaborative Perception
Zhenyang Ni, Zixing Lei, Yifan Lu, Dingju Wang, Chen Feng, Yanfeng Wang, Siheng Chen
TL;DR
Self-localized collaborative perception addresses the vulnerability of external localization to noise and attacks by deriving relative poses from perception data alone. The authors introduce BEVGlue as a spatial alignment module and CoBEVGlue as an end-to-end self-localized collaboration system, with object-graph modeling and temporally consistent maximum common subgraph detection to fuse BEV features across agents. They demonstrate state-of-the-art robustness under localization noise and spoofing attacks on real and simulated datasets OPV2V, DAIR-V2X, and V2V4Real, and show BEVGlue can substantially boost other methods by about 57.7% with minimal bandwidth. The work advances practical multi-agent perception by delivering localization-free collaboration with high efficiency and accuracy.
Abstract
Collaborative perception has garnered considerable attention due to its capacity to address several inherent challenges in single-agent perception, including occlusion and out-of-range issues. However, existing collaborative perception systems heavily rely on precise localization systems to establish a consistent spatial coordinate system between agents. This reliance makes them susceptible to large pose errors or malicious attacks, resulting in substantial reductions in perception performance. To address this, we propose~$\mathtt{CoBEVGlue}$, a novel self-localized collaborative perception system, which achieves more holistic and robust collaboration without using an external localization system. The core of~$\mathtt{CoBEVGlue}$ is a novel spatial alignment module, which provides the relative poses between agents by effectively matching co-visible objects across agents. We validate our method on both real-world and simulated datasets. The results show that i) $\mathtt{CoBEVGlue}$ achieves state-of-the-art detection performance under arbitrary localization noises and attacks; and ii) the spatial alignment module can seamlessly integrate with a majority of previous methods, enhancing their performance by an average of $57.7\%$. Code is available at https://github.com/VincentNi0107/CoBEVGlue
