STAMP: Scalable Task And Model-agnostic Collaborative Perception
Xiangbo Gao, Runsheng Xu, Jiachen Li, Ziran Wang, Zhiwen Fan, Zhengzhong Tu
TL;DR
STAMP tackles heterogeneous multi-agent perception by introducing adapters and reverters that translate local BEV features into a shared protocol domain, enabling scalable, task- and model-agnostic collaboration without sharing models. The Collaborative Feature Alignment (CFA) framework jointly trains a protocol BEV embedding and lightweight adapters/reverters, enforcing alignment in both feature space and decision space. Empirical results on OPV2V and V2V4Real show STAMP achieving comparable or superior accuracy with significantly lower per-agent training costs as the number of agents grows, and strong robustness to noise. The work demonstrates a practical path toward secure, scalable multi-agent perception and discusses multi-group collaboration as a promising direction to mitigate bottlenecks in heterogeneous CP systems.
Abstract
Perception is crucial for autonomous driving, but single-agent perception is often constrained by sensors' physical limitations, leading to degraded performance under severe occlusion, adverse weather conditions, and when detecting distant objects. Multi-agent collaborative perception offers a solution, yet challenges arise when integrating heterogeneous agents with varying model architectures. To address these challenges, we propose STAMP, a scalable task- and model-agnostic, collaborative perception pipeline for heterogeneous agents. STAMP utilizes lightweight adapter-reverter pairs to transform Bird's Eye View (BEV) features between agent-specific and shared protocol domains, enabling efficient feature sharing and fusion. This approach minimizes computational overhead, enhances scalability, and preserves model security. Experiments on simulated and real-world datasets demonstrate STAMP's comparable or superior accuracy to state-of-the-art models with significantly reduced computational costs. As a first-of-its-kind task- and model-agnostic framework, STAMP aims to advance research in scalable and secure mobility systems towards Level 5 autonomy. Our project page is at https://xiangbogaobarry.github.io/STAMP and the code is available at https://github.com/taco-group/STAMP.
