CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework

Yushan Han; Hui Zhang; Honglei Zhang; Jing Wang; Yidong Li

CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework

Yushan Han, Hui Zhang, Honglei Zhang, Jing Wang, Yidong Li

TL;DR

CoDTS tackles sparse supervision in collaborative perception by introducing a dual teacher-student framework that combines Main Foreground Mining and Supplement Foreground Mining with Adaptive Thresholding and Neighbor Anchor Sampling. The two-stage training strategy (warm-up and refinement) enables mutual learning between the student and a dynamic teacher, yielding pseudo labels that are both high in quality and abundant in quantity. Across four large-scale datasets, CoDTS consistently surpasses prior sparsely supervised methods and approaches full-supervision performance, while also outperforming several semi-supervised baselines, demonstrating strong practical impact for cost-effective, robust multi-agent perception.

Abstract

Current collaborative perception methods often rely on fully annotated datasets, which can be expensive to obtain in practical situations. To reduce annotation costs, some works adopt sparsely supervised learning techniques and generate pseudo labels for the missing instances. However, these methods fail to achieve an optimal confidence threshold that harmonizes the quality and quantity of pseudo labels. To address this issue, we propose an end-to-end Collaborative perception Dual Teacher-Student framework (CoDTS), which employs adaptive complementary learning to produce both high-quality and high-quantity pseudo labels. Specifically, the Main Foreground Mining (MFM) module generates high-quality pseudo labels based on the prediction of the static teacher. Subsequently, the Supplement Foreground Mining (SFM) module ensures a balance between the quality and quantity of pseudo labels by adaptively identifying missing instances based on the prediction of the dynamic teacher. Additionally, the Neighbor Anchor Sampling (NAS) module is incorporated to enhance the representation of pseudo labels. To promote the adaptive complementary learning, we implement a staged training strategy that trains the student and dynamic teacher in a mutually beneficial manner. Extensive experiments demonstrate that the CoDTS effectively ensures an optimal balance of pseudo labels in both quality and quantity, establishing a new state-of-the-art in sparsely supervised collaborative perception. The code is available at https://github.com/CatOneTwo/CoDTS.

CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework

TL;DR

Abstract

CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)