QE-BEV: Query Evolution for Bird's Eye View Object Detection in Varied Contexts
Jiawei Yao, Yingxin Lai, Hongrui Kou, Tong Wu, Ruixi Liu
TL;DR
The paper addresses BEV-based 3D object detection in dynamic scenes and identifies limitations of static and simple dynamic queries in exploiting temporal context. It proposes QE-BEV, which combines Dynamic Query Evolution Module with K-means clustering and Top-K Attention, plus a Lightweight Temporal Fusion Module and Diversity Loss to balance attention while reusing computations. Empirical results on nuScenes and Waymo show state-of-the-art NDS and mAP improvements with improved efficiency, including NDS 56.1 with ResNet50 (57.8 with perspective pretraining) and 61.1 with ResNet101 on nuScenes, and strong Waymo metrics. The work advances BEV-based detectors toward real-time, long-range temporal reasoning in autonomous driving.
Abstract
3D object detection plays a pivotal role in autonomous driving and robotics, demanding precise interpretation of Bird's Eye View (BEV) images. The dynamic nature of real-world environments necessitates the use of dynamic query mechanisms in 3D object detection to adaptively capture and process the complex spatio-temporal relationships present in these scenes. However, prior implementations of dynamic queries have often faced difficulties in effectively leveraging these relationships, particularly when it comes to integrating temporal information in a computationally efficient manner. Addressing this limitation, we introduce a framework utilizing dynamic query evolution strategy, harnesses K-means clustering and Top-K attention mechanisms for refined spatio-temporal data processing. By dynamically segmenting the BEV space and prioritizing key features through Top-K attention, our model achieves a real-time, focused analysis of pertinent scene elements. Our extensive evaluation on the nuScenes and Waymo dataset showcases a marked improvement in detection accuracy, setting a new benchmark in the domain of query-based BEV object detection. Our dynamic query evolution strategy has the potential to push the boundaries of current BEV methods with enhanced adaptability and computational efficiency. Project page: https://github.com/Jiawei-Yao0812/QE-BEV
