JDT3D: Addressing the Gaps in LiDAR-Based Tracking-by-Attention
Brian Cheong, Jiachen Zhou, Steven Waslander
TL;DR
JDT3D advances LiDAR-based tracking-by-attention by integrating end-to-end joint detection and tracking with two key enhancements: track sampling augmentation to enrich temporal supervision and confidence-based query propagation to align training and inference. On nuScenes, it achieves AMOTA $=0.574$ and AMOTP $=0.837$, outperforming LiDAR-based TBA methods by over $6\%$ in AMOTA and reducing ID switches, while clinical analysis reveals the remaining gap with TBD stems from weaker multi-frame detection and temporal confusion. The work shows that end-to-end JDT with longer temporal contexts and a more capable decoder can substantially improve LiDAR TBA, and it provides a clear analysis of where improvements are most needed. Overall, JDT3D demonstrates the viability of bridging the TBD–TBA gap in LiDAR MOT and offers concrete, generalizable strategies for future LiDAR-based trackers.
Abstract
Tracking-by-detection (TBD) methods achieve state-of-the-art performance on 3D tracking benchmarks for autonomous driving. On the other hand, tracking-by-attention (TBA) methods have the potential to outperform TBD methods, particularly for long occlusions and challenging detection settings. This work investigates why TBA methods continue to lag in performance behind TBD methods using a LiDAR-based joint detector and tracker called JDT3D. Based on this analysis, we propose two generalizable methods to bridge the gap between TBD and TBA methods: track sampling augmentation and confidence-based query propagation. JDT3D is trained and evaluated on the nuScenes dataset, achieving 0.574 on the AMOTA metric on the nuScenes test set, outperforming all existing LiDAR-based TBA approaches by over 6%. Based on our results, we further discuss some potential challenges with the existing TBA model formulation to explain the continued gap in performance with TBD methods. The implementation of JDT3D can be found at the following link: https://github.com/TRAILab/JDT3D.
