TopoStreamer: Temporal Lane Segment Topology Reasoning in Autonomous Driving
Yiming Yang, Yueru Luo, Bingkun He, Hongbin Lin, Suzhong Fu, Chao Zheng, Zhipeng Cao, Erlong Li, Chao Yan, Shuguang Cui, Zhen Li
TL;DR
TopoStreamer tackles the problem of temporal lane topology reasoning for autonomous driving by introducing streaming attribute constraints, dynamic lane boundary positional encoding, and lane segment denoising. The method combines an BEV-based encoder with a dynamic, boundary-aware transformer decoder and a memory-based temporal propagation mechanism to maintain consistent lane segment attributes and topology across frames. Empirical results on OpenLane-V2 show state-of-the-art gains in lane segment perception and centerline topology, with ablations confirming the effectiveness of each proposed component and the importance of lossless streaming supervision. The work advances end-to-end temporal road-network understanding, with practical implications for robust planning and lane-change decision-making in dynamic driving scenarios.
Abstract
Lane segment topology reasoning constructs a comprehensive road network by capturing the topological relationships between lane segments and their semantic types. This enables end-to-end autonomous driving systems to perform road-dependent maneuvers such as turning and lane changing. However, the limitations in consistent positional embedding and temporal multiple attribute learning in existing methods hinder accurate roadnet reconstruction. To address these issues, we propose TopoStreamer, an end-to-end temporal perception model for lane segment topology reasoning. Specifically, TopoStreamer introduces three key improvements: streaming attribute constraints, dynamic lane boundary positional encoding, and lane segment denoising. The streaming attribute constraints enforce temporal consistency in both centerline and boundary coordinates, along with their classifications. Meanwhile, dynamic lane boundary positional encoding enhances the learning of up-to-date positional information within queries, while lane segment denoising helps capture diverse lane segment patterns, ultimately improving model performance. Additionally, we assess the accuracy of existing models using a lane boundary classification metric, which serves as a crucial measure for lane-changing scenarios in autonomous driving. On the OpenLane-V2 dataset, TopoStreamer demonstrates significant improvements over state-of-the-art methods, achieving substantial performance gains of +3.0% mAP in lane segment perception and +1.7% OLS in centerline perception tasks.
