TopoMaskV2: Enhanced Instance-Mask-Based Formulation for the Road Topology Problem
M. Esat Kalfaoglu, Halil Ibrahim Ozturk, Ozsel Kilinc, Alptekin Temizel
TL;DR
TopoMask addresses road topology by predicting centerlines as flow-aware instance masks in the BEV domain. It introduces quad-direction labels to encode centerline flow, employs a masked-attention transformer decoder, and uses a three-stage quad-direction post-processing to convert masks into ordered 3D centerline point sets, while a Bezier head provides complementary geometric cues. A fusion mechanism combines mask-derived and Bezier-derived outputs to boost both centerline accuracy and topology relations, and a multi-height bin Lift-Splat BEV variant preserves height information to further improve performance. On OpenLane-V2, TopoMask achieves state-of-the-art results across Subset-A and Subset-B, and across multiple metrics, while also providing a detailed analysis of attention types and metric considerations. The work highlights potential extensions in height prediction and topology evaluation, underscoring the practical impact for robust road topology understanding in autonomous driving.
Abstract
Recently, the centerline has become a popular representation of lanes due to its advantages in solving the road topology problem. To enhance centerline prediction, we have developed a new approach called TopoMask. Unlike previous methods that rely on keypoints or parametric methods, TopoMask utilizes an instance-mask-based formulation coupled with a masked-attention-based transformer architecture. We introduce a quad-direction label representation to enrich the mask instances with flow information and design a corresponding post-processing technique for mask-to-centerline conversion. Additionally, we demonstrate that the instance-mask formulation provides complementary information to parametric Bezier regressions, and fusing both outputs leads to improved detection and topology performance. Moreover, we analyze the shortcomings of the pillar assumption in the Lift Splat technique and adapt a multi-height bin configuration. Experimental results show that TopoMask achieves state-of-the-art performance in the OpenLane-V2 dataset, increasing from 44.1 to 49.4 for Subset-A and 44.7 to 51.8 for Subset-B in the V1.1 OLS baseline.
