Accurate Automatic 3D Annotation of Traffic Lights and Signs for Autonomous Driving
Sándor Kunsági-Máté, Levente Pető, Lehel Seres, Tamás Matuszka
TL;DR
The paper tackles the lack of large-scale, accurate 3D annotations for static traffic-management objects (traffic lights and road signs) in autonomous driving. It introduces a LiDAR-free pipeline that combines 2D detections from RGB images with ego-motion data to triangulate 3D centers, estimate bounding box extents and orientations, and suppress false positives through image-space re-projection, achieving localization errors around 0.2–0.3 m up to 200 m. A representative dataset of ~50,000 auto-annotated frames (≈320k lights, 550k signs) is released under CC BY-NC-SA 4.0, enabling training of image-based perception models for long-range traffic management objects. The approach demonstrates strong quantitative performance on Waymo and in-house datasets and provides a practical, scalable resource for non-commercial research that advances 3D perception without reliance on LiDAR.
Abstract
3D detection of traffic management objects, such as traffic lights and road signs, is vital for self-driving cars, particularly for address-to-address navigation where vehicles encounter numerous intersections with these static objects. This paper introduces a novel method for automatically generating accurate and temporally consistent 3D bounding box annotations for traffic lights and signs, effective up to a range of 200 meters. These annotations are suitable for training real-time models used in self-driving cars, which need a large amount of training data. The proposed method relies only on RGB images with 2D bounding boxes of traffic management objects, which can be automatically obtained using an off-the-shelf image-space detector neural network, along with GNSS/INS data, eliminating the need for LiDAR point cloud data.
