Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking

Xin Tong; Shi Peng; Baojie Tian; Yufei Guo; Xuhui Huang; Zhe Ma

TL;DR

This work addresses the miscalibrated confidence scores and slow convergence of Transformer-based line segment detection by introducing RANK-LETR. The method uses centroid-based matched predicting, which removes the need for online bipartite matching, and a geometry-informed line re-ranking step so that confidence scores better reflect line quality. It also adds a line segment ranking loss that trains feature points to favor higher-quality predictions, and it uses high-resolution predictions with rotation augmentation on a Deformable Transformer backbone. On the Wireframe and YorkUrban datasets, the method is reported to be more accurate than both Transformer-based and CNN-based baselines while converging faster (e.g., after ~60 epochs). These gains in detection precision and training efficiency make Transformer-based line segment detection more practical for real-world applications.

Abstract

Classical Transformer-based line segment detection methods have delivered impressive results. However, we observe that some accurately detected line segments are assigned low confidence scores during prediction, causing them to be ranked lower and potentially suppressed. Additionally, these models often require prolonged training periods to achieve strong performance, largely due to the necessity of bipartite matching. In this paper, we introduce RANK-LETR, a novel Transformer-based line segment detection method. Our approach leverages learnable geometric information to refine the ranking of predicted line segments by enhancing the confidence scores of high-quality predictions in a posterior verification step. We also propose a new line segment proposal method, wherein the feature point nearest to the centroid of the line segment directly predicts the location, significantly improving training efficiency and stability. Moreover, we introduce a line segment ranking loss to stabilize rankings during training, thereby enhancing the generalization capability of the model. Experimental results demonstrate that our method outperforms other Transformer-based and CNN-based approaches in prediction accuracy while requiring fewer training epochs than previous Transformer-based models.
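As a rough illustration of the centroid-based proposal idea described above, the following sketch assigns each ground-truth segment to the feature-map cell whose center lies nearest to the segment's centroid. It is only a minimal sketch under stated assumptions, not the authors' implementation: the function names, the `(x1, y1, x2, y2)` pixel format, the uniform `stride`, and the handling of out-of-range centroids by clamping are all hypothetical, and the sketch does not address what happens when two segments fall in the same cell.

```python
from typing import List, Tuple

# Hypothetical segment format: endpoints (x1, y1, x2, y2) in pixel coordinates.
Segment = Tuple[float, float, float, float]


def centroid(seg: Segment) -> Tuple[float, float]:
    """Return the midpoint of a line segment."""
    x1, y1, x2, y2 = seg
    return ((x1 + x2) / 2.0, (y1 + y2) / 2.0)


def assign_to_feature_points(
    segments: List[Segment], stride: int, grid_w: int, grid_h: int
) -> List[Tuple[int, int]]:
    """Map each segment to the (row, col) of the nearest feature-map cell.

    Assumes cell centers sit at ((col + 0.5) * stride, (row + 0.5) * stride),
    so the nearest center is found by integer division of the centroid by
    the stride. Indices are clamped to the grid (an assumption of this sketch).
    """
    assignments = []
    for seg in segments:
        cx, cy = centroid(seg)
        col = min(max(int(cx // stride), 0), grid_w - 1)
        row = min(max(int(cy // stride), 0), grid_h - 1)
        assignments.append((row, col))
    return assignments


if __name__ == "__main__":
    demo = [(0.0, 0.0, 16.0, 16.0), (30.0, 2.0, 34.0, 6.0)]
    print(assign_to_feature_points(demo, stride=8, grid_w=4, grid_h=4))
```

Because the assignment depends only on the ground-truth geometry, it can be computed once per image rather than re-solved at every training step, which is the intuition behind dropping online bipartite matching.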
