Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking

Authors

  • Xin Tong, Intelligent Science & Technology Academy of CASIC
  • Shi Peng, Intelligent Science & Technology Academy of CASIC
  • Baojie Tian, Intelligent Science & Technology Academy of CASIC
  • Yufei Guo, Intelligent Science & Technology Academy of CASIC
  • Xuhui Huang, Intelligent Science & Technology Academy of CASIC
  • Zhe Ma, Intelligent Science & Technology Academy of CASIC

DOI:

https://doi.org/10.1609/aaai.v39i7.32799

Abstract

Classical Transformer-based line segment detection methods have delivered impressive results. However, we observe that some accurately detected line segments are assigned low confidence scores during prediction, causing them to be ranked lower and potentially suppressed. Additionally, these models often require prolonged training periods to achieve strong performance, largely due to the necessity of bipartite matching. In this paper, we introduce RANK-LETR, a novel Transformer-based line segment detection method. Our approach leverages learnable geometric information to refine the ranking of predicted line segments by enhancing the confidence scores of high-quality predictions in a posterior verification step. We also propose a new line segment proposal method, wherein the feature point nearest to the centroid of the line segment directly predicts the location, significantly improving training efficiency and stability. Moreover, we introduce a line segment ranking loss to stabilize rankings during training, thereby enhancing the generalization capability of the model. Experimental results demonstrate that our method outperforms other Transformer-based and CNN-based approaches in prediction accuracy while requiring fewer training epochs than previous Transformer-based models.
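
The proposal step described in the abstract, where the feature point nearest to the centroid of a line segment predicts its location, can be illustrated with a minimal sketch. The abstract gives no implementation details, so everything below is an assumption made for illustration: the regular feature grid, the `stride` parameter, the function names, and the endpoint-offset regression target are all hypothetical and are not taken from RANK-LETR.

```python
import math


def nearest_feature_point(segment, stride, grid_h, grid_w):
    """Return (row, col) of the grid cell whose centre is nearest the segment midpoint.

    Assumption (not from the paper): features lie on a regular grid, and cell
    (row, col) has its centre at pixel ((col + 0.5) * stride, (row + 0.5) * stride).
    """
    (x1, y1), (x2, y2) = segment
    # The centroid of a line segment is the midpoint of its endpoints.
    cx, cy = (x1 + x2) / 2.0, (y1 + y2) / 2.0
    # The cell containing the midpoint has the nearest centre; clamp to the grid.
    col = min(max(math.floor(cx / stride), 0), grid_w - 1)
    row = min(max(math.floor(cy / stride), 0), grid_h - 1)
    return row, col


def regression_target(segment, row, col, stride):
    """Endpoint offsets relative to the assigned cell centre (hypothetical target)."""
    px, py = (col + 0.5) * stride, (row + 0.5) * stride
    (x1, y1), (x2, y2) = segment
    return (x1 - px, y1 - py, x2 - px, y2 - py)


# Example: one ground-truth segment on a 16 x 16 feature grid with stride 8.
segment = ((10.0, 20.0), (50.0, 40.0))
row, col = nearest_feature_point(segment, stride=8, grid_h=16, grid_w=16)
target = regression_target(segment, row, col, stride=8)
```

Under these assumptions each ground-truth segment is assigned to exactly one feature location by a closed-form rule, so no bipartite matching is needed to pair predictions with targets. The abstract identifies bipartite matching as a main cause of prolonged training in earlier Transformer-based detectors.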

Published

2025-04-11

How to Cite

Tong, X., Peng, S., Tian, B., Guo, Y., Huang, X., & Ma, Z. (2025). Improving Transformer Based Line Segment Detection with Matched Predicting and Re-ranking. Proceedings of the AAAI Conference on Artificial Intelligence, 39(7), 7428–7436. https://doi.org/10.1609/aaai.v39i7.32799

Section

AAAI Technical Track on Computer Vision VI