An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation
Zewen Xu, Yijia He, Hao Wei, Bo Xu, BinJian Xie, Yihong Wu
TL;DR
RT$^2$PL introduces a real-time three-view pose solver that decouples rotation and translation estimation using point-line observations. Rotation is inferred via NEC for points and NBC for lines, incorporating observation uncertainty and solved with Levenberg–Marquardt and IRLS weighting, while translations are obtained through low-degree LiGT constraints for both points and lines. The approach achieves improved accuracy over trifocal-tensor methods and two-view baselines, with robust performance under degeneracies such as pure rotation and planar configurations. Extensive synthetic and real-world experiments demonstrate reliable, fast pose estimation and effective fusion of point and line features, highlighting the method's practical relevance for VO/SfM in weak-texture environments.
Abstract
Line features are valid complements for point features in man-made environments. 3D-2D constraints provided by line features have been widely used in Visual Odometry (VO) and Structure-from-Motion (SfM) systems. However, how to accurately solve three-view relative motion only with 2D observations of points and lines in real time has not been fully explored. In this paper, we propose a novel three-view pose solver based on rotation-translation decoupled estimation. First, a high-precision rotation estimation method based on normal vector coplanarity constraints that consider the uncertainty of observations is proposed, which can be solved by Levenberg-Marquardt (LM) algorithm efficiently. Second, a robust linear translation constraint that minimizes the degree of the rotation components and feature observation components in equations is elaborately designed for estimating translations accurately. Experiments on synthetic data and real-world data show that the proposed approach improves both rotation and translation accuracy compared to the classical trifocal-tensor-based method and the state-of-the-art two-view algorithm in outdoor and indoor environments.
