Learning to Anchor Visual Odometry: KAN-Based Pose Regression for Planetary Landing
Xubo Luo, Zhaojin Li, Xue Wan, Wei Zhang, Leizheng Shu
TL;DR
KANLoc addresses the critical need for accurate, real-time $6$-DoF pose estimation in autonomous lunar landings by tightly fusing high-frequency visual odometry with sparse, learned absolute pose anchors produced by a Kolmogorov-Arnold Network. The method integrates a $KAN$-based absolute pose regressor, a CPU-based ORB VO module, and a constrained bundle adjustment backend to achieve globally consistent trajectories with real-time performance. Key contributions include the novel $KAN$ architecture for efficient pose regression, a tightly-coupled VO-AVL fusion strategy, and a mask-augmented training regime that boosts robustness under occlusion and sim-to-real transfer. Results on synthetic Unreal/AirSim data and real Chang'e-3 imagery show substantial reductions in translation and rotation error (e.g., up to $32\%$ and $45\%$ respectively) and real-time operation (>$15$ FPS), highlighting its practical impact for resource-constrained lunar landers and demanding illumination conditions.
Abstract
Accurate and real-time 6-DoF localization is mission-critical for autonomous lunar landing, yet existing approaches remain limited: visual odometry (VO) drifts unboundedly, while map-based absolute localization fails in texture-sparse or low-light terrain. We introduce KANLoc, a monocular localization framework that tightly couples VO with a lightweight but robust absolute pose regressor. At its core is a Kolmogorov-Arnold Network (KAN) that learns the complex mapping from image features to map coordinates, producing sparse but highly reliable global pose anchors. These anchors are fused into a bundle adjustment framework, effectively canceling drift while retaining local motion precision. KANLoc delivers three key advances: (i) a KAN-based pose regressor that achieves high accuracy with remarkable parameter efficiency, (ii) a hybrid VO-absolute localization scheme that yields globally consistent real-time trajectories (>=15 FPS), and (iii) a tailored data augmentation strategy that improves robustness to sensor occlusion. On both realistic synthetic and real lunar landing datasets, KANLoc reduces average translation and rotation error by 32% and 45%, respectively, with per-trajectory gains of up to 45%/48%, outperforming strong baselines.
