X-IONet: Cross-Platform Inertial Odometry Network with Dual-Stage Attention
Dehan Shen, Changhao Chen
TL;DR
X-IONet addresses cross-platform inertial odometry by combining an IMU-only front-end with a rule-based expert selector, a dual-stage attention displacement predictor, and EKF-based state estimation. The method captures long-range temporal and inter-axis dependencies, outputs displacement with uncertainty, and fuses this information for robust global localization. It achieves state-of-the-art performance on both RoNIN (pedestrian) and a Go2 quadruped dataset, with significant reductions in ATE and RTE compared to strong baselines, and ablations validate the importance of each component, including the Huber–Gaussian loss for uncertainty modeling. The work demonstrates strong cross-platform generalization and robustness to aggressive and irregular motion, suggesting practical applicability for IMU-only navigation across humans and legged robots and potential for broader multimodal integration.
Abstract
Learning-based inertial odometry has achieved remarkable progress in pedestrian navigation. However, extending these methods to quadruped robots remains challenging due to their distinct and highly dynamic motion patterns. Models that perform well on pedestrian data often experience severe degradation when deployed on legged platforms. To tackle this challenge, we introduce X-IONet, a cross-platform inertial odometry framework that operates solely using a single Inertial Measurement Unit (IMU). X-IONet incorporates a rule-based expert selection module to classify motion platforms and route IMU sequences to platform-specific expert networks. The displacement prediction network features a dual-stage attention architecture that jointly models long-range temporal dependencies and inter-axis correlations, enabling accurate motion representation. It outputs both displacement and associated uncertainty, which are further fused through an Extended Kalman Filter (EKF) for robust state estimation. Extensive experiments on public pedestrian datasets and a self-collected quadruped robot dataset demonstrate that X-IONet achieves state-of-the-art performance, reducing Absolute Trajectory Error (ATE) by 14.3% and Relative Trajectory Error (RTE) by 11.4% on pedestrian data, and by 52.8% and 41.3% on quadruped robot data. These results highlight the effectiveness of X-IONet in advancing accurate and robust inertial navigation across both human and legged robot platforms.
