iMoT: Inertial Motion Transformer for Inertial Navigation

Son Minh Nguyen; Linh Duy Tran; Duc Viet Le; Paul J. M. Havinga

TL;DR

iMoT addresses inertial odometry by fusing acceleration and angular velocity through a Transformer-based encoder–decoder. It introduces a Progressive Series Decoupler (PSD) to extract informative temporal components, Adaptive Positional Encoding (APE) to align positions across modalities, ASC to preserve cross-channel details, and a decoder whose learnable query motion particles are refined via a dynamic scoring mechanism (DSM) to model multiple motion modes. The approach achieves state-of-the-art robustness and accuracy on four large inertial datasets, especially in unseen dynamic scenarios, which indicates strong generalization for trajectory reconstruction. By jointly modeling cross-modal cues and motion uncertainty within a single Transformer framework, iMoT offers more reliable inertial navigation for AR/VR, robotics, and related domains.

Abstract

We propose iMoT, an innovative Transformer-based inertial odometry method that retrieves cross-modal information from motion and rotation modalities for accurate positional estimation. Unlike prior work, when encoding the motion context, we introduce a Progressive Series Decoupler at the beginning of each encoder layer to highlight critical motion events inherent in the acceleration and angular velocity signals. To better aggregate cross-modal interactions, we present Adaptive Positional Encoding, which dynamically adjusts positional embeddings to account for temporal discrepancies between modalities. During decoding, we introduce a small set of learnable query motion particles as priors to model motion uncertainties within velocity segments. Each query motion particle is intended to draw cross-modal features dedicated to a specific motion mode; taken together, the particles allow the model to refine its understanding of motion dynamics. Lastly, we design a dynamic scoring mechanism that stabilizes iMoT's optimization by considering all aligned motion particles at the final decoding step, ensuring robust and accurate velocity segment estimation. Extensive evaluations on various inertial datasets demonstrate that iMoT significantly outperforms state-of-the-art methods in the robustness and accuracy of trajectory reconstruction.
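
To make the last two steps of the abstract concrete, the following is a minimal sketch of how several scored candidate velocities could be fused into one estimate and how a sequence of such estimates yields a trajectory. It is not iMoT's implementation: the abstract does not say how scores and particles are combined, so the softmax-weighted average in `fuse_particles` is purely an assumption, as are the function names and the 2D setting. Only the `integrate` step (position as accumulated velocity times time step) is standard dead reckoning.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of raw scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_particles(candidates, scores):
    """Fuse K candidate 2D velocities into one estimate.

    ASSUMPTION: a softmax-weighted average stands in for the paper's
    dynamic scoring mechanism, whose exact form is not given here.
    """
    weights = softmax(scores)
    vx = sum(w * c[0] for w, c in zip(weights, candidates))
    vy = sum(w * c[1] for w, c in zip(weights, candidates))
    return (vx, vy)


def integrate(velocities, dt, start=(0.0, 0.0)):
    """Reconstruct a 2D path by accumulating velocity * dt from `start`."""
    x, y = start
    path = [(x, y)]
    for vx, vy in velocities:
        x += vx * dt
        y += vy * dt
        path.append((x, y))
    return path


if __name__ == "__main__":
    # Two hypothetical particles per segment, each with a raw score.
    segments = [
        ([(1.0, 0.0), (3.0, 0.0)], [0.0, 0.0]),
        ([(0.0, 2.0), (0.0, 2.0)], [1.0, -1.0]),
    ]
    fused = [fuse_particles(c, s) for c, s in segments]
    print(integrate(fused, dt=0.5))
```

A weighted average is only one possible design. Selecting the single highest-scoring particle would be another, and which of these (if either) matches the paper cannot be determined from the abstract alone.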
