Metric, inertially aligned monocular state estimation via kinetodynamic priors

Jiaxin Liu; Min Li; Wanting Xu; Liang Li; Jiaqi Yang; Laurent Kneip

Metric, inertially aligned monocular state estimation via kinetodynamic priors

Jiaxin Liu, Min Li, Wanting Xu, Liang Li, Jiaqi Yang, Laurent Kneip

TL;DR

The paper tackles monocular state estimation for non-rigid platforms where deformation breaks rigid-body assumptions. It introduces a kineto-dynamic framework that combines continuous-time B-Spline motion on $\mathbb{SE}3$ with a Deformation-force Network mapping relative pose to accelerations, linking observed trajectory acceleration to deformation-induced dynamics through $F=ma$. The approach yields metric scale and gravity alignment using only a single camera by attributing unmodeled motion to elastic deformation, effectively enabling passive inertial sensing. Experimental validation on a spring-camera setup, including simulations and real data, demonstrates robust scale recovery and inertial alignment, suggesting broad potential for flexible robotic platforms with known motion models and elastic actuation.

Abstract

Accurate state estimation for flexible robotic systems poses significant challenges, particular for platforms with dynamically deforming structures that invalidate rigid-body assumptions. This paper tackles this problem and allows to extend existing rigid-body pose estimation methods to non-rigid systems. Our approach hinges on two core assumptions: first, the elastic properties are captured by an injective deformation-force model, efficiently learned via a Multi-Layer Perceptron; second, we solve the platform's inherently smooth motion using continuous-time B-spline kinematic models. By continuously applying Newton's Second Law, our method establishes a physical link between visually-derived trajectory acceleration and predicted deformation-induced acceleration. We demonstrate that our approach not only enables robust and accurate pose estimation on non-rigid platforms, but that the properly modeled platform physics instigate inertial sensing properties. We demonstrate this feasibility on a simple spring-camera system, and show how it robustly resolves the typically ill-posed problem of metric scale and gravity recovery in monocular visual odometry.

Metric, inertially aligned monocular state estimation via kinetodynamic priors

TL;DR

Abstract

Metric, inertially aligned monocular state estimation via kinetodynamic priors

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)