End-to-End Motion Capture from Rigid Body Markers with Geodesic Loss
Hai Lan, Zongyan Li, Jianmin Hu, Jialing Yang, Houde Dai
TL;DR
The paper introduces Rigid Body Markers (RBMs) as a sparse, unambiguous 6-DoF input unit for marker-based MoCap and demonstrates end-to-end SMPL parameter regression using a geodesic loss on SO(3). By synthesizing RBM data from AMASS and aligning real RBMs via a T-pose calibration, the approach achieves state-of-the-art accuracy while significantly reducing computation compared to optimization-based methods. The key contributions are the RBM hardware concept and the geodesic loss as a rotation-aware training objective, which together enable real-time, high-fidelity motion capture suitable for graphics, VR, and biomechanics. The method is validated on synthetic AMASS data and real-world Vicon data, and ablation studies confirm the benefits of pose normalization and the geodesic loss. Overall, the work offers a practical framework that matches or exceeds dense-marker performance with far lower setup complexity and computational cost.
Abstract
Marker-based optical motion capture (MoCap), while long regarded as the gold standard for accuracy, faces practical challenges, such as time-consuming preparation and marker identification ambiguity, due to its reliance on dense marker configurations, which fundamentally limit its scalability. To address this, we introduce a novel fundamental unit for MoCap, the Rigid Body Marker (RBM), which provides unambiguous 6-DoF data and drastically simplifies setup. Leveraging this new data modality, we develop a deep-learning-based regression model that directly estimates SMPL parameters under a geodesic loss. This end-to-end approach matches the performance of optimization-based methods while requiring over an order of magnitude less computation. Trained on synthesized data from the AMASS dataset, our end-to-end model achieves state-of-the-art accuracy in body pose estimation. Real-world data captured using a Vicon optical tracking system further demonstrates the practical viability of our approach. Overall, the results show that combining sparse 6-DoF RBM with a manifold-aware geodesic loss yields a practical and high-fidelity solution for real-time MoCap in graphics, virtual reality, and biomechanics.
