Tracking Mouse from Incomplete Body-Part Observations and Deep-Learned Deformable-Mouse Model Motion-Track Constraint for Behavior Analysis
Olaf Hellwich, Niek Andresen, Katharina Hohlbaum, Marcus N. Boon, Monika Kwiatkowski, Simon Matern, Patrik Reiske, Henning Sprekeler, Christa ThöneReineke, Lars Lewejohann, Huma Ghani Zada, Michael Brück, Soledad Traverso
TL;DR
This work tackles the challenge of occlusion-driven gaps in $3D$ mouse body-part tracking by integrating multi-view video within a global coordinate system, enabling triangulation and bundle adjustment. It introduces a motion-track constraint that enforces temporal consistency of exterior orientation parameters and couples a rigid mouse model with deep-learned deformable body-part movements, implemented via a transformer/LSTM framework to predict deformable coordinates. The approach is instantiated through a functional, stochastic, and initialization-aware least-squares bundle adjustment across three cameras, validated with simulated data and real-world observations from DeepLabCut, demonstrating more complete and geometrically accurate 3D motion estimates for subsequent behavior analysis. The method has practical significance for detailed behavioral studies in mice and potentially other subjects where occlusions limit single-view tracking, providing richer motion representations to support classification and interpretation of learned behaviors.
Abstract
Tracking mouse body parts in video is often incomplete due to occlusions such that - e.g. - subsequent action and behavior analysis is impeded. In this conceptual work, videos from several perspectives are integrated via global exterior camera orientation; body part positions are estimated by 3D triangulation and bundle adjustment. Consistency of overall 3D track reconstruction is achieved by introduction of a 3D mouse model, deep-learned body part movements, and global motion-track smoothness constraint. The resulting 3D body and body part track estimates are substantially more complete than the original single-frame-based body part detection, therefore, allowing improved animal behavior analysis.
