Normalizing Flows on the Product Space of SO(3) Manifolds for Probabilistic Human Pose Modeling
Olaf Dünkel, Tim Salzmann, Florian Pfaff
TL;DR
The paper addresses probabilistic modeling of human pose by learning a normalized density over the product space of SO(3) joint rotations. It introduces HuProSO3, a normalizing flow built from Möbius coupling layers and a quaternion affine transformation, with nonlinear autoregressive conditioning across joints and random conditioning orders to capture complex dependencies; the model can condition on context and support exact likelihood evaluation. The authors demonstrate HuProSO3 on an unconditional pose prior, inverse kinematics with and without occlusion, and 2D-to-3D uplifting, showing improved density modeling and competitive or superior performance to state-of-the-art priors and 6D NF baselines. This work enables uncertainty-aware, geometry-consistent human pose estimation for vision, robotics, and human-robot interaction tasks by modeling correlated joint rotations in a principled probabilistic framework.
Abstract
Normalizing flows have proven their efficacy for density estimation in Euclidean space, but their application to rotational representations, crucial in various domains such as robotics or human pose modeling, remains underexplored. Probabilistic models of the human pose can benefit from approaches that rigorously consider the rotational nature of human joints. For this purpose, we introduce HuProSO3, a normalizing flow model that operates on a high-dimensional product space of SO(3) manifolds, modeling the joint distribution for human joints with three degrees of freedom. HuProSO3's advantage over state-of-the-art approaches is demonstrated through its superior modeling accuracy in three different applications and its capability to evaluate the exact likelihood. This work not only addresses the technical challenge of learning densities on SO(3) manifolds, but it also has broader implications for domains where the probabilistic regression of correlated 3D rotations is of importance.
