Toward Reliable Human Pose Forecasting with Uncertainty
Saeed Saadatnejad, Mehrshad Mirmohammadi, Matin Daghyani, Parham Saremi, Yashar Zoroofchi Benisi, Amirhossein Alimohammadi, Zahra Tehraninasab, Taylor Mordan, Alexandre Alahi
TL;DR
This work tackles the lack of unified evaluation and uncertainty analysis in human pose forecasting by introducing UnPOSed, an open-source library with standardized datasets and metrics. It proposes two uncertainty paradigms: aleatoric uncertainty is modeled with priors that canalize learning toward shorter horizons, improving short-horizon accuracy by up to 25% without harming longer-horizons; epistemic uncertainty is quantified via a model-agnostic, clustering-based metric (EpU) derived from latent motion representations, enabling reliable out-of-distribution detection. The approach combines an uncertainty-aware loss with time–joint priors and a deep embedded clustering framework to estimate EpU, validated on Human3.6M, AMASS, and 3DPW, showing improved forecasting and better uncertainty estimation. Overall, the work contributes practical uncertainty handling for pose forecasting and a standardized, extensible benchmark to promote uncertainty-aware development in the field.
Abstract
Recently, there has been an arms race of pose forecasting methods aimed at solving the spatio-temporal task of predicting a sequence of future 3D poses of a person given a sequence of past observed ones. However, the lack of unified benchmarks and limited uncertainty analysis have hindered progress in the field. To address this, we first develop an open-source library for human pose forecasting, including multiple models, supporting several datasets, and employing standardized evaluation metrics, with the aim of promoting research and moving toward a unified and consistent evaluation. Second, we devise two types of uncertainty in the problem to increase performance and convey better trust: 1) we propose a method for modeling aleatoric uncertainty by using uncertainty priors to inject knowledge about the pattern of uncertainty. This focuses the capacity of the model in the direction of more meaningful supervision while reducing the number of learned parameters and improving stability; 2) we introduce a novel approach for quantifying the epistemic uncertainty of any model through clustering and measuring the entropy of its assignments. Our experiments demonstrate up to $25\%$ improvements in forecasting at short horizons, with no loss on longer horizons on Human3.6M, AMSS, and 3DPW datasets, and better performance in uncertainty estimation. The code is available online at https://github.com/vita-epfl/UnPOSed.
