RapidPoseTriangulation: Multi-view Multi-person Whole-body Human Pose Triangulation in a Millisecond
Daniel Bermuth, Alexander Poeppel, Wolfgang Reif
TL;DR
RapidPoseTriangulation presents a fast, learning-free approach to multi-view, multi-person whole-body pose estimation by performing pairwise 2D-to-3D triangulation of detected joints, followed by 3D-space merging and optional tracking. The method emphasizes geometric consistency and early pruning to achieve real-time performance and strong generalization across unseen datasets without heavy training. It demonstrates competitive accuracy on standard benchmarks, supports full-body outputs (including hands and face), and significantly outpaces voxel- and learning-based baselines in speed. The authors provide public source code to encourage adoption and further advances in real-time multi-view pose analysis for human-robot interaction and other applications.
Abstract
The integration of multi-view imaging and pose estimation represents a significant advance in computer vision applications, offering new possibilities for understanding human movement and interactions. This work presents a new algorithm that improves multi-view multi-person pose estimation, focusing on fast triangulation speeds and good generalization capabilities. The approach extends to whole-body pose estimation, capturing details from facial expressions to finger movements across multiple individuals and viewpoints. Adaptability to different settings is demonstrated through strong performance across unseen datasets and configurations. To support further progress in this field, all of this work is publicly accessible.
