Improving Real-Time Omnidirectional 3D Multi-Person Human Pose Estimation with People Matching and Unsupervised 2D-3D Lifting
Pawel Knap, Peter Hardy, Alberto Tamajo, Hwasup Lim, Hansung Kim
TL;DR
The paper tackles real-time 3D multi-person pose estimation in a 360° scene by fusing an omnidirectional camera with mmWave radars to overcome depth/scale ambiguities and occlusion. It presents a complete pipeline that combines OpenPose for 2D keypoints, radar-based localization to a global frame, and unsupervised 2D-3D lifting via the LInKs model, with careful camera–radar calibration and an improved cross-sensor matching module. Key contributions include calibration methods, a refined image–radar matching approach, and effective occlusion handling, achieving around 7–8 fps on a laptop GPU. The results demonstrate robust, scalable performance for indoor and outdoor environments, offering an affordable real-time solution for multi-person 3D pose estimation in challenging scenes.
Abstract
Current human pose estimation systems focus on retrieving an accurate 3D global estimate of a single person. Therefore, this paper presents one of the first 3D multi-person human pose estimation systems that is able to work in real-time and is also able to handle basic forms of occlusion. First, we adjust an off-the-shelf 2D detector and an unsupervised 2D-3D lifting model for use with a 360$^\circ$ panoramic camera and mmWave radar sensors. We then introduce several contributions, including camera and radar calibrations, and the improved matching of people within the image and radar space. The system addresses both the depth and scale ambiguity problems by employing a lightweight 2D-3D pose lifting algorithm that is able to work in real-time while exhibiting accurate performance in both indoor and outdoor environments which offers both an affordable and scalable solution. Notably, our system's time complexity remains nearly constant irrespective of the number of detected individuals, achieving a frame rate of approximately 7-8 fps on a laptop with a commercial-grade GPU.
