LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment
Yiming Ren, Xiao Han, Chengfeng Zhao, Jingya Wang, Lan Xu, Jingyi Yu, Yuexin Ma
TL;DR
LiveHPS addresses LiDAR-based scene-level 3D human pose and shape estimation in free environments by introducing a vertex-guided adaptive distillation framework, a consecutive pose optimizer that leverages temporal-spatial cues, and a skeleton-aware translation solver. The approach delivers full SMPL parameter estimation (pose, shape, translation) from single-LiDAR data without lighting or wearable constraints, achieving state-of-the-art results on FreeMotion and other datasets. A large-scale FreeMotion dataset with multi-view, multi-modal annotations supports robust learning and benchmarking, including privacy-conscious data handling. The work demonstrates strong generalization, occlusion robustness, and real-time performance, highlighting practical applicability for real-world robotics, AR/VR, and autonomous systems.
Abstract
For human-centric large-scale scenes, fine-grained modeling for 3D human global pose and shape is significant for scene understanding and can benefit many real-world applications. In this paper, we present LiveHPS, a novel single-LiDAR-based approach for scene-level human pose and shape estimation without any limitation of light conditions and wearable devices. In particular, we design a distillation mechanism to mitigate the distribution-varying effect of LiDAR point clouds and exploit the temporal-spatial geometric and dynamic information existing in consecutive frames to solve the occlusion and noise disturbance. LiveHPS, with its efficient configuration and high-quality output, is well-suited for real-world applications. Moreover, we propose a huge human motion dataset, named FreeMotion, which is collected in various scenarios with diverse human poses, shapes and translations. It consists of multi-modal and multi-view acquisition data from calibrated and synchronized LiDARs, cameras, and IMUs. Extensive experiments on our new dataset and other public datasets demonstrate the SOTA performance and robustness of our approach. We will release our code and dataset soon.
