Improving 2D Human Pose Estimation in Rare Camera Views with Synthetic Data
Miroslav Purkrabek, Jiri Matas
TL;DR
This work tackles the scarcity of extreme-view 2D human pose data by introducing RePoGen, an SMPL-X-based synthetic data generator that can produce novel poses and unseen views to augment COCO. By sampling from a bounded pose space and applying textures and random backgrounds, RePoGen yields diverse training data that improves top- and bottom-view pose estimation without sacrificing orbital-view accuracy, advancing performance in extreme-view scenarios. The authors present a new RePo dataset of real extreme-view images and the RePoGen dataset variants, demonstrate strong gains over baselines and AMASS-based synthesis, and show that strong rotation augmentation is crucial for extreme-view robustness. They also provide an analysis of pose spaces and emphasize that anatomical plausibility is not strictly required for effective learning, underscoring the practical impact of synthetic data in rare-camera-view contexts.
Abstract
Methods and datasets for human pose estimation focus predominantly on side- and front-view scenarios. We overcome the limitation by leveraging synthetic data and introduce RePoGen (RarE POses GENerator), an SMPL-based method for generating synthetic humans with comprehensive control over pose and view. Experiments on top-view datasets and a new dataset of real images with diverse poses show that adding the RePoGen data to the COCO dataset outperforms previous approaches to top- and bottom-view pose estimation without harming performance on common views. An ablation study shows that anatomical plausibility, a property prior research focused on, is not a prerequisite for effective performance. The introduced dataset and the corresponding code are available on https://mirapurkrabek.github.io/RePoGen-paper/ .
