SDR-GAIN: A High Real-Time Occluded Pedestrian Pose Completion Method for Autonomous Driving
Honghao Fu, Yongli Gu, Yidong Yan, Yilang Shen, Yiwen Wu, Libo Sun
TL;DR
The paper tackles real-time completion of occluded pedestrian poses in autonomous driving by introducing SDR-GAIN, a lightweight self-supervised GAN framework that imputes missing keypoints from coordinate distributions via pose separation and dimensionality reduction. It leverages dual generators for head and torso, masked-learning with hints, and pose standardization to learn robust spatial relationships, achieving microsecond inference and outperforming both traditional imputation methods and Transformer-based approaches on COCO and JAAD. Ablation studies validate the effectiveness of separation, dimensionality reduction, and the two-generator design, while highlighting remaining challenges such as dataset size and adversarial training stability. Overall, SDR-GAIN offers a practical, real-time solution for occlusion-resilient pedestrian pose estimation in autonomous driving systems.
Abstract
With the advancement of vision-based autonomous driving technology, pedestrian detection have become an important component for improving traffic safety and driving system robustness. Nevertheless, in complex traffic scenarios, conventional pose estimation approaches frequently fail to accurately reconstruct occluded keypoints, primarily due to obstructions caused by vehicles, vegetation, or architectural elements. To address this issue, we propose a novel real-time occluded pedestrian pose completion framework termed Separation and Dimensionality Reduction-based Generative Adversarial Imputation Nets (SDR-GAIN). Unlike previous approaches that train visual models to distinguish occlusion patterns, SDR-GAIN aims to learn human pose directly from the numerical distribution of keypoint coordinates and interpolate missing positions. It employs a self-supervised adversarial learning paradigm to train lightweight generators with residual structures for the imputation of missing pose keypoints. Additionally, it integrates multiple pose standardization techniques to alleviate the difficulty of the learning process. Experiments conducted on the COCO and JAAD datasets demonstrate that SDR-GAIN surpasses conventional machine learning and Transformer-based missing data interpolation algorithms in accurately recovering occluded pedestrian keypoints, while simultaneously achieving microsecond-level real-time inference.
