SynthoGestures: A Novel Framework for Synthetic Dynamic Hand Gesture Generation for Driving Scenarios
Amr Gomaa, Robin Zitt, Guillermo Reyes, Antonio Krüger
TL;DR
The paper addresses the data bottleneck for dynamic hand gesture recognition in driving contexts by introducing SynthoGestures, an Unreal Engine-based framework that synthesizes diverse, dynamic gestures across camera modalities (RGB, depth, infrared) and viewpoints. It combines configurable gesture variations (speed, hand shape, lighting) with both description-based and animation-based generation modes, guided by a spline and an IK-based Control Rig for realistic motion. Experimental results show that augmenting real data with synthetic data improves recognition performance when trained from scratch, particularly with substantial synthetic variation, suggesting a practical path to faster development of automotive gesture recognition systems. Overall, SynthoGestures offers a cost-effective, flexible tool to accelerate gesture data generation and model training beyond automotive to other HCI domains.
Abstract
Creating a diverse and comprehensive dataset of hand gestures for dynamic human-machine interfaces in the automotive domain can be challenging and time-consuming. To overcome this challenge, we propose using synthetic gesture datasets generated by virtual 3D models. Our framework utilizes Unreal Engine to synthesize realistic hand gestures, offering customization options and reducing the risk of overfitting. Multiple variants, including gesture speed, performance, and hand shape, are generated to improve generalizability. In addition, we simulate different camera locations and types, such as RGB, infrared, and depth cameras, without incurring additional time and cost to obtain these cameras. Experimental results demonstrate that our proposed framework, SynthoGestures (https://github.com/amrgomaaelhady/SynthoGestures), improves gesture recognition accuracy and can replace or augment real-hand datasets. By saving time and effort in the creation of the data set, our tool accelerates the development of gesture recognition systems for automotive applications.
