Driving Animatronic Robot Facial Expression From Speech
Boren Li, Hang Li, Hangxin Liu
TL;DR
This work tackles the problem of generating speech-synchronized, lifelike facial expressions on an animatronic robot. It introduces a skinning-centric framework centered on linear blend skinning (LBS) to unify embodiment design and motion synthesis, including an LBS-based actuation topology and a learning-based speech-to-blendshape pipeline. The system achieves real-time performance exceeding 4000 fps on an NVIDIA RTX 4090, with millimeter-scale tracking verified by VICON and a VOCASET-based training regime using a Wav2vec2-based speech encoder; an LBS decoder achieves consistent, lip-synchronized expressions during inference. A blind user study supports the perceptual naturalness of generated motions, and the authors release their code to advance research in speech-driven animatronic facial expression generation.
Abstract
Animatronic robots hold the promise of enabling natural human-robot interaction through lifelike facial expressions. However, generating realistic, speech-synchronized robot expressions poses significant challenges due to the complexities of facial biomechanics and the need for responsive motion synthesis. This paper introduces a novel, skinning-centric approach to drive animatronic robot facial expressions from speech input. At its core, the proposed approach employs linear blend skinning (LBS) as a unifying representation, guiding innovations in both embodiment design and motion synthesis. LBS informs the actuation topology, facilitates human expression retargeting, and enables efficient speech-driven facial motion generation. This approach demonstrates the capability to produce highly realistic facial expressions on an animatronic face in real-time at over 4000 fps on a single Nvidia RTX 4090, significantly advancing robots' ability to replicate nuanced human expressions for natural interaction. To foster further research and development in this field, the code has been made publicly available at: \url{https://github.com/library87/OpenRoboExp}.
