ChatEMG: Synthetic Data Generation to Control a Robotic Hand Orthosis for Stroke
Jingxi Xu, Runsheng Wang, Siqi Shang, Ava Chen, Lauren Winterbottom, To-Liang Hsu, Wenxi Chen, Khondoker Ahmed, Pedro Leandro La Rotta, Xinyue Zhu, Dawn M. Nilsen, Joel Stein, Matei Ciocarlie
TL;DR
This paper tackles the data bottleneck and variability in EMG-based intent inferral for stroke-hand orthoses. It introduces ChatEMG, a transformer-based autoregressive model trained on a large offline EMG corpus that can generate personalized synthetic EMG sequences conditioned on prompts from a new condition/session/subject. By augmenting a small amount of real data with synthetic samples, ChatEMG improves classifier performance across multiple architectures and adaptation scenarios, and can be deployed within a single patient session to enable functional tasks. The work demonstrates substantial gains in intent inferral accuracy and provides visualization evidence that synthetic data closely matches real data distributions, supporting practical deployment in rehabilitation robotics.
Abstract
Intent inferral on a hand orthosis for stroke patients is challenging due to the difficulty of data collection. Additionally, EMG signals exhibit significant variations across different conditions, sessions, and subjects, making it hard for classifiers to generalize. Traditional approaches require a large labeled dataset from the new condition, session, or subject to train intent classifiers; however, this data collection process is burdensome and time-consuming. In this paper, we propose ChatEMG, an autoregressive generative model that can generate synthetic EMG signals conditioned on prompts (i.e., a given sequence of EMG signals). ChatEMG enables us to collect only a small dataset from the new condition, session, or subject and expand it with synthetic samples conditioned on prompts from this new context. ChatEMG leverages a vast repository of previous data via generative training while still remaining context-specific via prompting. Our experiments show that these synthetic samples are classifier-agnostic and can improve intent inferral accuracy for different types of classifiers. We demonstrate that our complete approach can be integrated into a single patient session, including the use of the classifier for functional orthosis-assisted tasks. To the best of our knowledge, this is the first time an intent classifier trained partially on synthetic data has been deployed for functional control of an orthosis by a stroke survivor. Videos, source code, and additional information can be found at https://jxu.ai/chatemg.
