Adaptive Torque Control of Exoskeletons under Spasticity Conditions via Reinforcement Learning
Andrés Chavarrías, David Rodriguez-Cianca, Pablo Lanillos
TL;DR
<3-5 sentence high-level summary>The paper tackles safety and personalization in wearable knee exoskeletons for people with spasticity, where velocity-dependent reflexes alter dynamics. It introduces a digital twin combining musculoskeletal and exoskeleton models with a differentiable spasticity model. An adaptive torque controller based on Soft Actor-Critic learns to mitigate spasticity effects and reduce interaction torques compared with a PID baseline. Results show average knee torque reductions of about 10.6% and RMS reductions of about 8.9%, supporting the viability of RL for safe, subject-specific WR control.
Abstract
Spasticity is a common movement disorder symptom in individuals with cerebral palsy, hereditary spastic paraplegia, spinal cord injury and stroke, being one of the most disabling features in the progression of these diseases. Despite the potential benefit of using wearable robots to treat spasticity, their use is not currently recommended to subjects with a level of spasticity above ${1^+}$ on the Modified Ashworth Scale. The varying dynamics of this velocity-dependent tonic stretch reflex make it difficult to deploy safe personalized controllers. Here, we describe a novel adaptive torque controller via deep reinforcement learning (RL) for a knee exoskeleton under joint spasticity conditions, which accounts for task performance and interaction forces reduction. To train the RL agent, we developed a digital twin, including a musculoskeletal-exoskeleton system with joint misalignment and a differentiable spastic reflexes model for the muscles activation. Results for a simulated knee extension movement showed that the agent learns to control the exoskeleton for individuals with different levels of spasticity. The proposed controller was able to reduce maximum torques applied to the human joint under spastic conditions by an average of 10.6\% and decreases the root mean square until the settling time by 8.9\% compared to a conventional compliant controller.
