Discrete-Time Hybrid Automata Learning: Legged Locomotion Meets Skateboarding
Hang Liu, Sangli Teng, Ben Liu, Wei Zhang, Maani Ghaffari
TL;DR
Discrete-Time Hybrid Automata Learning (DHAL) tackles mode-switching in hybrid dynamical systems for legged locomotion by learning a discrete mode selector and mode-specific dynamics online, without trajectory segmentation or event-function labeling. The framework combines a DHA with a $\\beta$-VAE dynamics encoder, a Beta-distribution policy, and a Multi-Critic PPO-based RL loop to handle contact-rich, underactuated skateboarding on a quadruped robot, achieving sim-to-real transfer. Key contributions include a unsupervised mode identification mechanism, a discrete-time formulation that unifies switching and dynamics, and empirical validation showing robust mode-aware control and intuitive mode decomposition consistent with physical intuition. The work advances practical hybrid-control learning for complex robotics tasks, enabling stable, adaptive locomotion across varied terrains and disturbances.
Abstract
Hybrid dynamical systems, which include continuous flow and discrete mode switching, can model robotics tasks like legged robot locomotion. Model-based methods usually depend on predefined gaits, while model-free approaches lack explicit mode-switching knowledge. Current methods identify discrete modes via segmentation before regressing continuous flow, but learning high-dimensional complex rigid body dynamics without trajectory labels or segmentation is a challenging open problem. This paper introduces Discrete-time Hybrid Automata Learning (DHAL), a framework to identify and execute mode-switching without trajectory segmentation or event function learning. Besides, we embedded it in reinforcement learning pipeline and incorporates a beta policy distribution and a multi-critic architecture to model contact-guided motions, exemplified by a challenging quadrupedal robot skateboard task. We validate our method through sufficient real-world tests, demonstrating robust performance and mode identification consistent with human intuition in hybrid dynamical systems.
