Phase-Aware Policy Learning for Skateboard Riding of Quadruped Robots via Feature-wise Linear Modulation
Minsung Yoon, Jeil Jeong, Sung-Eui Yoon
TL;DR
The paper tackles efficient, phase-aware skateboarding with quadruped robots by addressing the multi-modal, cyclic nature of riding and perception-driven control. It introduces Phase-Aware Policy Learning (PAPL), a phase-conditioned reinforcement learning framework that uses Feature-wise Linear Modulation (FiLM) to encode phase-specific behaviors within a shared policy, along with asymmetric privileged learning and distillation to bridge sim-to-real gaps. The method combines a phase clock with a phase-conditioned reward design and exteroceptive sensing to achieve robust, steering-capable riding and energy-efficient locomotion, demonstrated in simulation and transferred to a real robot without fine-tuning. Together, these contributions yield a practical approach for autonomous skateboard-riding quadrupeds with improved robustness, efficiency, and real-world applicability.
Abstract
Skateboards offer a compact and efficient means of transportation as a type of personal mobility device. However, controlling them with legged robots poses several challenges for policy learning due to perception-driven interactions and multi-modal control objectives across distinct skateboarding phases. To address these challenges, we introduce Phase-Aware Policy Learning (PAPL), a reinforcement-learning framework tailored for skateboarding with quadruped robots. PAPL leverages the cyclic nature of skateboarding by integrating phase-conditioned Feature-wise Linear Modulation layers into actor and critic networks, enabling a unified policy that captures phase-dependent behaviors while sharing robot-specific knowledge across phases. Our evaluations in simulation validate command-tracking accuracy and conduct ablation studies quantifying each component's contribution. We also compare locomotion efficiency against leg and wheel-leg baselines and show real-world transferability.
