Human Imitated Bipedal Locomotion with Frequency Based Gait Generator Network
Yusuf Baran Ates, Omer Morgul
TL;DR
This work tackles robust, human-like bipedal locomotion by marrying a frequency-domain gait generator learned from motion capture with a torque-controlled PPO policy. The gait generator maps morphology and desired speed to smooth joint trajectories via a 32-point FFT, and the PPO controller refines these references under an imitation-plus-gait reward, aided by random state initialization. The method demonstrates improved speed range, slope tolerance, and terrain robustness compared with CMA-ES SIMBICON and standard PPO baselines, while achieving smoother, more human-like joint coordination. This approach offers a practical, data-efficient path to natural locomotion with modest training costs and has potential for extension to full 3D humanoids and richer sensory feedback.
Abstract
Learning human-like, robust bipedal walking remains difficult due to hybrid dynamics and terrain variability. We propose a lightweight framework that combines a gait generator network learned from human motion with Proximal Policy Optimization (PPO) controller for torque control. Despite being trained only on flat or mildly sloped ground, the learned policies generalize to steeper ramps and rough surfaces. Results suggest that pairing spectral motion priors with Deep Reinforcement Learning (DRL) offers a practical path toward natural and robust bipedal locomotion with modest training cost.
