FLD: Fourier Latent Dynamics for Structured Motion Representation and Learning
Chenhao Li, Elijah Stanger-Jones, Steve Heim, Sangbae Kim
TL;DR
The paper introduces Fourier Latent Dynamics (FLD), a self-supervised, structured representation that enforces latent dynamics in a continuously parameterized space to model periodic and quasi-periodic motions. Building on Periodic Autoencoder (PAE), FLD propagates latent states with globally constant parameters $(f,a,b)$ and uses a multi-step loss $L_{FLD}^N=\sum_{i=0}^N \alpha^i L_i$ to achieve stable long-horizon prediction, enabling autoregressive motion synthesis. In the motion-learning pipeline, FLD supplies targets via a learned latent space $\Theta$ and a local phase $\phi_t$, guiding a policy trained with PPO; online tracking includes a threshold-based evaluation and a fallback mechanism to safe states when proposals deviate from learned dynamics. Through experiments on a MIT Humanoid platform with locomotion data, FLD demonstrates a more structured latent manifold, improved long-horizon reconstruction/prediction, and robust online tracking with adaptive skill samplers (e.g., ALPGMM) that enhance generalization while avoiding unlearnable regions. Overall, FLD offers a principled approach to open-ended motion learning by exploiting spatial-temporal structure in the latent space to interpolate, generate, and safely track novel motions.
Abstract
Motion trajectories offer reliable references for physics-based motion learning but suffer from sparsity, particularly in regions that lack sufficient data coverage. To address this challenge, we introduce a self-supervised, structured representation and generation method that extracts spatial-temporal relationships in periodic or quasi-periodic motions. The motion dynamics in a continuously parameterized latent space enable our method to enhance the interpolation and generalization capabilities of motion learning algorithms. The motion learning controller, informed by the motion parameterization, operates online tracking of a wide range of motions, including targets unseen during training. With a fallback mechanism, the controller dynamically adapts its tracking strategy and automatically resorts to safe action execution when a potentially risky target is proposed. By leveraging the identified spatial-temporal structure, our work opens new possibilities for future advancements in general motion representation and learning algorithms.
