Learning Multiple Gaits within Latent Space for Quadruped Robots

Jinze Wu; Yufei Xue; Chenkun Qi

Learning Multiple Gaits within Latent Space for Quadruped Robots

Jinze Wu, Yufei Xue, Chenkun Qi

TL;DR

This work designs gait-dependent rewards that are constructed explicitly from gait parameters and implicitly from conditional adversarial motion priors (CAMP), and demonstrates such multiple gaits control on a quadruped robot Go1 with only proprioceptive sensors.

Abstract

Learning multiple gaits is non-trivial for legged robots, especially when encountering different terrains and velocity commands. In this work, we present an end-to-end training framework for learning multiple gaits for quadruped robots, tailored to the needs of robust locomotion, agile locomotion, and user's commands. A latent space is constructed concurrently by a gait encoder and a gait generator, which helps the agent to reuse multiple gait skills to achieve adaptive gait behaviors. To learn natural behaviors for multiple gaits, we design gait-dependent rewards that are constructed explicitly from gait parameters and implicitly from conditional adversarial motion priors (CAMP). We demonstrate such multiple gaits control on a quadruped robot Go1 with only proprioceptive sensors.

Learning Multiple Gaits within Latent Space for Quadruped Robots

TL;DR

Abstract

Paper Structure (16 sections, 8 equations, 5 figures, 5 tables)

This paper contains 16 sections, 8 equations, 5 figures, 5 tables.

INTRODUCTION
Related RL Methods for Locomotion
Contributions
METHOD
Reinforcement Learning Problem Formulation
Network Architecture
Latent Space for Gait Skills
Reward Functions Design
Training
Training Setup
Training Curriculum
RESULTS AND DISCUSSION
Latent Space Analysis
Ablation Study for the Learning Framework
Outdoor Experiments
...and 1 more sections

Figures (5)

Figure 1: Overview of the learning framework. We employ the asymmetric actor-critic framework to learn robust blind locomotion in one training phase. A latent space $\boldsymbol{Z}$ is constructed concurrently by a gait encoder and a gait generator, which helps the agent to reuse multiple gait skills to achieve adaptive gait behaviors. We use a conditional discriminator to guide the policy to reproduce the natural gait motions that are similar to the CAMP dataset generated from TO.
Figure 2: The original space of the gait parameters (left) and the latent space of the gait skills (right).
Figure 3: The t-SNE visualization for the gait skills in the latent space.
Figure 4: The DTW distance of the gait skills in the latent space. Darker colors represent higher proximity between the two gait skills in the latent space.
Figure 5: The robust (left) and agile (right) tests outdoors.

Learning Multiple Gaits within Latent Space for Quadruped Robots

TL;DR

Abstract

Learning Multiple Gaits within Latent Space for Quadruped Robots

Authors

TL;DR

Abstract

Table of Contents

Figures (5)