Learning Robust Control Policies for Inverted Pose on Miniature Blimp Robots

Yuanlin Yang; Lin Hong; Fumin Zhang

Learning Robust Control Policies for Inverted Pose on Miniature Blimp Robots

Yuanlin Yang, Lin Hong, Fumin Zhang

TL;DR

A novel framework that enables robust control policy learning for inverted pose on miniature blimp robots and confirms that the learned policy with a mapping layer enables an MBR to achieve and maintain a fully upside-down pose in real-world settings.

Abstract

The ability to achieve and maintain inverted poses is essential for unlocking the full agility of miniature blimp robots (MBRs). However, developing reliable control methods for MBRs remains challenging due to their complex and underactuated dynamics. To address this challenge, we propose a novel framework that enables robust control policy learning for inverted pose on MBRs. The proposed framework operates through three core stages: First, a high-fidelity three-dimensional (3D) simulation environment was constructed, which was calibrated against real-world MBR motion data to ensure accurate replication of inverted-state dynamics. Second, a robust policy for MBR inverted control was trained within the simulation environment via a domain randomization strategy and a modified Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm. Third, a mapping layer was designed to bridge the sim-to-real gap for the learned policy deployment. Comprehensive evaluations in the simulation environment demonstrate that the learned policy achieves a higher success rate compared to the energy-shaping controller. Furthermore, experimental results confirm that the learned policy with a mapping layer enables an MBR to achieve and maintain a fully upside-down pose in real-world settings.

Learning Robust Control Policies for Inverted Pose on Miniature Blimp Robots

TL;DR

Abstract

Paper Structure (17 sections, 11 equations, 13 figures, 7 tables, 2 algorithms)

This paper contains 17 sections, 11 equations, 13 figures, 7 tables, 2 algorithms.

INTRODUCTION
Problem Formulation
Dynamic Model of MBRs
Inverted Control Problem Statement
Methodology
Simulation Environment
Physics-informed Domain Randomization
TD3 with Multi-buffer and Clipping
Implementation Details
Evaluation and Experiment
Performance of the Trained Policy to $m_w$ Variations
Performance of the Trained Policy to $\lambda$ Variations
Performance of the Trained Policy to $g_m$ Variations
Performance of the Trained Policy to $m_w$, $\lambda$ and $g_m$ Variations
Ablation Study
...and 2 more sections

Figures (13)

Figure 1: Overview of the proposed method for inverted control of MBRs. The MBR with an upright pose can reach and maintain the inverted pose with the learned robust policy.
Figure 2: Dynamic model analysis of the MBR: (a) Upright pose; (b) Inverted pose.
Figure 3: Pipeline of developing and deploying robust policy for inverted control of MBRs.
Figure 4: 3D simulation environment designed for robust policy learning for inverted control of MBRs.
Figure 5: The MBR's structure in the simulation environment.
...and 8 more figures

Learning Robust Control Policies for Inverted Pose on Miniature Blimp Robots

TL;DR

Abstract

Learning Robust Control Policies for Inverted Pose on Miniature Blimp Robots

Authors

TL;DR

Abstract

Table of Contents

Figures (13)