Meta-Learning for Adaptive Control with Automated Mirror Descent

Sunbochen Tang; Haoyuan Sun; Navid Azizan

TL;DR

This work addresses adaptive control under disturbances that depend linearly on unknown parameters through nonlinear features. It proposes a meta-learning framework that jointly learns the nonlinear features and the mirror-descent (MD) potential. By embedding the MD adaptation law within a bi-level, multi-task meta-learning setup, the method automatically selects the Bregman divergence (via the $\ell_p$-norm family with parameter $p$) and learns feature representations $\hat{Y}(q,\dot{q};\theta_Y)$. Theoretical guarantees establish stability up to a bounded tracking error that vanishes when the feature approximation is exact. Numerical experiments on a planar quadrotor show improved tracking accuracy over a gradient-descent-based baseline and better generalization to out-of-distribution wind disturbances. The approach uses data-driven meta-optimization to tailor both the geometry of the parameter space and the learned features to control performance in uncertain, nonlinear environments.

Abstract

Adaptive control achieves concurrent parameter learning and stable control under uncertainties that are linearly parameterized with known nonlinear features. Nonetheless, it is often difficult to obtain such nonlinear features. To address this difficulty, recent progress has been made in integrating meta-learning with adaptive control to learn such nonlinear features from data. However, these meta-learning-based control methods rely on classical adaptation laws using gradient descent, which is confined to the Euclidean geometry. In this paper, we propose a novel method that combines meta-learning and adaptation laws based on mirror descent, a popular generalization of gradient descent, which takes advantage of the potentially non-Euclidean geometry of the parameter space. In our approach, meta-learning not only learns the nonlinear features but also searches for a suitable mirror-descent potential function that optimizes control performance. Through numerical simulations, we demonstrate the effectiveness of the proposed method in learning efficient representations and real-time tracking control performance under uncertain dynamics.
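To make the contrast between gradient descent and mirror descent concrete, the following is a minimal discrete-time sketch, not the paper's adaptation law. It assumes the potential $\psi(a) = \frac{1}{p}\|a\|_p^p$ as one member of the $\ell_p$-norm family; the exact potential used by the authors is not given in this summary. The function names, step size, and example vectors are hypothetical. With $p = 2$ the update reduces to an ordinary gradient step.

```python
import math

# Assumed potential (not confirmed by the source): psi(a) = (1/p) * ||a||_p^p, with p > 1.
# Its gradient acts elementwise: grad_psi(a)_i = sign(a_i) * |a_i|^(p - 1).

def grad_potential(a, p):
    """Map parameters to the dual (mirror) space via the gradient of psi."""
    return [math.copysign(abs(x) ** (p - 1.0), x) for x in a]

def grad_potential_inv(z, p):
    """Inverse of grad_potential: map dual variables back to parameters."""
    return [math.copysign(abs(x) ** (1.0 / (p - 1.0)), x) for x in z]

def mirror_descent_step(a, grad, lr, p):
    """One mirror-descent step: take the gradient step in the dual space."""
    z = [zi - lr * gi for zi, gi in zip(grad_potential(a, p), grad)]
    return grad_potential_inv(z, p)

# Hypothetical parameter estimate and loss gradient, for illustration only.
a = [0.5, -1.0, 2.0]
g = [0.1, 0.2, -0.3]
lr = 0.05

gd_step = [ai - lr * gi for ai, gi in zip(a, g)]   # plain gradient descent
md_p2 = mirror_descent_step(a, g, lr, p=2.0)       # coincides with gd_step
md_p3 = mirror_descent_step(a, g, lr, p=3.0)       # non-Euclidean geometry
```

Changing $p$ changes how a given gradient moves each coordinate, which is the degree of freedom the meta-learner is described as tuning.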

Paper Structure (19 sections, 2 theorems, 26 equations, 12 figures, 1 table)

Introduction
Preliminaries
Meta-Learning
Adaptive Control with Mirror-Descent Adaptation
Problem Formulation
Proposed Method: Adaptive Control with Meta-Learned Bregman Divergence
Base-Learner: Adaptive Control with Mirror-Descent-based Adaptation
Bi-level Meta-Training
Model Ensembles
Theoretical Results: Stability Guarantees
Numerical Simulations
Benchmark with Baseline Method
Simulation Results
Conclusions
Background on Adaptive Control with Gradient-based Adaptation
...and 4 more sections

Key Result

theorem 1

Assume the error between the feature function $Y$ and its neural network approximation $\hat{Y}$ is bounded, and let $\delta := \sup_{q, \dot{q}} \|Y(q, \dot{q}) - \hat{Y}(q, \dot{q})\|$. If we use $\hat{Y}$ in place of $Y$ in the adaptive controller (equ:man_adap) and apply it to the system in (equ:man)

Figures (12)

Figure 1: Diagram of the offline meta-learning and online adaptive control components of our proposed method. The online control components that use the offline-learned parameters $\theta$ are highlighted in blue.
Figure 2: Simulation results under disturbance $w=8.0$ [$m/s$]. The MD-based controller (blue) tracks significantly more accurately than the GD-based baseline controller.
Figure 3: State time-histories under disturbance $w=2.0$ [$m/s$].
Figure 4: $x$-$y$ phase plot under disturbance $w=2.0$ [$m/s$].
Figure 5: State time-histories under disturbance $w=4.0$ [$m/s$].
...and 7 more figures

Theorems & Definitions (5)

remark 1
theorem 1
remark 2
lemma 1
proof
