Natural Gradient Descent for Control

Ramin Esmzad; Farnaz Adib Yaghmaie; Hamidreza Modares

Natural Gradient Descent for Control

Ramin Esmzad, Farnaz Adib Yaghmaie, Hamidreza Modares

TL;DR

This work addresses direct trajectory control for stochastic LTI systems by reframing controller design as a natural gradient descent problem preconditioned by the state covariance. The authors derive a Fisher Information Matrix-based preconditioner, show how stationary covariance drives the design via LMIs that yield a $\lambda$-contractive closed-loop with $K=F Y^{-1}$, and establish stability with explicit step-size bounds. A key contribution is representing the closed-loop dynamics through a gradient-flow perspective, enabling explicit trajectory shaping that is more interpretable than traditional LQR tuning. Simulation on a Quanser rotary pendulum demonstrates predictable, trajectory-focused behavior compared to standard LQR, highlighting the framework's potential for transparent low-level control and future extensions to nonlinear, time-varying, RL-integrated, and MPC-based settings.

Abstract

This paper bridges optimization and control, and presents a novel closed-loop control framework based on natural gradient descent, offering a trajectory-oriented alternative to traditional cost-function tuning. By leveraging the Fisher Information Matrix, we formulate a preconditioned gradient descent update that explicitly shapes system trajectories. We show that, in sharp contrast to traditional controllers, our approach provides flexibility to shape the system's low-level behavior. To this end, the proposed method parameterizes closed-loop dynamics in terms of stationary covariance and an unknown cost function, providing a geometric interpretation of control adjustments. We establish theoretical stability conditions. The simulation results on a rotary inverted pendulum benchmark highlight the advantages of natural gradient descent in trajectory shaping.

Natural Gradient Descent for Control

TL;DR

Abstract

Natural Gradient Descent for Control

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (4)