Neural Control: Concurrent System Identification and Control Learning with Neural ODE

Cheng Chi

Neural Control: Concurrent System Identification and Control Learning with Neural ODE

Cheng Chi

TL;DR

The paper addresses controlling unknown continuous-time dynamical systems by jointly learning dynamics and optimal control via a coupled Neural ODE (NC). The approach replaces separate system identification with end-to-end training of two interacting networks: a dynamics learner $g_{\gamma}$ and a controller $h_{\theta}$, linked through $\dot{x}=g_{\gamma}(x, h_{\theta}(x,t), t)$. Training optimizes both trajectory fidelity to real dynamics and adherence to the control objective, using an alternating training scheme to progressively improve both components. Demonstrations on a linear Ax+Bu system and the CartPole task show data-efficient learning, achieving near-optimal control with significantly fewer real trajectories than typical reinforcement learning methods. This integration promises practical impact for unknown-dynamics control where data is scarce or expensive to collect.

Abstract

Controlling continuous-time dynamical systems is generally a two step process: first, identify or model the system dynamics with differential equations, then, minimize the control objectives to achieve optimal control function and optimal state trajectories. However, any inaccuracy in dynamics modeling will lead to sub-optimality in the resulting control function. To address this, we propose a neural ODE based method for controlling unknown dynamical systems, denoted as Neural Control (NC), which combines dynamics identification and optimal control learning using a coupled neural ODE. Through an intriguing interplay between the two neural networks in coupled neural ODE structure, our model concurrently learns system dynamics as well as optimal controls that guides towards target states. Our experiments demonstrate the effectiveness of our model for learning optimal control of unknown dynamical systems. Codes available at https://github.com/chichengmessi/neural_ode_control/tree/main

Neural Control: Concurrent System Identification and Control Learning with Neural ODE

TL;DR

and a controller

, linked through

. Training optimizes both trajectory fidelity to real dynamics and adherence to the control objective, using an alternating training scheme to progressively improve both components. Demonstrations on a linear Ax+Bu system and the CartPole task show data-efficient learning, achieving near-optimal control with significantly fewer real trajectories than typical reinforcement learning methods. This integration promises practical impact for unknown-dynamics control where data is scarce or expensive to collect.

Abstract

Paper Structure (15 sections, 9 equations, 9 figures, 1 algorithm)

This paper contains 15 sections, 9 equations, 9 figures, 1 algorithm.

Introduction
Method
Classical optimal control
Neural ODE
Neural Control: NC
Model architecture
Training Loss
Alternative Training
Experiments
Control task: Linear System
Control Task: CartPole
Conclusion
Alternative training motivation
Compare with analytical optimal control
Dynamic learner and Controller training loss

Figures (9)

Figure 1: Training loss for dynamics learner $g_{\gamma}$ and controller $h_{\theta}$
Figure 2: The $k_{th}$ alternation of the dynamics learner ${g}_{\gamma}$ and controller ${h}_{\theta}$ alternative training
Figure 3: The full training process of NC for Ax+Bu task
Figure 4: CartPole dynamical system
Figure 5: The full training process of NC for CartPole task
...and 4 more figures

Neural Control: Concurrent System Identification and Control Learning with Neural ODE

TL;DR

Abstract

Neural Control: Concurrent System Identification and Control Learning with Neural ODE

Authors

TL;DR

Abstract

Table of Contents

Figures (9)