Accelerating Sampling-Based Control via Learned Linear Koopman Dynamics

Wenjian Hao; Yuxuan Fang; Zehui Lu; Shaoshuai Mou

Accelerating Sampling-Based Control via Learned Linear Koopman Dynamics

Wenjian Hao, Yuxuan Fang, Zehui Lu, Shaoshuai Mou

TL;DR

To improve the computational efficiency of classic MPPI while preserving control performance, the nonlinear dynamics used for trajectory propagation are replaced with a learned linear deep Koopman operator (DKO) model, enabling faster rollout and more efficient trajectory sampling.

Abstract

This paper presents an efficient model predictive path integral (MPPI) control framework for systems with complex nonlinear dynamics. To improve the computational efficiency of classic MPPI while preserving control performance, we replace the nonlinear dynamics used for trajectory propagation with a learned linear deep Koopman operator (DKO) model, enabling faster rollout and more efficient trajectory sampling. The DKO dynamics are learned directly from interaction data, eliminating the need for analytical system models. The resulting controller, termed MPPI-DK, is evaluated in simulation on pendulum balancing and surface vehicle navigation tasks, and validated on hardware through reference-tracking experiments on a quadruped robot. Experimental results demonstrate that MPPI-DK achieves control performance close to MPPI with true dynamics while substantially reducing computational cost, enabling efficient real-time control on robotic platforms.

Accelerating Sampling-Based Control via Learned Linear Koopman Dynamics

TL;DR

Abstract

Paper Structure (11 sections, 28 equations, 4 figures, 3 tables, 1 algorithm)

This paper contains 11 sections, 28 equations, 4 figures, 3 tables, 1 algorithm.

Introduction
The Problem and Preliminaries
Problem Formulation
Dynamics Learning using DKO
MPPI control
Methodology
Numerical Simulations
Pendulum Balancing
Surface Vehicle Navigation
Experiments Results
Concluding Remarks

Figures (4)

Figure 1: Average trajectory cost over $5$ independent trials for different lifting function architectures and training datasets.
Figure 2: Average trajectory cost over $5$ independent trials for different lifting dimensions and training datasets.
Figure 3: Tracking error over time. The solid line represents the mean over $4$ independent trials, and the shaded region indicates one standard deviation.
Figure 4: Representative trajectories generated by MPPI-DK and classic MPPI using true dynamics on a Unitree Go1 quadruped robot. In the experiment, the robot starts from the initial state $(x, y, \theta) = (-0.5 \text{m}, 0.4\text{m}, 0.3\text{rad})$ and navigates to the goal state $(x_{\mathrm{goal}}, y_{\mathrm{goal}}, \theta_{\mathrm{goal}}) = (1.5\text{m}, 0\text{m}, 0\text{rad})$.

Theorems & Definitions (1)

Remark 1

Accelerating Sampling-Based Control via Learned Linear Koopman Dynamics

TL;DR

Abstract

Accelerating Sampling-Based Control via Learned Linear Koopman Dynamics

Authors

TL;DR

Abstract

Table of Contents

Figures (4)

Theorems & Definitions (1)