Maximum Entropy Density Control of Discrete-Time Linear Systems with Quadratic Cost
Kaito Ito, Kenji Kashima
TL;DR
This work develops a complete solution to maximum entropy density control for discrete-time linear systems with Gaussian endpoints and quadratic costs. By formulating the problem as an entropy-regularized LQ control, the authors derive a pair of coupled Riccati difference equations that uniquely determine the optimal policy, which is a time-varying Gaussian with a linear state-feedback mean plus an independent noise term. They further show an equivalent backward (time-reversed) density-control problem and establish a Schrödinger-bridge–like duality, enabling simultaneous forward and backward solutions. In the zero-entropy limit, the results converge to a closed-form unregularized density-control policy, clarifying the connection to standard covariance and mean steering. A numerical example illustrates how the theory shapes trajectories and covariances under different cost weights and regularization levels, underscoring practical applicability and robustness considerations.
Abstract
This paper addresses the problem of steering the distribution of the state of a discrete-time linear system to a given target distribution while minimizing an entropy-regularized cost functional. This problem is called a maximum entropy density control problem. Specifically, the running cost is given by quadratic forms of the state and the control input, and the initial and target distributions are Gaussian. We first reveal that our problem boils down to solving two Riccati difference equations coupled through their boundary values. Based on them, we give the closed-form expression of the unique optimal policy. Next, we show that the optimal density control of a backward system can be obtained simultaneously with the forward-time optimal policy. The backward solution gives another expression of the forward solution. Finally, by considering the limit where the entropy regularization vanishes, we derive the unregularized density control in closed form.
