Control-Oriented Identification for the Linear Quadratic Regulator: Technical Report
Sean Anderson, João Pedro Hespanha
TL;DR
This work tackles data-driven control under model uncertainty by proposing a control-oriented offline experiment design that optimizes post-experiment controller performance. It develops a gradient-descent framework that uses a pathwise gradient estimator to solve the resulting nonconvex design problem, enabling SGD-based optimization of the experiment inputs. Specializing to the finite-horizon LQR, it derives a weighted Bayesian system identification scheme coupled with certainty-equivalence control and provides gradient expressions to enable efficient optimization. Numerical experiments in a car-string setting show the proposed design outperforms traditional A- and L-optimal designs and a robust dual-control approach, with favorable scaling and practical implications for data-efficient controller design.
Abstract
Data-driven control benefits from rich datasets, but constructing such datasets becomes challenging when gathering data is limited. We consider an offline experiment design approach to gathering data where we design a control input to collect data that will most improve the performance of a feedback controller. We show how such a control-oriented approach can be used in a setting with linear dynamics and quadratic objective and, through design of a gradient estimator, solve the problem via stochastic gradient descent. We show our formulation numerically outperforms an A- and L-optimal experiment design approach as well as a robust dual control approach.
