Distributionally Robust Imitation Learning: Layered Control Architecture for Certifiable Autonomy
Aditya Gahlawat, Ahmed Aboudonia, Sandeep Banik, Naira Hovakimyan, Nikolai Matni, Aaron D. Ames, Gioele Zardini, Alberto Speranzon
TL;DR
The paper tackles distribution shifts in imitation learning by proposing DRIP, a Layered Control Architecture that unites TaSIL (robustness to policy shifts) with $\mathcal{L}_1$-DRAC (robustness to modeling uncertainty). It decouples policy-induced and uncertainty-induced gaps, deriving tractable bounds: TaSIL yields a trajectory-level imitation gap that scales as $O\left(\frac{\log n}{n}\right)$ with data size, while $\mathcal{L}_1$-DRAC provides a training-free, probabilistic bound largely independent of data size. The resulting total imitation gap is the sum of these certifiable components, enabling a fully certifiable autonomy pipeline that integrates learning with robust control and perception. Numerical experiments illustrate that TaSIL alone may fail under uncertainty, whereas the combined DRIP architecture stabilizes the system and bounds the imitation gap, highlighting practical implications for certifiable autonomous systems.
Abstract
Imitation learning (IL) enables autonomous behavior by learning from expert demonstrations. While more sample-efficient than comparative alternatives like reinforcement learning, IL is sensitive to compounding errors induced by distribution shifts. There are two significant sources of distribution shifts when using IL-based feedback laws on systems: distribution shifts caused by policy error and distribution shifts due to exogenous disturbances and endogenous model errors due to lack of learning. Our previously developed approaches, Taylor Series Imitation Learning (TaSIL) and $\mathcal{L}_1$ -Distributionally Robust Adaptive Control (\ellonedrac), address the challenge of distribution shifts in complementary ways. While TaSIL offers robustness against policy error-induced distribution shifts, \ellonedrac offers robustness against distribution shifts due to aleatoric and epistemic uncertainties. To enable certifiable IL for learned and/or uncertain dynamical systems, we formulate \textit{Distributionally Robust Imitation Policy (DRIP)} architecture, a Layered Control Architecture (LCA) that integrates TaSIL and~\ellonedrac. By judiciously designing individual layer-centric input and output requirements, we show how we can guarantee certificates for the entire control pipeline. Our solution paves the path for designing fully certifiable autonomy pipelines, by integrating learning-based components, such as perception, with certifiable model-based decision-making through the proposed LCA approach.
