A Model-Based Approach to Imitation Learning through Multi-Step Predictions
Haldun Balim, Yang Hu, Yuyang Zhang, Na Li
TL;DR
This work addresses imitation learning under distribution shift and measurement noise by introducing Predictive Imitation Learning (PIL), a model-based framework that leverages multi-step predictors and a consistency loss to anticipate long-horizon consequences without costly full-horizon unrolls. It provides finite-sample performance guarantees for linear time-invariant systems and demonstrates superior long-horizon robustness compared to Behavior Cloning and rollout-based IL across linear, nonlinear, and MuJoCo control tasks. The results show that horizon-aware, predictive modeling improves stability and generalization in imitation, highlighting a principled alternative to purely model-free or purely rollout-based methods. The approach has potential to impact safety-critical control and robotics by offering scalable, predictive imitation with provable guarantees.
Abstract
Imitation learning is a widely used approach for training agents to replicate expert behavior in complex decision-making tasks. However, existing methods often struggle with compounding errors and limited generalization, due to the inherent challenge of error correction and the distribution shift between training and deployment. In this paper, we present a novel model-based imitation learning framework inspired by model predictive control, which addresses these limitations by integrating predictive modeling through multi-step state predictions. Our method outperforms traditional behavior cloning numerical benchmarks, demonstrating superior robustness to distribution shift and measurement noise both in available data and during execution. Furthermore, we provide theoretical guarantees on the sample complexity and error bounds of our method, offering insights into its convergence properties.
