Exploring the Limitations of Behavior Cloning for Autonomous Driving
Felipe Codevilla, Eder Santana, Antonio M. López, Adrien Gaidon
TL;DR
This paper investigates end-to-end behavior cloning for autonomous driving using a new NoCrash benchmark in the CARLA simulator. It introduces a strong off-policy Conditional Imitation Learning baseline (CILRS) with a deeper ResNet backbone and a speed-prediction branch, achieving state-of-the-art results in complex urban scenarios and unseen environments. The study systematically reveals limitations of behavior cloning, including dataset bias, causal confusion, inertia, and high training variance, especially in dynamic traffic, and proposes NoCrash to better assess reactions to dynamic agents. It demonstrates the need for causal models, data diversity, and improved training stability before end-to-end BC can be considered for real-world deployment.
Abstract
Driving requires reacting to a wide variety of complex environment conditions and agent behaviors. Explicitly modeling each possible scenario is unrealistic. In contrast, imitation learning can, in theory, leverage data from large fleets of human-driven cars. Behavior cloning in particular has been successfully used to learn simple visuomotor policies end-to-end, but scaling to the full spectrum of driving behaviors remains an unsolved problem. In this paper, we propose a new benchmark to experimentally investigate the scalability and limitations of behavior cloning. We show that behavior cloning leads to state-of-the-art results, including in unseen environments, executing complex lateral and longitudinal maneuvers without these reactions being explicitly programmed. However, we confirm well-known limitations (due to dataset bias and overfitting), new generalization issues (due to dynamic objects and the lack of a causal model), and training instability requiring further research before behavior cloning can graduate to real-world driving. The code of the studied behavior cloning approaches can be found at https://github.com/felipecode/coiltraine .
