IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving

Clémence Grislain; Risto Vuorio; Cong Lu; Shimon Whiteson

IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving

Clémence Grislain, Risto Vuorio, Cong Lu, Shimon Whiteson

TL;DR

The paper tackles the imitation gap in imitation learning for autonomous driving, formalizing the gap as a mismatch between expert and imitator observations $O_{expert} \neq O_{imitator}$. It introduces IGDrivSim, a benchmark atop the Waymax simulator that imposes partial observability to systematically study how BC-based IL performs when perception differs from human drivers. The key finding is that BC alone often fails to learn safe and effective policies under the imitation gap, but integrating BC with reinforcement learning through a simple penalty reward (PPO-based) significantly mitigates failures and improves safety metrics. By releasing open-source code and motion-prediction baselines, the work provides a practical tool for evaluating and developing perception-aware driving policies tailored to the sensors of self-driving cars.

Abstract

Developing autonomous vehicles that can navigate complex environments with human-level safety and efficiency is a central goal in self-driving research. A common approach to achieving this is imitation learning, where agents are trained to mimic human expert demonstrations collected from real-world driving scenarios. However, discrepancies between human perception and the self-driving car's sensors can introduce an $\textit{imitation}$ gap, leading to imitation learning failures. In this work, we introduce $\textbf{IGDrivSim}$, a benchmark built on top of the Waymax simulator, designed to investigate the effects of the imitation gap in learning autonomous driving policy from human expert demonstrations. Our experiments show that this perception gap between human experts and self-driving agents can hinder the learning of safe and effective driving behaviors. We further show that combining imitation with reinforcement learning, using a simple penalty reward for prohibited behaviors, effectively mitigates these failures. Our code is open-sourced at: https://github.com/clemgris/IGDrivSim.git.

IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving

TL;DR

The paper tackles the imitation gap in imitation learning for autonomous driving, formalizing the gap as a mismatch between expert and imitator observations

. It introduces IGDrivSim, a benchmark atop the Waymax simulator that imposes partial observability to systematically study how BC-based IL performs when perception differs from human drivers. The key finding is that BC alone often fails to learn safe and effective policies under the imitation gap, but integrating BC with reinforcement learning through a simple penalty reward (PPO-based) significantly mitigates failures and improves safety metrics. By releasing open-source code and motion-prediction baselines, the work provides a practical tool for evaluating and developing perception-aware driving policies tailored to the sensors of self-driving cars.

Abstract

gap, leading to imitation learning failures. In this work, we introduce

, a benchmark built on top of the Waymax simulator, designed to investigate the effects of the imitation gap in learning autonomous driving policy from human expert demonstrations. Our experiments show that this perception gap between human experts and self-driving agents can hinder the learning of safe and effective driving behaviors. We further show that combining imitation with reinforcement learning, using a simple penalty reward for prohibited behaviors, effectively mitigates these failures. Our code is open-sourced at: https://github.com/clemgris/IGDrivSim.git.

IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving

TL;DR

Abstract

IGDrivSim: A Benchmark for the Imitation Gap in Autonomous Driving

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)