BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay
Catherine Weaver, Chen Tang, Ce Hao, Kenta Kawamoto, Masayoshi Tomizuka, Wei Zhan
TL;DR
BeTAIL addresses imitation learning for autonomous racing where human decisions are complex and non-Markovian. It pretrains a Behavior Transformer on offline demonstrations and then refines it online with a residual Adversarial Imitation Learning policy to correct distribution shifts. Across Lago Maggiore, Dragon Tail, and Mount Panorama, BeTAIL achieves faster, smoother laps and better stability than BeT or AIL alone, including transfer to unseen tracks. The approach demonstrates that combining sequence modeling with occupancy-matching imitation can robustly recover non-Markovian human-like decision-making in a high-fidelity racing simulator.
Abstract
Imitation learning learns a policy from demonstrations without requiring hand-designed reward functions. In many robotic tasks, such as autonomous racing, imitated policies must model complex environment dynamics and human decision-making. Sequence modeling is highly effective in capturing intricate patterns of motion sequences but struggles to adapt to new environments or distribution shifts that are common in real-world robotics tasks. In contrast, Adversarial Imitation Learning (AIL) can mitigate this effect, but struggles with sample inefficiency and handling complex motion patterns. Thus, we propose BeTAIL: Behavior Transformer Adversarial Imitation Learning, which combines a Behavior Transformer (BeT) policy from human demonstrations with online AIL. BeTAIL adds an AIL residual policy to the BeT policy to model the sequential decision-making process of human experts and correct for out-of-distribution states or shifts in environment dynamics. We test BeTAIL on three challenges with expert-level demonstrations of real human gameplay in Gran Turismo Sport. Our proposed residual BeTAIL reduces environment interactions and improves racing performance and stability, even when the BeT is pretrained on different tracks than downstream learning. Videos and code available at: https://sites.google.com/berkeley.edu/BeTAIL/home.
