Generative AI for Autonomous Driving: A Review
Katharina Winter, Abhishek Vivekanandan, Rupert Polley, Yinzhe Shen, Christian Schlauch, Mohamed-Khalil Bouzidi, Bojan Derajic, Natalie Grabowsky, Annajoyce Mariani, Dennis Rochau, Giovanni Lucente, Harsh Yadav, Firas Mualla, Adam Molin, Sebastian Bernhard, Christian Wirth, Ömer Şahin Taş, Nadja Klein, Fabian B. Flohr, Hanno Gottschalk
TL;DR
Generative AI for Autonomous Driving surveys a broad set of GenAI models and hybrid methods applied to driving tasks, including static map and dynamic scenario generation, trajectory forecasting, and motion planning. It emphasizes the complementary roles of diffusion models, VAEs, GANs, normalizing flows, energy-based models, and transformers, and discusses conditioning, uncertainty quantification, and classical learning strategies to stabilize training. The review also covers the autonomous driving stack, data modalities, world models, and end-to-end driving, highlighting open challenges in safety, interpretability, and real-time feasibility, while offering practical recommendations and directions for future research. The work underscores the potential of hybrid GenAI-plus-traditional approaches and LLM-enabled planning, but calls for robust evaluation, domain-gap assessment, and efficient deployment on edge devices to translate advances into market-ready AD systems.
Abstract
Generative AI (GenAI) is rapidly advancing the field of Autonomous Driving (AD), extending beyond traditional applications in text, image, and video generation. We explore how generative models can enhance automotive tasks, such as static map creation, dynamic scenario generation, trajectory forecasting, and vehicle motion planning. By examining multiple generative approaches ranging from Variational Autoencoder (VAEs) over Generative Adversarial Networks (GANs) and Invertible Neural Networks (INNs) to Generative Transformers (GTs) and Diffusion Models (DMs), we highlight and compare their capabilities and limitations for AD-specific applications. Additionally, we discuss hybrid methods integrating conventional techniques with generative approaches, and emphasize their improved adaptability and robustness. We also identify relevant datasets and outline open research questions to guide future developments in GenAI. Finally, we discuss three core challenges: safety, interpretability, and realtime capabilities, and present recommendations for image generation, dynamic scenario generation, and planning.
