Causal Composition Diffusion Model for Closed-loop Traffic Generation

Haohong Lin; Xin Huang; Tung Phan-Minh; David S. Hayden; Huan Zhang; Ding Zhao; Siddhartha Srinivasa; Eric M. Wolff; Hongge Chen

Causal Composition Diffusion Model for Closed-loop Traffic Generation

Haohong Lin, Xin Huang, Tung Phan-Minh, David S. Hayden, Huan Zhang, Ding Zhao, Siddhartha Srinivasa, Eric M. Wolff, Hongge Chen

TL;DR

This work addresses the challenge of generating traffic scenarios for autonomous vehicle safety that are simultaneously realistic and controllable over long horizons. It introduces CCDiff, a structure-guided diffusion model built on a Constrained Factored MDP and a learned Decision Causal Graph, augmented with Realism Constrained Score Matching and causal composition guidance. Empirical results on nuScenes and closed-loop simulators show CCDiff achieving superior realism and controllability compared with SOTA baselines, with improved metrics such as collision rate, off-road rate, FDE, and comfort. The approach provides interpretable causal structure for traffic reasoning and offers a scalable framework for safety-critical scenario generation, with potential for integration of larger foundation models and causal benchmarks.

Abstract

Simulation is critical for safety evaluation in autonomous driving, particularly in capturing complex interactive behaviors. However, generating realistic and controllable traffic scenarios in long-tail situations remains a significant challenge. Existing generative models suffer from the conflicting objective between user-defined controllability and realism constraints, which is amplified in safety-critical contexts. In this work, we introduce the Causal Compositional Diffusion Model (CCDiff), a structure-guided diffusion framework to address these challenges. We first formulate the learning of controllable and realistic closed-loop simulation as a constrained optimization problem. Then, CCDiff maximizes controllability while adhering to realism by automatically identifying and injecting causal structures directly into the diffusion process, providing structured guidance to enhance both realism and controllability. Through rigorous evaluations on benchmark datasets and in a closed-loop simulator, CCDiff demonstrates substantial gains over state-of-the-art approaches in generating realistic and user-preferred trajectories. Our results show CCDiff's effectiveness in extracting and leveraging causal structures, showing improved closed-loop performance based on key metrics such as collision rate, off-road rate, FDE, and comfort.

Causal Composition Diffusion Model for Closed-loop Traffic Generation

TL;DR

Abstract

Causal Composition Diffusion Model for Closed-loop Traffic Generation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (29)

Theorems & Definitions (2)