FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis

Shijie Chen; Peixi Peng

FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis

Shijie Chen, Peixi Peng

TL;DR

FreeGen tackles the challenge of free-viewpoint driving scene synthesis from a single trajectory by coupling a fast feed-forward 3D Gaussian Splatting reconstruction with a geometry-aware diffusion refinement. A geometry-conditioned diffusion module preserves structural fidelity while enabling realistic extrapolation beyond observed viewpoints, and a closed-loop co-training scheme distills generative priors back into the reconstruction path. The approach yields state-of-the-art results on off-trajectory syntheses, with improved temporal coherence and visual realism, while remaining scalable and annotation-light. Overall, FreeGen offers a practical pathway to high-fidelity, consistent free-viewpoint driving scenes suitable for closed-loop simulation and pretraining at scale.

Abstract

Closed-loop simulation and scalable pre-training for autonomous driving require synthesizing free-viewpoint driving scenes. However, existing datasets and generative pipelines rarely provide consistent off-trajectory observations, limiting large-scale evaluation and training. While recent generative models demonstrate strong visual realism, they struggle to jointly achieve interpolation consistency and extrapolation realism without per-scene optimization. To address this, we propose FreeGen, a feed-forward reconstruction-generation co-training framework for free-viewpoint driving scene synthesis. The reconstruction model provides stable geometric representations to ensure interpolation consistency, while the generation model performs geometry-aware enhancement to improve realism at unseen viewpoints. Through co-training, generative priors are distilled into the reconstruction model to improve off-trajectory rendering, and the refined geometry in turn offers stronger structural guidance for generation. Experiments demonstrate that FreeGen achieves state-of-the-art performance for free-viewpoint driving scene synthesis.

FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis

TL;DR

Abstract

FreeGen: Feed-Forward Reconstruction-Generation Co-Training for Free-Viewpoint Driving Scene Synthesis

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)