PrevizWhiz: Combining Rough 3D Scenes and 2D Video to Guide Generative Video Previsualization
Erzhen Hu, Frederik Brudy, David Ledo, George Fitzmaurice, Fraser Anderson
TL;DR
PrevizWhiz tackles the gap between speed-focused storyboards and asset-heavy 3D previz by uniting rough 3D blocking with generative AI restyling and motion guidance. The system supports three fidelity levels and integrates a video playground for external-motion guidance, enabling rapid, multi-modal previews that communicate intent to stakeholders. A user study with 10 industry professionals shows improved iteration speed and cross-disciplinary collaboration, while revealing concerns about controllability, continuity, and the impact of AI on labor. The work demonstrates that AI-assisted previz can democratize creative exploration and collaboration in filmmaking, while underscoring the need for transparent provenance, attribution, and careful integration into existing pipelines.
Abstract
In pre-production, filmmakers and 3D animation experts must rapidly prototype ideas to explore a film's possibilities before fullscale production, yet conventional approaches involve trade-offs in efficiency and expressiveness. Hand-drawn storyboards often lack spatial precision needed for complex cinematography, while 3D previsualization demands expertise and high-quality rigged assets. To address this gap, we present PrevizWhiz, a system that leverages rough 3D scenes in combination with generative image and video models to create stylized video previews. The workflow integrates frame-level image restyling with adjustable resemblance, time-based editing through motion paths or external video inputs, and refinement into high-fidelity video clips. A study with filmmakers demonstrates that our system lowers technical barriers for film-makers, accelerates creative iteration, and effectively bridges the communication gap, while also surfacing challenges of continuity, authorship, and ethical consideration in AI-assisted filmmaking.
