Exploring Dynamic Novel View Synthesis Technologies for Cinematography
Adrian Azzarelli, Nantheera Anantrasirichai, David R Bull
TL;DR
The paper investigates dynamic novel view synthesis for cinematography by contrasting NeRF and Gaussian Splatting (GS) representations, and by evaluating dynamic 4D approaches that separate canonical fields from motion via deformation fields or low-rank planes. It presents a spectrum of dynamic representations, from continuous deformation to key-frame interpolation and 4D primitives, and discusses data acquisition strategies, including depth initialization and COLMAP-based geometry. A production-style exhibition demonstrates three NVS configurations on a short scene, highlighting how dynamic NVS enables smooth camera trajectories, lighting variation, and non-rigid motion reconstruction, while also revealing challenges like temporal jitter and pose misalignment. The work underscores dynamic NVS as a practical tool for cinematography, capable of delivering high-quality background and motion effects with modest image counts, and points to directions such as improved calibration, dynamic masking, and depth-informed initialization to further close the gap with real-world shooting constraints.
Abstract
Novel view synthesis (NVS) has shown significant promise for applications in cinematographic production, particularly through the exploitation of Neural Radiance Fields (NeRF) and Gaussian Splatting (GS). These methods model real 3D scenes, enabling the creation of new shots that are challenging to capture in the real world due to set topology or expensive equipment requirement. This innovation also offers cinematographic advantages such as smooth camera movements, virtual re-shoots, slow-motion effects, etc. This paper explores dynamic NVS with the aim of facilitating the model selection process. We showcase its potential through a short montage filmed using various NVS models.
