FlowIBR: Leveraging Pre-Training for Efficient Neural Image-Based Rendering of Dynamic Scenes
Marcel Büsching, Josef Bengtson, David Nilsson, Mårten Björkman
TL;DR
FlowIBR tackles monocular dynamic novel-view synthesis by decoupling scene dynamics from rendering via a per-scene learned scene flow, which bends camera rays to align with moving content. It combines a pre-trained generalizable static rendering backbone (GNT) with a lightweight scene-flow field implemented on a permutohedral lattice, enabling dynamic scenes to be rendered with a static IBR pipeline. The method achieves substantial reductions in per-scene optimization time (about $1.5$ hours on a single GPU) while delivering competitive rendering quality on the Nvidia Dynamic Scenes Dataset, as validated against state-of-the-art baselines and supported by an extensive ablation study. This approach lowers hardware barriers for dynamic-view synthesis and opens avenues for faster, scalable monocular rendering in dynamic environments.
Abstract
We introduce FlowIBR, a novel approach for efficient monocular novel view synthesis of dynamic scenes. Existing techniques already show impressive rendering quality but tend to focus on optimization within a single scene without leveraging prior knowledge, resulting in long optimization times per scene. FlowIBR circumvents this limitation by integrating a neural image-based rendering method, pre-trained on a large corpus of widely available static scenes, with a per-scene optimized scene flow field. Utilizing this flow field, we bend the camera rays to counteract the scene dynamics, thereby presenting the dynamic scene as if it were static to the rendering network. The proposed method reduces per-scene optimization time by an order of magnitude, achieving comparable rendering quality to existing methods -- all on a single consumer-grade GPU.
