Invertible Neural Warp for NeRF
Shin-Fang Chng, Ravi Garg, Hemanth Saratchandran, Simon Lucey
TL;DR
This work tackles the challenging problem of jointly optimizing camera poses and NeRF by moving away from explicit SE(3) pose parameterizations to an overparameterized, invertible ray-warp representation. It introduces an explicit Invertible Neural Network (INN) to model rigid ray transformations, coupled with a geometry-informed rigidity prior to preserve bijectivity and guide optimization. Across 2D planar tests, LLFF forward-facing scenes, and 360° DTU data, the INN-based approach yields substantial pose-accuracy gains (often exceeding 50% relative to SE(3)-based baselines) and improved high-fidelity reconstructions, outperforming BARF and L2G baselines. The results demonstrate that enforcing invertibility and leveraging homeomorphisms in warp representations can significantly enhance convergence and robustness in joint NeRF pose estimation and view synthesis, with implications for more reliable 3D reconstruction in challenging settings.
Abstract
This paper tackles the simultaneous optimization of pose and Neural Radiance Fields (NeRF). Departing from the conventional practice of using explicit global representations for camera pose, we propose a novel overparameterized representation that models camera poses as learnable rigid warp functions. We establish that modeling the rigid warps must be tightly coupled with constraints and regularization imposed. Specifically, we highlight the critical importance of enforcing invertibility when learning rigid warp functions via neural network and propose the use of an Invertible Neural Network (INN) coupled with a geometry-informed constraint for this purpose. We present results on synthetic and real-world datasets, and demonstrate that our approach outperforms existing baselines in terms of pose estimation and high-fidelity reconstruction due to enhanced optimization convergence.
