MultiPlaneNeRF: Neural Radiance Field with Non-Trainable Representation
Dominik Zimny, Artur Kasymov, Adam Kania, Jacek Tabor, Maciej Zięba, Marcin Mazur, Przemysław Spurek
TL;DR
MultiPlaneNeRF addresses NeRF’s per-object training burden and limited generalization by replacing trainable 3D representations with fixed 2D image inputs and a small implicit decoder, learned over a large dataset. The approach achieves competitive view synthesis with far fewer trainable parameters and demonstrates generalization to unseen objects and cross-class interpolation, while enabling an interpretable GAN component (MultiPlaneGAN) for integration with broader generative models. The work highlights a practical path toward scalable, generalizable neural rendering by decoupling representation from the learnable decoder and leveraging fixed image bases. This offers a tractable alternative to full 3D supervision and opens avenues for efficient 3D-aware generation in complex pipelines.
Abstract
NeRF is a popular model that efficiently represents 3D objects from 2D images. However, vanilla NeRF has some important limitations. NeRF must be trained on each object separately. The training time is long since we encode the object's shape and color in neural network weights. Moreover, NeRF does not generalize well to unseen data. In this paper, we present MultiPlaneNeRF -- a model that simultaneously solves the above problems. Our model works directly on 2D images. We project 3D points on 2D images to produce non-trainable representations. The projection step is not parametrized and a very shallow decoder can efficiently process the representation. Furthermore, we can train MultiPlaneNeRF on a large data set and force our implicit decoder to generalize across many objects. Consequently, we can only replace the 2D images (without additional training) to produce a NeRF representation of the new object. In the experimental section, we demonstrate that MultiPlaneNeRF achieves results comparable to state-of-the-art models for synthesizing new views and has generalization properties. Additionally, MultiPlane decoder can be used as a component in large generative models like GANs.
