IReNe: Instant Recoloring of Neural Radiance Fields
Alessio Mazzucchelli, Adrian Garcia-Garcia, Elena Garces, Fernando Rivas-Manzaneque, Francesc Moreno-Noguer, Adrian Penate-Sanchez
TL;DR
IReNe addresses the challenge of editing color in Neural Radiance Fields with near real-time feedback by retraining only the last layer of the color MLP and introducing a trainable 3D segmentation to constrain edits to targeted regions. A key contribution is automatically classifying last-layer neurons into view-dependent and diffuse types, freezing the former to preserve view-dependent shading while fine-tuning the latter to propagate color edits consistently across views. A lightweight 3D segmentation module enables boundary-aware edits while maintaining speed, with convergence typically under 5 seconds. The approach yields significant speedups (5x–500x) and improved boundary fidelity over state-of-the-art recoloring methods, demonstrated on a new dataset with edited colors across multiple NeRF scenes, enabling interactive editing pipelines.
Abstract
Advances in NERFs have allowed for 3D scene reconstructions and novel view synthesis. Yet, efficiently editing these representations while retaining photorealism is an emerging challenge. Recent methods face three primary limitations: they're slow for interactive use, lack precision at object boundaries, and struggle to ensure multi-view consistency. We introduce IReNe to address these limitations, enabling swift, near real-time color editing in NeRF. Leveraging a pre-trained NeRF model and a single training image with user-applied color edits, IReNe swiftly adjusts network parameters in seconds. This adjustment allows the model to generate new scene views, accurately representing the color changes from the training image while also controlling object boundaries and view-specific effects. Object boundary control is achieved by integrating a trainable segmentation module into the model. The process gains efficiency by retraining only the weights of the last network layer. We observed that neurons in this layer can be classified into those responsible for view-dependent appearance and those contributing to diffuse appearance. We introduce an automated classification approach to identify these neuron types and exclusively fine-tune the weights of the diffuse neurons. This further accelerates training and ensures consistent color edits across different views. A thorough validation on a new dataset, with edited object colors, shows significant quantitative and qualitative advancements over competitors, accelerating speeds by 5x to 500x.
