Drantal-NeRF: Diffusion-Based Restoration for Anti-aliasing Neural Radiance Field
Ganlin Yang, Kaidong Zhang, Jingjing Fu, Dong Liu
TL;DR
Drantal-NeRF addresses aliasing in Neural Radiance Field renderings by introducing a diffusion-based restoration pipeline. It employs a two-stage training strategy: first finetuning a pretrained diffusion model conditioned on aliased NeRF outputs while training NeRF, then refining with a controllable feature-wrapping module and adversarial VAE-decoder training to enforce high-fidelity, multi-view-consistent restorations, all in a NeRF-agnostic framework. The approach yields substantial improvements on large-scale urban (MatrixCity) and unbounded 360-degree (MipNeRF-360) datasets, evidenced by quantitative gains in $PSNR$, $SSIM$, and $LPIPS$ and by crisper, more texture-rich visuals. Overall, Drantal demonstrates that diffusion priors can serve as a robust, general post-processing tool for anti-aliasing in NeRF backbones, potentially guiding future restoration-oriented 3D pipelines.
Abstract
Aliasing artifacts in renderings produced by Neural Radiance Field (NeRF) is a long-standing but complex issue in the field of 3D implicit representation, which arises from a multitude of intricate causes and was mitigated by designing more advanced but complex scene parameterization methods before. In this paper, we present a Diffusion-based restoration method for anti-aliasing Neural Radiance Field (Drantal-NeRF). We consider the anti-aliasing issue from a low-level restoration perspective by viewing aliasing artifacts as a kind of degradation model added to clean ground truths. By leveraging the powerful prior knowledge encapsulated in diffusion model, we could restore the high-realism anti-aliasing renderings conditioned on aliased low-quality counterparts. We further employ a feature-wrapping operation to ensure multi-view restoration consistency and finetune the VAE decoder to better adapt to the scene-specific data distribution. Our proposed method is easy to implement and agnostic to various NeRF backbones. We conduct extensive experiments on challenging large-scale urban scenes as well as unbounded 360-degree scenes and achieve substantial qualitative and quantitative improvements.
