ASSR-NeRF: Arbitrary-Scale Super-Resolution on Voxel Grid for High-Quality Radiance Fields Reconstruction
Ding-Jiun Huang, Zi-Ting Chou, Yu-Chiang Frank Wang, Cheng Sun
TL;DR
ASSR-NeRF tackles high-quality super-resolution novel view synthesis (SRNVS) from low-resolution (LR) training views by introducing a 3D arbitrary-scale super-resolution (SR) framework. It comprises a voxel-based distilled feature field that carries 2D SR priors into 3D space and a generalizable VoxelGridSR module that applies density-distance-aware self-attention to refine radiance fields at arbitrary scales. The approach enables multi-view consistent SR without requiring high-resolution (HR) reference views for each scene, and it is trained across multiple scenes so that it generalizes to unseen data. Experiments on Synthetic-NeRF and BlendedMVS show state-of-the-art SRNVS performance, with clear improvements in texture detail, edge sharpness, and geometric consistency over existing NeRF-based and image SR methods. The results also highlight the method's limitations in rendering speed and the need for robust multi-view benchmarks.
Abstract
NeRF-based methods reconstruct 3D scenes by building a radiance field with implicit or explicit representations. While NeRF-based methods can perform novel view synthesis (NVS) at arbitrary scale, high-resolution novel view synthesis (HRNVS) from a model optimized on low-resolution (LR) views often produces oversmoothed results. On the other hand, single-image super-resolution (SR) aims to enhance LR images to their high-resolution (HR) counterparts but lacks multi-view consistency. To address these challenges, we propose Arbitrary-Scale Super-Resolution NeRF (ASSR-NeRF), a novel framework for super-resolution novel view synthesis (SRNVS). We propose an attention-based VoxelGridSR model that directly performs 3D super-resolution on the optimized volume. Our model is trained on diverse scenes to ensure generalizability. For unseen scenes trained with LR views, we can then directly apply VoxelGridSR to further refine the volume and achieve multi-view consistent SR. We demonstrate quantitatively and qualitatively that the proposed method achieves strong performance in SRNVS.
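To make the idea of attention-based refinement on a voxel grid more concrete, here is a minimal sketch in plain Python. It is not the authors' VoxelGridSR implementation, and this summary does not specify how density and distance enter the attention. The sketch assumes one plausible form: a scaled dot-product similarity between the query feature and each neighboring voxel feature, biased by the neighbor's log-density and penalized by its squared distance to the query point. The names `refine_feature`, `dist_scale`, and `eps` are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def refine_feature(query_pos, query_feat, neighbors, dist_scale=1.0, eps=1e-6):
    """Aggregate neighboring voxel features for one continuous query point.

    neighbors: list of (position, feature, density) tuples, one per voxel.
    Assumed weighting (illustrative only, not taken from the paper):
        logit = <q, k> / sqrt(d) + log(density + eps) - dist_scale * ||p_q - p_k||^2
    """
    dim = len(query_feat)
    logits = []
    for pos, feat, density in neighbors:
        similarity = sum(q * k for q, k in zip(query_feat, feat)) / math.sqrt(dim)
        sq_dist = sum((a - b) ** 2 for a, b in zip(query_pos, pos))
        # Near-empty voxels (low density) and distant voxels are down-weighted.
        logits.append(similarity + math.log(density + eps) - dist_scale * sq_dist)
    weights = softmax(logits)
    refined = [
        sum(w * feat[i] for w, (_, feat, _) in zip(weights, neighbors))
        for i in range(dim)
    ]
    return refined, weights


# Usage: a query point midway between an occupied voxel and an empty one.
neighbors = [
    ((0.0, 0.0, 0.0), [1.0, 0.0], 5.0),  # occupied voxel
    ((1.0, 1.0, 1.0), [0.0, 1.0], 0.0),  # empty voxel
]
refined, weights = refine_feature((0.5, 0.5, 0.5), [1.0, 0.0], neighbors)
```

Because the query position is continuous, the same routine can be evaluated at any sampling density along a ray, which is the sense in which a refinement of this kind is independent of the output resolution. In this toy setting the empty voxel receives almost no weight, so the refined feature stays close to that of the occupied voxel.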
