NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods
Qiang Qu, Hanxue Liang, Xiaoming Chen, Yuk Ying Chung, Yiran Shen
TL;DR
This work introduces NeRF-NQA, the first no-reference quality assessment framework designed for scenes generated by Neural View Synthesis and NeRF variants with dense viewpoints. It combines a viewwise module that captures spatial fidelity and inter-view consistency with a pointwise module that encodes angular quality through the Pointwise Normalized Spherical Gradient (PNSG) features, fused by an MLP to produce comprehensive quality scores. Across Fieldwork, LLFF, and Lab datasets, NeRF-NQA outperforms 23 baseline QA methods in RMSE, SRCC, PLCC, and OR, and demonstrates strong cross-dataset generalization across diverse NVS methods. Limitations include dependence on COLMAP-derived sparse points and focus on front-facing scenes; future work will broaden coverage to 360-degree content and additional NVS techniques. The method offers a practical, no-reference tool that aligns quality assessments more closely with human perception for densely-sampled NVS scenes.
Abstract
Neural View Synthesis (NVS) has demonstrated efficacy in generating high-fidelity dense viewpoint videos using a image set with sparse views. However, existing quality assessment methods like PSNR, SSIM, and LPIPS are not tailored for the scenes with dense viewpoints synthesized by NVS and NeRF variants, thus, they often fall short in capturing the perceptual quality, including spatial and angular aspects of NVS-synthesized scenes. Furthermore, the lack of dense ground truth views makes the full reference quality assessment on NVS-synthesized scenes challenging. For instance, datasets such as LLFF provide only sparse images, insufficient for complete full-reference assessments. To address the issues above, we propose NeRF-NQA, the first no-reference quality assessment method for densely-observed scenes synthesized from the NVS and NeRF variants. NeRF-NQA employs a joint quality assessment strategy, integrating both viewwise and pointwise approaches, to evaluate the quality of NVS-generated scenes. The viewwise approach assesses the spatial quality of each individual synthesized view and the overall inter-views consistency, while the pointwise approach focuses on the angular qualities of scene surface points and their compound inter-point quality. Extensive evaluations are conducted to compare NeRF-NQA with 23 mainstream visual quality assessment methods (from fields of image, video, and light-field assessment). The results demonstrate NeRF-NQA outperforms the existing assessment methods significantly and it shows substantial superiority on assessing NVS-synthesized scenes without references. An implementation of this paper are available at https://github.com/VincentQQu/NeRF-NQA.
