MUGSQA: Novel Multi-Uncertainty-Based Gaussian Splatting Quality Assessment Method, Dataset, and Benchmarks
Tianang Chen, Jian Jin, Shilv Cai, Zhuangzi Li, Weisi Lin
TL;DR
This work addresses the perceptual quality evaluation gap for Gaussian Splatting (GS) reconstructions under input uncertainties by introducing a unified multi-distance subjective quality assessment (SQA) framework and the large-scale MUGSQA dataset. It combines synthetic, uncertainty-rich data generation with a crowdsourced subjective study across multiple viewing distances, enabling robust MOS estimates and two benchmarks: GS-based reconstruction robustness and metric evaluation. Analyses show current 2D IQA metrics inadequately reflect GS distortions, while certain GS methods (e.g., Mip-Splatting) demonstrate superior robustness; the study highlights the need for developing GS-specific quality metrics and distortion analyses. Overall, MUGSQA and the accompanying benchmarks provide a path toward standardized GSQA assessment with practical implications for fair comparison and method improvement in GS-based 3D reconstruction, with dataset and code to be released soon.
Abstract
Gaussian Splatting (GS) has recently emerged as a promising technique for 3D object reconstruction, delivering high-quality rendering results with significantly improved reconstruction speed. As variants continue to appear, assessing the perceptual quality of 3D objects reconstructed with different GS-based methods remains an open challenge. To address this issue, we first propose a unified multi-distance subjective quality assessment method that closely mimics human viewing behavior for objects reconstructed with GS-based methods in actual applications, thereby better collecting perceptual experiences. Based on it, we also construct a novel GS quality assessment dataset named MUGSQA, which is constructed considering multiple uncertainties of the input data. These uncertainties include the quantity and resolution of input views, the view distance, and the accuracy of the initial point cloud. Moreover, we construct two benchmarks: one to evaluate the robustness of various GS-based reconstruction methods under multiple uncertainties, and the other to evaluate the performance of existing quality assessment metrics. Our dataset and benchmark code will be released soon.
