SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards
Yuan Gao, Jin Song
TL;DR
<3-5 sentence high-level summary> SA-BENCH introduces the first large-scale interior spatial aesthetics benchmark with four dimensions: layout, harmony, lighting, and distortion, enabling precise, MOS-based evaluation of interior images. The authors then present SA-IQA, a multimodal, expert-guided IQA model built via supervised fine-tuning on SA-BENCH and fused across dimensions with a Bradley–Terry objective to yield a calibrated overall score. Extensive experiments show SA-IQA outperforms traditional, DL-based, and commercial MLLMs in PLCC/SRCC across all dimensions and demonstrate practical value when used as a reward in GRPO-based reinforcement learning and as a Best-of-N re-ranking signal. The work provides open-source data and tools to advance domain-specific IQA for interior design and AI-generated imagery.
Abstract
In recent years, Image Quality Assessment (IQA) for AI-generated images (AIGI) has advanced rapidly; however, existing methods primarily target portraits and artistic images, lacking a systematic evaluation of interior scenes. We introduce Spatial Aesthetics, a paradigm that assesses the aesthetic quality of interior images along four dimensions: layout, harmony, lighting, and distortion. We construct SA-BENCH, the first benchmark for spatial aesthetics, comprising 18,000 images and 50,000 precise annotations. Employing SA-BENCH, we systematically evaluate current IQA methodologies and develop SA-IQA, through MLLM fine-tuning and a multidimensional fusion approach, as a comprehensive reward framework for assessing spatial aesthetics. We apply SA-IQA to two downstream tasks: (1) serving as a reward signal integrated with GRPO reinforcement learning to optimize the AIGC generation pipeline, and (2) Best-of-N selection to filter high-quality images and improve generation quality. Experiments indicate that SA-IQA significantly outperforms existing methods on SA-BENCH, setting a new standard for spatial aesthetics evaluation. Code and dataset will be open-sourced to advance research and applications in this domain.
