FineVQ: Fine-Grained User Generated Content Video Quality Assessment
Huiyu Duan, Qiang Hu, Jiarui Wang, Liu Yang, Zitong Xu, Lu Liu, Xiongkuo Min, Chunlei Cai, Tianxiao Ye, Xiaoyun Zhang, Guangtao Zhai
TL;DR
This work addresses the need for fine-grained assessment of user-generated content videos by introducing FineVD, a large-scale dataset with multi-dimensional quality labels, and FineVQ, a one-for-all VQA framework built on large multimodal models. FineVQ leverages an image encoder, a motion encoder, and a large language model, augmented with instruction tuning and LoRA adaptation to produce quality rating, scoring, and attribution across multiple dimensions. Extensive experiments show state-of-the-art performance on FineVD and several UGC-VQA benchmarks, along with strong cross-dataset generalization and meaningful ablation insights into motion features and parameter-efficient fine-tuning. The work offers a practical platform for improved video processing and recommendation in UGC ecosystems by providing rich, actionable quality annotations and a capable, adaptable VQA system.
Abstract
The rapid growth of user-generated content (UGC) videos has produced an urgent need for effective video quality assessment (VQA) algorithms to monitor video quality and guide optimization and recommendation procedures. However, current VQA models generally only give an overall rating for a UGC video, which lacks fine-grained labels for serving video processing and recommendation applications. To address the challenges and promote the development of UGC videos, we establish the first large-scale Fine-grained Video quality assessment Database, termed FineVD, which comprises 6104 UGC videos with fine-grained quality scores and descriptions across multiple dimensions. Based on this database, we propose a Fine-grained Video Quality assessment (FineVQ) model to learn the fine-grained quality of UGC videos, with the capabilities of quality rating, quality scoring, and quality attribution. Extensive experimental results demonstrate that our proposed FineVQ can produce fine-grained video-quality results and achieve state-of-the-art performance on FineVD and other commonly used UGC-VQA datasets.
