A Comprehensive Survey of Action Quality Assessment: Method and Benchmark
Kanglei Zhou, Ruizhi Cai, Liyuan Wang, Hubert P. H. Shum, Xiaohui Liang
TL;DR
This survey tackles fragmentation in Action Quality Assessment (AQA) by introducing a hierarchical taxonomy based on input modalities and by constructing a unified, open benchmark that integrates six datasets and seven evaluation metrics to compare both accuracy and computation. It analyzes over 150 papers to reveal how video-, skeleton-, and multi-modal approaches interrelate, and it discusses emerging trends, challenges, and future directions. The authors also highlight three task-specific AQA applications—semi-supervised, continual, and interpretable AQA—along with practical insights for cross-domain generalization and deployment. Overall, the work provides a standardized foundation to evaluate AQA methods, guiding future research toward robust, scalable, and interpretable action-quality assessment in diverse real-world contexts.
Abstract
Action Quality Assessment (AQA) quantitatively evaluates the quality of human actions, providing automated assessments that reduce biases in human judgment. Its applications span domains such as sports analysis, skill assessment, and medical care. Recent advances in AQA have introduced innovative methodologies, but similar methods often intertwine across different domains, highlighting the fragmented nature that hinders systematic reviews. In addition, the lack of a unified benchmark and limited computational comparisons hinder consistent evaluation and fair assessment of AQA approaches. In this work, we address these gaps by systematically analyzing over 150 AQA-related papers to develop a hierarchical taxonomy, construct a unified benchmark, and provide an in-depth analysis of current trends, challenges, and future directions. Our hierarchical taxonomy categorizes AQA methods based on input modalities (video, skeleton, multi-modal) and their specific characteristics, highlighting the evolution and interrelations across various approaches. To promote standardization, we present a unified benchmark, integrating diverse datasets to evaluate the assessment precision and computational efficiency. Finally, we review emerging task-specific applications and identify under-explored challenges in AQA, providing actionable insights into future research directions. This survey aims to deepen understanding of AQA progress, facilitate method comparison, and guide future innovations. The project web page can be found at https://ZhouKanglei.github.io/AQA-Survey.
