Continual Action Quality Assessment via Adaptive Manifold-Aligned Graph Regularization
Kanglei Zhou, Qingyi Pan, Xingxing Zhang, Hubert P. H. Shum, Frederick W. B. Li, Xiaohui Liang, Liyuan Wang
TL;DR
This paper formulates Continual AQA (CAQA) to address non-stationary score distributions in action quality assessment and shows that full-parameter fine-tuning (FPFT) is necessary but prone to overfitting and feature-manifold drift. It introduces MAGR++, a principled CL framework combining layer-adaptive FPFT, a Manifold Projector, and an Intra-Inter-Joint Graph Regularizer to enable robust continual regression with feature replay. The authors provide a theoretical forgetting bound and demonstrate state-of-the-art performance on four CAQA benchmarks across three datasets, with offline SRCC gains of about 3.6% and online gains of about 12.2% on average. The work offers practical, memory-efficient strategies for adapting AQA models to evolving distributions and lays groundwork for broader continual learning in fine-grained video understanding, including potential multi-modal extensions and real-time deployment.
Abstract
Action Quality Assessment (AQA) quantifies human actions in videos, supporting applications in sports scoring, rehabilitation, and skill evaluation. A major challenge lies in the non-stationary nature of quality distributions in real-world scenarios, which limits the generalization ability of conventional methods. We introduce Continual AQA (CAQA), which equips AQA with Continual Learning (CL) capabilities to handle evolving distributions while mitigating catastrophic forgetting. Although parameter-efficient fine-tuning of pretrained models has shown promise in CL for image classification, we find it insufficient for CAQA. Our empirical and theoretical analyses reveal two insights: (i) Full-Parameter Fine-Tuning (FPFT) is necessary for effective representation learning; yet (ii) uncontrolled FPFT induces overfitting and feature manifold shift, thereby aggravating forgetting. To address this, we propose Adaptive Manifold-Aligned Graph Regularization (MAGR++), which couples backbone fine-tuning that stabilizes shallow layers while adapting deeper ones with a two-step feature rectification pipeline: a manifold projector to translate deviated historical features into the current representation space, and a graph regularizer to align local and global distributions. We construct four CAQA benchmarks from three datasets with tailored evaluation protocols and strong baselines, enabling systematic cross-dataset comparison. Extensive experiments show that MAGR++ achieves state-of-the-art performance, with average correlation gains of 3.6% offline and 12.2% online over the strongest baseline, confirming its robustness and effectiveness. Our code is available at https://github.com/ZhouKanglei/MAGRPP.
