Mask and Compress: Efficient Skeleton-based Action Recognition in Continual Learning
Matteo Mosconi, Andriy Sorokin, Aniello Panariello, Angelo Porrello, Jacopo Bonato, Marco Cotogni, Luigi Sabetta, Simone Calderara, Rita Cucchiara
TL;DR
This work tackles continual learning for skeleton-based action recognition, addressing catastrophic forgetting in class-incremental settings. It introduces CHARON, a memory-efficient framework that compresses skeletal samples in a replay buffer via uniform sampling with interval $s$ and linear interpolation for reconstruction, and trains a masked encoder–decoder (inspired by masked autoencoders) to jointly optimize recognition and reconstruction, followed by a lightweight linear probing phase to align the classifier. The approach leverages a STTFormer backbone and a memory-replay objective with reconstruction and logits/labels losses, achieving state-of-the-art results on Split NTU-60 and Split NTU-120 skeleton datasets while reducing memory and compute. These contributions advance practical online HAR by enabling high performance under tight memory budgets and varied masking settings, with potential extensions to more aggressive masking regimes in the future.
Abstract
The use of skeletal data allows deep learning models to perform action recognition efficiently and effectively. Herein, we believe that exploring this problem within the context of Continual Learning is crucial. While numerous studies focus on skeleton-based action recognition from a traditional offline perspective, only a handful venture into online approaches. In this respect, we introduce CHARON (Continual Human Action Recognition On skeletoNs), which maintains consistent performance while operating within an efficient framework. Through techniques like uniform sampling, interpolation, and a memory-efficient training stage based on masking, we achieve improved recognition accuracy while minimizing computational overhead. Our experiments on Split NTU-60 and the proposed Split NTU-120 datasets demonstrate that CHARON sets a new benchmark in this domain. The code is available at https://github.com/Sperimental3/CHARON.
