Data-Free Class-Incremental Gesture Recognition with Prototype-Guided Pseudo Feature Replay
Hongsong Wang, Ao Sun, Jie Gui, Liang Wang
TL;DR
This paper tackles data-free class-incremental gesture recognition, addressing privacy and memory constraints while mitigating catastrophic forgetting. It introduces the Prototype-Guided Pseudo Feature Replay (PGPFR) framework, comprising PFGBP for online pseudo-feature generation with batch prototypes, Variational Prototype Replay (VPR) to align old-class prototypes with classifier weights using covariances, Truncated Cross-Entropy (TCE) to handle domain differences for new classes, and Continual Classifier Re-Training (CCRT) to stabilize features by keeping the backbone fixed. The approach achieves state-of-the-art results on SHREC 2017 3D and EgoGesture 3D, with substantial gains in mean Global Accuracy and reductions in IFM compared to data-free baselines, while maintaining data privacy and low spatial complexity. These findings advance practical data-free continual learning for gesture recognition and highlight the importance of prototype-aware replay and dedicated new-class optimization for robust, scalable open-set recognition in 3D gesture domains.
Abstract
Gesture recognition is an important research area in the field of computer vision. Most gesture recognition efforts focus on close-set scenarios, thereby limiting the capacity to effectively handle unseen or novel gestures. We aim to address class-incremental gesture recognition, which entails the ability to accommodate new and previously unseen gestures over time. Specifically, we introduce a Prototype-Guided Pseudo Feature Replay (PGPFR) framework for data-free class-incremental gesture recognition. This framework comprises four components: Pseudo Feature Generation with Batch Prototypes (PFGBP), Variational Prototype Replay (VPR) for old classes, Truncated Cross-Entropy (TCE) for new classes, and Continual Classifier Re-Training (CCRT). To tackle the issue of catastrophic forgetting, the PFGBP dynamically generates a diversity of pseudo features in an online manner, leveraging class prototypes of old classes along with batch class prototypes of new classes. Furthermore, the VPR enforces consistency between the classifier's weights and the prototypes of old classes, leveraging class prototypes and covariance matrices to enhance robustness and generalization capabilities. The TCE mitigates the impact of domain differences of the classifier caused by pseudo features. Finally, the CCRT training strategy is designed to prevent overfitting to new classes and ensure the stability of features extracted from old classes. Extensive experiments conducted on two widely used gesture recognition datasets, namely SHREC 2017 3D and EgoGesture 3D, demonstrate that our approach outperforms existing state-of-the-art methods by 11.8\% and 12.8\% in terms of mean global accuracy, respectively. The code is available on https://github.com/sunao-101/PGPFR-3/.
