UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection

Yang Xiao; Rohan Kumar Das

UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection

Yang Xiao, Rohan Kumar Das

TL;DR

This paper tackles the problem of evolving sound event detection (SED) systems that must incrementally learn new sound classes without retraining from scratch. It proposes UCIL, an unsupervised class incremental learning framework that isolates learning of new classes (independent learning), preserves past knowledge through two distillation losses $L_{dis}^P$ and $L_{dis}^F$, and leverages unlabeled data via sample selection and a balanced rehearsal memory, with a learning objective that includes $L_{cls}$, $L_{dis}^P$, $L_{dis}^F$, and $L_{dis}^U$. The method is evaluated on the DCASE 2023 Task 4A dataset under two-task and four-task settings, showing competitive PSDS1/PSDS2 scores and clear gains over baseline continual learning approaches, particularly for reducing class confusion (PSDS2) as the number of tasks increases. These results demonstrate that UCIL can maintain detection performance while expanding its repertoire of sound events, offering a practical path toward real-world, dynamic SED systems that learn from both labeled and unlabeled data.

Abstract

This work explores class-incremental learning (CIL) for sound event detection (SED), advancing adaptability towards real-world scenarios. CIL's success in domains like computer vision inspired our SED-tailored method, addressing the unique challenges of diverse and complex audio environments. Our approach employs an independent unsupervised learning framework with a distillation loss function to integrate new sound classes while preserving the SED model consistency across incremental tasks. We further enhance this framework with a sample selection strategy for unlabeled data and a balanced exemplar update mechanism, ensuring varied and illustrative sound representations. Evaluating various continual learning methods on the DCASE 2023 Task 4 dataset, we find that our research offers insights into each method's applicability for real-world SED systems that can have newly added sound classes. The findings also delineate future directions of CIL in dynamic audio settings.

UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection

TL;DR

and

, and leverages unlabeled data via sample selection and a balanced rehearsal memory, with a learning objective that includes

, and

. The method is evaluated on the DCASE 2023 Task 4A dataset under two-task and four-task settings, showing competitive PSDS1/PSDS2 scores and clear gains over baseline continual learning approaches, particularly for reducing class confusion (PSDS2) as the number of tasks increases. These results demonstrate that UCIL can maintain detection performance while expanding its repertoire of sound events, offering a practical path toward real-world, dynamic SED systems that learn from both labeled and unlabeled data.

Abstract

Paper Structure (15 sections, 4 equations, 1 figure, 3 tables)

This paper contains 15 sections, 4 equations, 1 figure, 3 tables.

Introduction
Class Incremental Learning
Class Incremental Learning for SED
Proposed UCIL Method
Independent Learning to Update Model by New Data
Knowledge Distillation from Existing to New
Unsupervised Learning with Sample Selection
Balanced Memory Update Method
Experiment Setting
Dataset, Task Setting and Performance Metric
Implementation Details and Reference Baselines
Results and Analysis
Results for Two-task and Four-Task Settings
Ablation Study
Conclusion

Figures (1)

Figure 1: Block diagram of the proposed UCIL approach for task $t_i$. All training data for task $t_i$ include the new data $D_i$, the rehearsal data $x^e_i$, and the unlabeled data $x^u_i$ as the input. $o_{ext}$, $o_{cur}$, and $o_{u}$ present the prediction of the model $M_i$ for the classes of the existing task, classes of the current learning task, and the unlabeled data.

UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection

TL;DR

Abstract

UCIL: An Unsupervised Class Incremental Learning Approach for Sound Event Detection

Authors

TL;DR

Abstract

Table of Contents

Figures (1)