SIAVC: Semi-Supervised Framework for Industrial Accident Video Classification
Zuoyong Li, Qinghua Lin, Haoyi Fan, Tiesong Zhao, David Zhang
TL;DR
This work tackles industrial accident video classification under limited labeled data by proposing SIAVC, a semi-supervised framework that combines a Super Augmentation Block (SAB) and a Video Cross-set Augmentation Module (VCAM). SAB re-augments well-learned strongly augmented samples using Gaussian noise and random masking guided by historical losses, while VCAM expands training data by interpolating high-confidence unlabeled samples with labeled ones to generate diverse pseudo-labels. The authors introduce the ECA9 dataset for hub-level express center accidents and demonstrate that SIAVC outperforms state-of-the-art semi-supervised methods on both ECA9 and Fire Detection benchmarks, with ablations confirming the benefits of SAB and VCAM. They also provide detailed implementation and evaluation, and plan to release the code and dataset to advance industrial safety analytics. Overall, SIAVC offers a practical, high-performance approach to semi-supervised video accident classification with real-world applicability and impact.
Abstract
Semi-supervised learning suffers from the imbalance of labeled and unlabeled training data in the video surveillance scenario. In this paper, we propose a new semi-supervised learning method called SIAVC for industrial accident video classification. Specifically, we design a video augmentation module called the Super Augmentation Block (SAB). SAB adds Gaussian noise and randomly masks video frames according to historical loss on the unlabeled data for model optimization. Then, we propose a Video Cross-set Augmentation Module (VCAM) to generate diverse pseudo-label samples from the high-confidence unlabeled samples, which alleviates the mismatch of sampling experience and provides high-quality training data. Additionally, we construct a new industrial accident surveillance video dataset with frame-level annotation, namely ECA9, to evaluate our proposed method. Compared with the state-of-the-art semi-supervised learning based methods, SIAVC demonstrates outstanding video classification performance, achieving 88.76\% and 89.13\% accuracy on ECA9 and Fire Detection datasets, respectively. The source code and the constructed dataset ECA9 will be released in \url{https://github.com/AlchemyEmperor/SIAVC}.
