Influence of Video Dynamics on EEG-based Single-Trial Video Target Surveillance System
Heon-Gyu Kwak, Sung-Jin Kim, Hyeon-Taek Han, Ji-Hoon Jeong, Seong-Whan Lee
TL;DR
The paper addresses the challenge of limited hostile-target data in video surveillance by proposing an EEG-based single-trial target detection framework to complement computer-vision systems. It introduces a hierarchical DeepConvNet architecture with three-class outputs (non-target, true-target, error-target), trained with data augmentation and subject-specific calibration. In online asynchronous experiments, it achieves a mean macro F-beta of 0.6522 on Video1, while performance drops for videos with dynamic camera movement and weather, suggesting reliance on passive visual features driven by stimulus dynamics. ERP analysis shows no strong discriminative temporal patterns, and saliency maps point to central/occipital channels, indicating the model leverages visual-perception cues; the study highlights the need for careful stimulus design to realize robust EEG-based surveillance augmentation.
Abstract
Target detection models are one of the widely used deep learning-based applications for reducing human efforts on video surveillance and patrol. However, the application of conventional computer vision-based target detection models in military usage can result in limited performance, due to the lack of sample data of hostile targets. In this paper, we present the possibility of the electroencephalography-based video target detection model, which could be applied as a supportive module of the military video surveillance system. The proposed framework and detection model showed prospective performance achieving a mean macro F-beta of 0.6522 with asynchronous real-time data from five subjects, in a certain video stimulus, but not on some video stimuli. By analyzing the results of experiments using each video stimulus, we present the factors that would affect the performance of electroencephalography-based video target detection models.
