Mask-Based Window-Level Insider Threat Detection for Campaign Discovery
Jericho Cain, Hayden Beadles
TL;DR
This paper tackles unsupervised window-level insider threat detection in sparse enterprise audit data and its extension to campaign discovery. It introduces a dual-channel mask-value convolutional autoencoder that separately models activity presence and magnitude, yielding stronger window-level precision–recall performance than standard baselines, with a reported PR-AUC around $0.71$ and potential zero-false-alarm operating points. The authors demonstrate that campaign detection can be effectively achieved by sparse aggregation of high-confidence window-level scores, using simple top-$k$ pooling over six-day sequences to attain PR-AUCs up to approximately $0.835$ (ROC-AUC ≈ $0.938$) without explicit temporal trajectory modeling. Collectively, the work provides a practical, high-precision two-stage approach for insider threat monitoring: precise window-level alerts complemented by scalable campaign discovery through aggregation, with robust performance across multiple attack scenarios on the CERT r4.2 dataset.
Abstract
User and Entity Behavior Analytics (UEBA) systems commonly detect insider threats by scoring fixed time windows of user activity for anomalous behavior. While this window-level paradigm has proven effective for identifying sharp behavioral deviations, it remains unclear how much information about longer-running attack campaigns is already present within individual windows, and how such information can be leveraged for campaign discovery. In this work, we study unsupervised window-level insider threat detection on the CERT r4.2 dataset and show that explicitly separating activity presence from activity magnitude yields substantial performance gains. We introduce a dual-channel convolutional autoencoder that reconstructs both a binary activity mask and corresponding activity values, allowing the model to focus representational capacity on sparse behavioral structure rather than dense inactive baselines. Across multiday attack campaigns lasting between one and seven days, the proposed approach achieves a window-level precision-recall AUC of 0.71, substantially exceeding standard unsupervised autoencoder baselines and enabling high-precision operating points with zero false alarms.
