Accurate and Scalable Detection and Investigation of Cyber Persistence Threats
Qi Liu, Muhammad Shoaib, Mati Ur Rehman, Kaibin Bao, Veit Hagenmeyer, Wajih Ul Hassan
TL;DR
This work addresses the challenge of detecting cyber persistence threats within Advanced Persistent Threats by reframing persistence as a two-phase process: a persistence setup and a subsequent persistence execution. It introduces Cyber Persistence Detector (CPD), which uses provenance analytics to connect these phases via pseudo-dependency edges and augments tracing with expert-guided edges, complemented by an alert triage mechanism. Empirical evaluation on public datasets and MITRE emulation plans shows CPD achieves substantial false-positive reduction (averaging 93%), accurate persistence attack graphs, and low runtime overhead. The approach provides explainable, scalable persistence detection that leverages MITRE ATT&CK semantics and is adaptable to enterprise logging infrastructures.
Abstract
In Advanced Persistent Threat (APT) attacks, achieving stealthy persistence within target systems is often crucial for an attacker's success. This persistence allows adversaries to maintain prolonged access, often evading detection mechanisms. Recognizing its pivotal role in the APT lifecycle, this paper introduces Cyber Persistence Detector (CPD), a novel system dedicated to detecting cyber persistence through provenance analytics. CPD is founded on the insight that persistent operations typically manifest in two phases: the "persistence setup" and the subsequent "persistence execution". By causally relating these phases, we enhance our ability to detect persistent threats. First, CPD discerns setups signaling an impending persistent threat and then traces processes linked to remote connections to identify persistence execution activities. A key feature of our system is the introduction of pseudo-dependency edges (pseudo-edges), which effectively connect these disjoint phases using data provenance analysis, and expert-guided edges, which enable faster tracing and reduced log size. These edges empower us to detect persistence threats accurately and efficiently. Moreover, we propose a novel alert triage algorithm that further reduces false positives associated with persistence threats. Evaluations conducted on well-known datasets demonstrate that our system reduces the average false positive rate by 93% compared to state-of-the-art methods.
