PHANTOM: Progressive High-fidelity Adversarial Network for Threat Object Modeling
Jamal Al-Karaki, Muhammad Al-Zafar Khan, Rand Derar Mohammad Al Athamneh
TL;DR
The paper tackles the scarcity of labeled cyberattack data by proposing PHANTOM, a progressive, dual-path VAE-GAN framework with domain-specific feature matching to generate high-fidelity synthetic cyberattack samples. MAV-PFM enables stable reconstruction and high-fidelity generation across multiple resolutions, preserving temporal causality and behavioral semantics. On a 100,000-sample synthetic dataset spanning five attack types, models trained on PHANTOM data achieve near real-world performance, though rare attack types remain challenging due to severe class imbalance. The work offers a privacy-preserving data augmentation approach that can bolster intrusion detection while enabling controlled experimentation and benchmarking of synthetic data methods in cybersecurity.
Abstract
The scarcity of cyberattack data hinders the development of robust intrusion detection systems. This paper introduces PHANTOM, a novel adversarial variational framework for generating high-fidelity synthetic attack data. Its innovations include progressive training, a dual-path VAE-GAN architecture, and domain-specific feature matching to preserve the semantics of attacks. Evaluated on 100,000 network traffic samples, models trained on PHANTOM data achieve 98% weighted accuracy on real attacks. Statistical analyses confirm that the synthetic data preserves authentic distributions and diversity. Limitations in generating rare attack types are noted, highlighting challenges with severe class imbalance. This work advances the generation of synthetic data for training robust, privacy-preserving detection systems.
