Genos: General In-Network Unsupervised Intrusion Detection by Rule Extraction
Ruoyu Li, Qing Li, Yu Zhang, Dan Zhao, Xi Xiao, Yong Jiang
TL;DR
Genos tackles the need for high throughput, interpretable, unsupervised in-network intrusion detection by extracting axis aligned rules from any A-NIDS model and deploying them on programmable switches. Its divide-and-conquer rule extraction via a Score Clustering Tree and Decision Boundary Estimation enables faithful replication of the source model while supporting incremental updates that affect only subspaces. The framework comprises three control-plane modules Model Compiler, Model Interpreter, and Model Debugger, plus a data plane feature extractor implemented on a Tofino switch, achieving about 100 Gbps line rate with low latency. Experimental results on CIC-IDS and TON-IoT show high fidelity to source models, strong detection performance, effective interpretability, and efficient incremental updates, with a public code release for reproducibility.
Abstract
Anomaly-based network intrusion detection systems (A-NIDS) use unsupervised models to detect unforeseen attacks. However, existing A-NIDS solutions suffer from low throughput, lack of interpretability, and high maintenance costs. Recent in-network intelligence (INI) exploits programmable switches to offer line-rate deployment of NIDS. Nevertheless, current in-network NIDS are either model-specific or only apply to supervised models. In this paper, we propose Genos, a general in-network framework for unsupervised A-NIDS by rule extraction, which consists of a Model Compiler, a Model Interpreter, and a Model Debugger. Specifically, observing benign data are multimodal and usually located in multiple subspaces in the feature space, we utilize a divide-and-conquer approach for model-agnostic rule extraction. In the Model Compiler, we first propose a tree-based clustering algorithm to partition the feature space into subspaces, then design a decision boundary estimation mechanism to approximate the source model in each subspace. The Model Interpreter interprets predictions by important attributes to aid network operators in understanding the predictions. The Model Debugger conducts incremental updating to rectify errors by only fine-tuning rules on affected subspaces, thus reducing maintenance costs. We implement a prototype using physical hardware, and experiments demonstrate its superior performance of 100 Gbps throughput, great interpretability, and trivial updating overhead.
