Knowledge-Informed Auto-Penetration Testing Based on Reinforcement Learning with Reward Machine
Yuanliang Li, Hanzheng Dai, Jun Yan
TL;DR
The paper tackles automated penetration testing (AutoPT) using reinforcement learning guided by domain knowledge expressed as reward machines (RMs). It introduces DRLRM-PT, which formulates PT as a POMDP and uses two RM designs to decompose lateral-movement tasks and assign phase-specific rewards; the policy is optimized with deep Q-learning with RM (DQRM). Empirical evaluation on CyberBattleSim shows that RM-guided agents learn faster and achieve better PT efficiency than baselines, with the RM that encodes more detailed domain knowledge (RM2) yielding the best performance. The work indicates that integrating cybersecurity knowledge bases into RL can improve sample efficiency, interpretability, and outcome quality, with practical implications for scalable, automated security testing.
Abstract
Automated penetration testing (AutoPT) based on reinforcement learning (RL) has proven its ability to improve the efficiency of vulnerability identification in information systems. However, RL-based PT encounters several challenges, including poor sampling efficiency, intricate reward specification, and limited interpretability. To address these issues, we propose a knowledge-informed AutoPT framework called DRLRM-PT, which leverages reward machines (RMs) to encode domain knowledge as guidelines for training a PT policy. In our study, we specifically focus on lateral movement as a PT case study and formulate it as a partially observable Markov decision process (POMDP) guided by RMs. We design two RMs based on the MITRE ATT&CK knowledge base for lateral movement. To solve the POMDP and optimize the PT policy, we employ the deep Q-learning algorithm with RM (DQRM). The experimental results demonstrate that the DQRM agent exhibits higher training efficiency in PT compared to agents without knowledge embedding. Moreover, RMs encoding more detailed domain knowledge achieve better PT performance than RMs encoding simpler knowledge.
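To make the idea of a reward machine concrete, the following is a minimal Python sketch of an RM as a finite-state machine whose transitions emit rewards. It is not the paper's RM1 or RM2: the state names (`u0`, `u1`, `u2`), the event labels (`credential_found`, `node_connected`), the reward values, and the helper `build_example_rm` are all hypothetical, chosen only to illustrate how a lateral-movement task could be split into phases with phase-specific rewards.

```python
class RewardMachine:
    """Minimal reward machine: a finite-state machine whose transitions emit rewards."""

    def __init__(self, initial_state, terminal_states, transitions, default_reward=0.0):
        self.initial_state = initial_state
        self.terminal_states = frozenset(terminal_states)
        # Maps (rm_state, event) -> (next_rm_state, reward).
        self.transitions = dict(transitions)
        self.default_reward = default_reward
        self.state = initial_state

    def reset(self):
        """Return the machine to its initial state."""
        self.state = self.initial_state
        return self.state

    def step(self, events):
        """Advance on the set of events detected after the agent's latest action.

        Returns (rm_state, reward, done). Events with no transition from the
        current state leave the state unchanged and yield the default reward.
        """
        for event in sorted(events):  # sorted for deterministic tie-breaking
            target = self.transitions.get((self.state, event))
            if target is not None:
                self.state, reward = target
                return self.state, reward, self.state in self.terminal_states
        return self.state, self.default_reward, self.state in self.terminal_states


def build_example_rm():
    """Hypothetical two-phase task: find a credential, then connect to a new node."""
    return RewardMachine(
        initial_state="u0",
        terminal_states={"u2"},
        transitions={
            ("u0", "credential_found"): ("u1", 1.0),  # phase 1 complete (assumed reward)
            ("u1", "node_connected"): ("u2", 5.0),    # phase 2 complete (assumed reward)
        },
    )


if __name__ == "__main__":
    rm = build_example_rm()
    for detected in [{"node_connected"}, {"credential_found"}, {"node_connected"}]:
        print(detected, "->", rm.step(detected))
```

In an RM-guided agent, the current RM state would typically be supplied to the learner alongside the environment observation, so that the policy can behave differently in each phase of the task. How DQRM uses the RM state in detail is defined in the paper itself and is not reproduced in this sketch.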
