SoK: Honeypots & LLMs, More Than the Sum of Their Parts?

Robert A. Bridges; Thomas R. Mitchell; Mauricio Muñoz; Ted Henriksson

SoK: Honeypots & LLMs, More Than the Sum of Their Parts?

Robert A. Bridges, Thomas R. Mitchell, Mauricio Muñoz, Ted Henriksson

TL;DR

This SoK examines the intersection of honeypots and large language models, addressing the problem of achieving convincing deception without prohibitive risk. It synthesizes three strands—a taxonomy of honeypot detection vectors, emergent canonical architectures for LLM-powered honeypots, and the evolution of honeypot log analysis toward automated threat intelligence—into a cohesive framework and roadmap. Key findings include the persistence of fingerprinting weaknesses, a growing yet immature canonical architecture with modular components, and an evaluation paradigm hampered by a data desert and a lack of target humanoid adversaries, alongside nascent work on autonomous, real-time threat intelligence. The paper argues for open, modular tooling, autonomous feedback loops, and an adversarial research ecosystem to enable self-improving deception systems capable of countering intelligent automated attackers, with potential impact on SOC workflows and threat intelligence pipelines.

Abstract

The advent of Large Language Models (LLMs) promised to resolve the long-standing paradox in honeypot design, achieving high-fidelity deception with low operational risk. Through a flurry of research since late 2022, steady progress from ideation to prototype implementation is exhibited. Since late 2022, a flurry of research has demonstrated steady progress from ideation to prototype implementation. While promising, evaluations show only incremental progress in real-world deployments, and the field still lacks a cohesive understanding of the emerging architectural patterns, core challenges, and evaluation paradigms. To fill this gap, this Systematization of Knowledge (SoK) paper provides the first comprehensive overview and analysis of this new domain. We survey and systematize the field by focusing on three critical, intersecting research areas: first, we provide a taxonomy of honeypot detection vectors, structuring the core problems that LLM-based realism must solve; second, we synthesize the emerging literature on LLM-powered honeypots, identifying a canonical architecture and key evaluation trends; and third, we chart the evolutionary path of honeypot log analysis, from simple data reduction to automated intelligence generation. We synthesize these findings into a forward-looking research roadmap, arguing that the true potential of this technology lies in creating autonomous, self-improving deception systems to counter the emerging threat of intelligent, automated attackers.

SoK: Honeypots & LLMs, More Than the Sum of Their Parts?

TL;DR

Abstract

SoK: Honeypots & LLMs, More Than the Sum of Their Parts?

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (1)