AgentBnB: A Browser-Based Cybersecurity Tabletop Exercise with Large Language Model Support and Retrieval-Aligned Scaffolding

Arman Anwar; Zefang Liu

AgentBnB: A Browser-Based Cybersecurity Tabletop Exercise with Large Language Model Support and Retrieval-Aligned Scaffolding

Arman Anwar, Zefang Liu

TL;DR

This paper tackles the rigidity and scalability challenges of traditional cybersecurity tabletop exercises by introducing AgentBnB, a browser-based platform that blends large language model teammates with a retrieval-augmented instructional copilot (C2D2) to provide Bloom-aligned, just-in-time scaffolding. Building on Backdoors & Breaches, AgentBnB preserves procedural fidelity while enabling human-in-the-loop collaboration and data-driven pedagogy, grounded in a structured RAG architecture. The system delivers a lightweight, repeatable training experience with an emphasis on adaptive guidance, persistent state, and telemetry for learning analytics. Initial pilot results with four graduate students indicate favorable perceptions of scalability and usefulness, though limitations such as sample size and corpus breadth call for broader studies and multi-player extensions to validate learning gains at scale.

Abstract

Traditional cybersecurity tabletop exercises (TTXs) provide valuable training but are often scripted, resource-intensive, and difficult to scale. We introduce AgentBnB, a browser-based re-imagining of the Backdoors & Breaches game that integrates large language model teammates with a Bloom-aligned, retrieval-augmented copilot (C2D2). The system expands a curated corpus into factual, conceptual, procedural, and metacognitive snippets, delivering on-demand, cognitively targeted hints. Prompt-engineered agents employ a scaffolding ladder that gradually fades as learner confidence grows. In a solo-player pilot with four graduate students, participants reported greater intention to use the agent-based version compared to the physical card deck and viewed it as more scalable, though a ceiling effect emerged on a simple knowledge quiz. Despite limitations of small sample size, single-player focus, and narrow corpus, these early findings suggest that large language model augmented TTXs can provide lightweight, repeatable practice without the logistical burden of traditional exercises. Planned extensions include multi-player modes, telemetry-driven coaching, and comparative studies with larger cohorts.

AgentBnB: A Browser-Based Cybersecurity Tabletop Exercise with Large Language Model Support and Retrieval-Aligned Scaffolding

TL;DR

Abstract

AgentBnB: A Browser-Based Cybersecurity Tabletop Exercise with Large Language Model Support and Retrieval-Aligned Scaffolding

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)