Using LLMs for Tabletop Exercises within the Security Domain

Sam Hays; Jules White

Using LLMs for Tabletop Exercises within the Security Domain

Sam Hays, Jules White

TL;DR

The paper addresses the high cost and slow cadence of traditional security tabletop exercises and proposes using Large Language Models (LLMs) to streamline scenario generation, moderation, and retrospective analysis. It demonstrates how LLMs can generate and moderate live tabletop scenarios, provide iterative feedback, and support micro-tabletops that focus on specific domains for continuous improvement. Key contributions include explicit preparedness metrics with $P = (S + K + R + C + A + E)/P_{max}$, the preparedness delta $\Delta P = P_1 - P_2$, and the Unified Preparedness and Balance Score $UPBS = \alpha P_{avg} + \beta (1 - |\bar{\Delta P}|)$, plus practical methods for scenario generation and automated recommendations. The results indicate potential reductions in cost and planning time, higher exercise frequency, and more relevant security readiness outcomes through AI-assisted tabletop workflows.

Abstract

Tabletop exercises are a crucial component of many company's strategy to test and evaluate its preparedness for security incidents in a realistic way. Traditionally led by external firms specializing in cybersecurity, these exercises can be costly, time-consuming, and may not always align precisely with the client's specific needs. Large Language Models (LLMs) like ChatGPT offer a compelling alternative. They enable faster iteration, provide rich and adaptable simulations, and offer infinite patience in handling feedback and recommendations. This approach can enhances the efficiency and relevance of security preparedness exercises.

Using LLMs for Tabletop Exercises within the Security Domain

TL;DR

, the preparedness delta

, and the Unified Preparedness and Balance Score

, plus practical methods for scenario generation and automated recommendations. The results indicate potential reductions in cost and planning time, higher exercise frequency, and more relevant security readiness outcomes through AI-assisted tabletop workflows.

Abstract

Paper Structure (17 sections, 4 equations, 11 figures)

This paper contains 17 sections, 4 equations, 11 figures.

Abstract
Introduction
Background
Traditional Approach
Challenges With The Traditional Approach
Cost
Complexity of Planning
Team Preparedness
Benefits & Importance of Tabletop Exercises
Large Language Models Overview
A Brief Introduction to LLMs
Capabilities of LLMs for Tabletop Exercises
Scenario Generation
Retrospective and Recommendation
Micro-Tabletop
...and 2 more sections

Figures (11)

Figure 1: Tabletop Exercise: Common Workflow
Figure 2: Preparedness Equation
Figure 3: Preparedness Delta Equation
Figure 4: UPBS Score as Alpha Changes Across Different Team Configurations
Figure 5: UPBS Scores by Team Configuration and Selected $\alpha$ Values
...and 6 more figures

Using LLMs for Tabletop Exercises within the Security Domain

TL;DR

Abstract

Using LLMs for Tabletop Exercises within the Security Domain

Authors

TL;DR

Abstract

Table of Contents

Figures (11)