Training Users Against Human and GPT-4 Generated Social Engineering Attacks
Tailia Malloy, Maria Jose Ferreira, Fei Fang, Cleotilde Gonzalez
TL;DR
Phishing threats increasingly leverage LLMs to craft realistic emails and styling, motivating an evaluation of training methods under AI-generated content. The authors compare human-written, GPT-4-written, and GPT-4-styled emails in a structured 2×2 experiment and propose an Instance-Based Learning (IBL) cognitive framework to predict learning and optimize training via embedding-based email selection. Key findings include significant style effects, an AI-writing bias that affects phishing judgments, and simulations showing that adaptive IBL-driven email selection yields larger learning gains than random training. The work demonstrates how cognitive modeling and AI-generated training data can inform safer cybersecurity education and guide future phishing-defense training designs.
Abstract
In real-world decision making, outcomes are often delayed, meaning individuals must make multiple decisions before receiving any feedback. Moreover, feedback can be presented in different ways: it may summarize the overall results of multiple decisions (aggregated feedback) or report the outcome of individual decisions after some delay (clustered feedback). Despite its importance, the timing and presentation of delayed feedback has received little attention in cognitive modeling of decision-making, which typically focuses on immediate feedback. To address this, we conducted an experiment to compare the effect of delayed vs. immediate feedback and aggregated vs. clustered feedback. We also propose a Hierarchical Instance-Based Learning (HIBL) model that captures how people make decisions in delayed feedback settings. HIBL uses a super-model that chooses between sub-models to perform the decision-making task until an outcome is observed. Simulations show that HIBL best predicts human behavior and specific patterns, demonstrating the flexibility of IBL models.
