Table of Contents
Fetching ...

Prompted Contextual Vectors for Spear-Phishing Detection

Daniel Nahmias, Gal Engelberg, Dan Klein, Asaf Shabtai

TL;DR

The paper addresses the challenge of detecting AI-assisted spear-phishing by introducing prompted contextual document vectors, produced by an ensemble of LLMs prompted to reason about targeted questions reflecting persuasion tactics. These vectors feed a binary classifier and demonstrate strong performance, achieving a 0.91 F1 on spear-phishing detection and robust generalization to related social-engineering threats, including smishing, under covariate drift. Key contributions include a novel LLM-based vectorization method, a publicly released spear-phishing dataset, and evidence that reasoning-enabled vectors align with an optimal feature space that resists concept drift. The work highlights practical implications for adversarial document classification and supports privacy-conscious deployment with discussion of limitations and future directions for automatic question generation and broader domain applicability.

Abstract

Spear-phishing attacks present a significant security challenge, with large language models (LLMs) escalating the threat by generating convincing emails and facilitating target reconnaissance. To address this, we propose a detection approach based on a novel document vectorization method that utilizes an ensemble of LLMs to create representation vectors. By prompting LLMs to reason and respond to human-crafted questions, we quantify the presence of common persuasion principles in the email's content, producing prompted contextual document vectors for a downstream supervised machine learning model. We evaluate our method using a unique dataset generated by a proprietary system that automates target reconnaissance and spear-phishing email creation. Our method achieves a 91\% F1 score in identifying LLM-generated spear-phishing emails, with the training set comprising only traditional phishing and benign emails. Key contributions include a novel document vectorization method utilizing LLM reasoning, a publicly available dataset of high-quality spear-phishing emails, and the demonstrated effectiveness of our method in detecting such emails. This methodology can be utilized for various document classification tasks, particularly in adversarial problem domains.

Prompted Contextual Vectors for Spear-Phishing Detection

TL;DR

The paper addresses the challenge of detecting AI-assisted spear-phishing by introducing prompted contextual document vectors, produced by an ensemble of LLMs prompted to reason about targeted questions reflecting persuasion tactics. These vectors feed a binary classifier and demonstrate strong performance, achieving a 0.91 F1 on spear-phishing detection and robust generalization to related social-engineering threats, including smishing, under covariate drift. Key contributions include a novel LLM-based vectorization method, a publicly released spear-phishing dataset, and evidence that reasoning-enabled vectors align with an optimal feature space that resists concept drift. The work highlights practical implications for adversarial document classification and supports privacy-conscious deployment with discussion of limitations and future directions for automatic question generation and broader domain applicability.

Abstract

Spear-phishing attacks present a significant security challenge, with large language models (LLMs) escalating the threat by generating convincing emails and facilitating target reconnaissance. To address this, we propose a detection approach based on a novel document vectorization method that utilizes an ensemble of LLMs to create representation vectors. By prompting LLMs to reason and respond to human-crafted questions, we quantify the presence of common persuasion principles in the email's content, producing prompted contextual document vectors for a downstream supervised machine learning model. We evaluate our method using a unique dataset generated by a proprietary system that automates target reconnaissance and spear-phishing email creation. Our method achieves a 91\% F1 score in identifying LLM-generated spear-phishing emails, with the training set comprising only traditional phishing and benign emails. Key contributions include a novel document vectorization method utilizing LLM reasoning, a publicly available dataset of high-quality spear-phishing emails, and the demonstrated effectiveness of our method in detecting such emails. This methodology can be utilized for various document classification tasks, particularly in adversarial problem domains.
Paper Structure (35 sections, 11 figures, 6 tables, 1 algorithm)

This paper contains 35 sections, 11 figures, 6 tables, 1 algorithm.

Figures (11)

  • Figure 1: Prompted contextual vectorization process.
  • Figure 2: Example of a spear-phishing email message generated by the system (PII blacked out).
  • Figure 3: t-SNE visualization - prompted contextual vectors.
  • Figure 4: t-SNE visualization - DistilBERT vectors.
  • Figure 5: Questions ablation study results.
  • ...and 6 more figures