Zero-Shot Spam Email Classification Using Pre-trained Large Language Models

Sergio Rojas-Galeano

Zero-Shot Spam Email Classification Using Pre-trained Large Language Models

Sergio Rojas-Galeano

TL;DR

Initial findings on a single dataset suggest the potential for classification pipelines of LLM-based subtasks (e.g., summarisation and classification), but further validation on diverse datasets is necessary.

Abstract

This paper investigates the application of pre-trained large language models (LLMs) for spam email classification using zero-shot prompting. We evaluate the performance of both open-source (Flan-T5) and proprietary LLMs (ChatGPT, GPT-4) on the well-known SpamAssassin dataset. Two classification approaches are explored: (1) truncated raw content from email subject and body, and (2) classification based on summaries generated by ChatGPT. Our empirical analysis, leveraging the entire dataset for evaluation without further training, reveals promising results. Flan-T5 achieves a 90% F1-score on the truncated content approach, while GPT-4 reaches a 95% F1-score using summaries. While these initial findings on a single dataset suggest the potential for classification pipelines of LLM-based subtasks (e.g., summarisation and classification), further validation on diverse datasets is necessary. The high operational costs of proprietary models, coupled with the general inference costs of LLMs, could significantly hinder real-world deployment for spam filtering.

Zero-Shot Spam Email Classification Using Pre-trained Large Language Models

TL;DR

Abstract

Paper Structure (12 sections, 1 figure, 3 tables)

This paper contains 12 sections, 1 figure, 3 tables.

Introduction
Methods
Large Language Models
Zero-shot learning
Classification Scenarios
Prompt design
Performance Metrics
Results
Prediction from Raw Content
Prediction from Summary
Discussion
Conclusion

Figures (1)

Figure 1: Schematic of two zero-shot LLM approaches for spam email classification: (a) Prediction from raw email content with truncation; (b) Prediction from a ChatGPT-generated email summary. Both approaches utilise three LLMs (ChatGPT, GPT-4, and FLAN-T5) for independent classification.

Zero-Shot Spam Email Classification Using Pre-trained Large Language Models

TL;DR

Abstract

Zero-Shot Spam Email Classification Using Pre-trained Large Language Models

Authors

TL;DR

Abstract

Table of Contents

Figures (1)