Table of Contents
Fetching ...

Cyri: A Conversational AI-based Assistant for Supporting the Human User in Detecting and Responding to Phishing Attacks

Antonio La Torre, Marco Angelini

TL;DR

Cyri tackles phishing by deploying a privacy-preserving, locally hosted AI assistant that analyzes email content through semantic features. It combines a three-component architecture (LISA, Thunderbird plugin, VAC) with prompts that enable sentence-level phishing reasoning and an interactive conversation for explanations. A 840-email validation set and a 10-participant user study show Cyri achieves strong detection of key phishing signals and high usability for both experts and non-experts, while highlighting areas for improvement in less distinctive features. The work demonstrates the practical impact of on-device, human-centric phishing analysis that integrates with existing email workflows to enhance awareness and action against phishing threats.

Abstract

This work introduces Cyri, an AI-powered conversational assistant designed to support a human user in detecting and analyzing phishing emails by leveraging Large Language Models. Cyri has been designed to scrutinize emails for semantic features used in phishing attacks, such as urgency, and undesirable consequences, using an approach that unifies features already established in the literature with others by Cyri features extraction methodology. Cyri can be directly plugged into a client mail or webmail, ensuring seamless integration with the user's email workflow while maintaining data privacy through local processing. By performing analyses on the user's machine, Cyri eliminates the need to transmit sensitive email data over the internet, reducing associated security risks. The Cyri user interface has been designed to reduce habituation effects and enhance user engagement. It employs dynamic visual cues and context-specific explanations to keep users alert and informed while using emails. Additionally, it allows users to explore identified malicious semantic features both through conversation with the agent and visual exploration, obtaining the advantages of both modalities for expert or non-expert users. It also allows users to keep track of the conversation, supports the user in solving additional questions on both computed features or new parts of the mail, and applies its detection on demand. To evaluate Cyri, we crafted a comprehensive dataset of 420 phishing emails and 420 legitimate emails. Results demonstrate high effectiveness in identifying critical phishing semantic features fundamental to phishing detection. A user study involving 10 participants, both experts and non-experts, evaluated Cyri's effectiveness and usability. Results indicated that Cyri significantly aided users in identifying phishing emails and enhanced their understanding of phishing tactics.

Cyri: A Conversational AI-based Assistant for Supporting the Human User in Detecting and Responding to Phishing Attacks

TL;DR

Cyri tackles phishing by deploying a privacy-preserving, locally hosted AI assistant that analyzes email content through semantic features. It combines a three-component architecture (LISA, Thunderbird plugin, VAC) with prompts that enable sentence-level phishing reasoning and an interactive conversation for explanations. A 840-email validation set and a 10-participant user study show Cyri achieves strong detection of key phishing signals and high usability for both experts and non-experts, while highlighting areas for improvement in less distinctive features. The work demonstrates the practical impact of on-device, human-centric phishing analysis that integrates with existing email workflows to enhance awareness and action against phishing threats.

Abstract

This work introduces Cyri, an AI-powered conversational assistant designed to support a human user in detecting and analyzing phishing emails by leveraging Large Language Models. Cyri has been designed to scrutinize emails for semantic features used in phishing attacks, such as urgency, and undesirable consequences, using an approach that unifies features already established in the literature with others by Cyri features extraction methodology. Cyri can be directly plugged into a client mail or webmail, ensuring seamless integration with the user's email workflow while maintaining data privacy through local processing. By performing analyses on the user's machine, Cyri eliminates the need to transmit sensitive email data over the internet, reducing associated security risks. The Cyri user interface has been designed to reduce habituation effects and enhance user engagement. It employs dynamic visual cues and context-specific explanations to keep users alert and informed while using emails. Additionally, it allows users to explore identified malicious semantic features both through conversation with the agent and visual exploration, obtaining the advantages of both modalities for expert or non-expert users. It also allows users to keep track of the conversation, supports the user in solving additional questions on both computed features or new parts of the mail, and applies its detection on demand. To evaluate Cyri, we crafted a comprehensive dataset of 420 phishing emails and 420 legitimate emails. Results demonstrate high effectiveness in identifying critical phishing semantic features fundamental to phishing detection. A user study involving 10 participants, both experts and non-experts, evaluated Cyri's effectiveness and usability. Results indicated that Cyri significantly aided users in identifying phishing emails and enhanced their understanding of phishing tactics.

Paper Structure

This paper contains 23 sections, 12 figures, 6 tables.

Figures (12)

  • Figure 1: Cyri email Plugin Example using the Thunderbird email client
  • Figure 2: Cyri Architecture and Data Flow for Email Analysis
  • Figure 3: User Interaction and Query Processing Flow
  • Figure 4: The VAC interface of Cyri in action: red background identifies a phishing mail, with semantic features highlighted in the email text (a) and the list below (b). Conversation with LISA happens on the right (c) through the query interface (d) or by audio (e)
  • Figure 5: The VAC interface of Cyri for a safe email
  • ...and 7 more figures