Table of Contents
Fetching ...

More Questions than Answers? Lessons from Integrating Explainable AI into a Cyber-AI Tool

Ashley Suh, Harry Li, Caitlin Kenney, Kenneth Alperin, Steven R. Gomez

TL;DR

The paper investigates integrating Explainable AI into a cyber-operations workflow by applying SHAP and LIME to a source-code classifier and embedding the explanations in a decision-support tool. It finds that post-hoc, local explanations are often hard for non-experts to interpret, and disagreements between explanation methods can undermine trust. The authors argue for higher-level, domain-tailored visualizations and discuss potential remedies, including dialogue-assisted explanations and emerging LLMs, while warning about hallucinations. The work highlights practical gaps in XAI for cybersecurity and suggests directions to make explanations more usable in real-time operator workflows.

Abstract

We share observations and challenges from an ongoing effort to implement Explainable AI (XAI) in a domain-specific workflow for cybersecurity analysts. Specifically, we briefly describe a preliminary case study on the use of XAI for source code classification, where accurate assessment and timeliness are paramount. We find that the outputs of state-of-the-art saliency explanation techniques (e.g., SHAP or LIME) are lost in translation when interpreted by people with little AI expertise, despite these techniques being marketed for non-technical users. Moreover, we find that popular XAI techniques offer fewer insights for real-time human-AI workflows when they are post hoc and too localized in their explanations. Instead, we observe that cyber analysts need higher-level, easy-to-digest explanations that can offer as little disruption as possible to their workflows. We outline unaddressed gaps in practical and effective XAI, then touch on how emerging technologies like Large Language Models (LLMs) could mitigate these existing obstacles.

More Questions than Answers? Lessons from Integrating Explainable AI into a Cyber-AI Tool

TL;DR

The paper investigates integrating Explainable AI into a cyber-operations workflow by applying SHAP and LIME to a source-code classifier and embedding the explanations in a decision-support tool. It finds that post-hoc, local explanations are often hard for non-experts to interpret, and disagreements between explanation methods can undermine trust. The authors argue for higher-level, domain-tailored visualizations and discuss potential remedies, including dialogue-assisted explanations and emerging LLMs, while warning about hallucinations. The work highlights practical gaps in XAI for cybersecurity and suggests directions to make explanations more usable in real-time operator workflows.

Abstract

We share observations and challenges from an ongoing effort to implement Explainable AI (XAI) in a domain-specific workflow for cybersecurity analysts. Specifically, we briefly describe a preliminary case study on the use of XAI for source code classification, where accurate assessment and timeliness are paramount. We find that the outputs of state-of-the-art saliency explanation techniques (e.g., SHAP or LIME) are lost in translation when interpreted by people with little AI expertise, despite these techniques being marketed for non-technical users. Moreover, we find that popular XAI techniques offer fewer insights for real-time human-AI workflows when they are post hoc and too localized in their explanations. Instead, we observe that cyber analysts need higher-level, easy-to-digest explanations that can offer as little disruption as possible to their workflows. We outline unaddressed gaps in practical and effective XAI, then touch on how emerging technologies like Large Language Models (LLMs) could mitigate these existing obstacles.
Paper Structure (6 sections, 2 figures)

This paper contains 6 sections, 2 figures.

Figures (2)

  • Figure 1: Before presenting any saliency explanations, a table presents the classification confidence scores indicating how likely the source code file implements ML, or a subtype of ML.
  • Figure 2: Visual highlighting for a Python file based on SHAP and LIME explanations of the "implements ML?" decision.