Explainable Artificial Intelligence (XAI) for Malware Analysis: A Survey of Techniques, Applications, and Open Challenges

Harikha Manthena; Shaghayegh Shajarian; Jeffrey Kimmell; Mahmoud Abdelsalam; Sajad Khorsandroo; Maanak Gupta

Explainable Artificial Intelligence (XAI) for Malware Analysis: A Survey of Techniques, Applications, and Open Challenges

Harikha Manthena, Shaghayegh Shajarian, Jeffrey Kimmell, Mahmoud Abdelsalam, Sajad Khorsandroo, Maanak Gupta

TL;DR

This survey analyzes explainable ML in malware analysis, addressing the tension between detection accuracy and interpretability. It surveys a broad range of techniques, spanning transparent and opaque models, post-hoc explanations, and model-specific/global approaches, and organizes them around concrete malware domains (Windows PE, Android, PDF, Linux, hardware). Key contributions include a taxonomy of explainability methods, a consolidated review of XAI-enabled malware classification/detection across multiple platforms, and a discussion of open challenges such as dataset quality, hybrid analysis, and defense against adversarial threats. The work provides a structured resource to help researchers and practitioners bridge high detection performance with human-understandable explanations in cybersecurity.

Abstract

Machine learning (ML) has rapidly advanced in recent years, revolutionizing fields such as finance, medicine, and cybersecurity. In malware detection, ML-based approaches have demonstrated high accuracy; however, their lack of transparency poses a significant challenge. Traditional black-box models often fail to provide interpretable justifications for their predictions, limiting their adoption in security-critical environments where understanding the reasoning behind a detection is essential for threat mitigation and response. Explainable AI (XAI) addresses this gap by enhancing model interpretability while maintaining strong detection capabilities. This survey presents a comprehensive review of state-of-the-art ML techniques for malware analysis, with a specific focus on explainability methods. We examine existing XAI frameworks, their application in malware classification and detection, and the challenges associated with making malware detection models more interpretable. Additionally, we explore recent advancements and highlight open research challenges in the field of explainable malware analysis. By providing a structured overview of XAI-driven malware detection approaches, this survey serves as a valuable resource for researchers and practitioners seeking to bridge the gap between ML performance and explainability in cybersecurity.

Explainable Artificial Intelligence (XAI) for Malware Analysis: A Survey of Techniques, Applications, and Open Challenges

TL;DR

Abstract

Paper Structure (48 sections, 4 equations, 6 figures, 4 tables)

This paper contains 48 sections, 4 equations, 6 figures, 4 tables.

Introduction
Malware Detection Approaches
File Classification Approach
Static Analysis
Dynamic Analysis
Hybrid Analysis
Online Malware Detection
Explainability in Machine Learning
Transparent Machine-Learning Models
Rule-Based Models
Regression Models
Decision Trees
Bayesian Models
k-Nearest Neighbour
Generalized Linear and Additive Models
...and 33 more sections

Figures (6)

Figure 1: Machine Learning-Based File Classification Techniques
Figure 2: Explainable Machine Learning Concept
Figure 3: Taxonomy for Explainable Machine Learning Techniques Inspired by belle2021principles, chou2022counterfactuals and arrieta2020explainable.
Figure 4: Explainable Malware Classification and Detection Approaches
Figure 5: Explainable Hardware Malware generation workflow pan2020hardware.
...and 1 more figures

Explainable Artificial Intelligence (XAI) for Malware Analysis: A Survey of Techniques, Applications, and Open Challenges

TL;DR

Abstract

Explainable Artificial Intelligence (XAI) for Malware Analysis: A Survey of Techniques, Applications, and Open Challenges

Authors

TL;DR

Abstract

Table of Contents

Figures (6)