Table of Contents
Fetching ...

Exploring the Robustness of AI-Driven Tools in Digital Forensics: A Preliminary Study

Silvia Lucia Sanna, Leonardo Regano, Davide Maiorca, Giorgio Giacinto

TL;DR

This paper assesses how AI-driven digital forensics tools perform under adversarial conditions, using Magnet AI and Excire Photo AI to label and search evidence including nude content and faces. By constructing a custom dataset with real/nude images, deepfakes, and simulated chats, it exposes robustness gaps and high false positive rates. The study highlights that current AI in DF cannot fully replace human analysts and proposes privacy-preserving, on-device processing with explainable AI to mitigate risks. It also discusses implications for anti-forensics and outlines directions for improving tool resilience and defend against adversarial manipulation.

Abstract

Nowadays, many tools are used to facilitate forensic tasks about data extraction and data analysis. In particular, some tools leverage Artificial Intelligence (AI) to automatically label examined data into specific categories (\ie, drugs, weapons, nudity). However, this raises a serious concern about the robustness of the employed AI algorithms against adversarial attacks. Indeed, some people may need to hide specific data to AI-based digital forensics tools, thus manipulating the content so that the AI system does not recognize the offensive/prohibited content and marks it at as suspicious to the analyst. This could be seen as an anti-forensics attack scenario. For this reason, we analyzed two of the most important forensics tools employing AI for data classification: Magnet AI, used by Magnet Axiom, and Excire Photo AI, used by X-Ways Forensics. We made preliminary tests using about $200$ images, other $100$ sent in $3$ chats about pornography and teenage nudity, drugs and weapons to understand how the tools label them. Moreover, we loaded some deepfake images (images generated by AI forging real ones) of some actors to understand if they would be classified in the same category as the original images. From our preliminary study, we saw that the AI algorithm is not robust enough, as we expected since these topics are still open research problems. For example, some sexual images were not categorized as nudity, and some deepfakes were categorized as the same real person, while the human eye can see the clear nudity image or catch the difference between the deepfakes. Building on these results and other state-of-the-art works, we provide some suggestions for improving how digital forensics analysis tool leverage AI and their robustness against adversarial attacks or different scenarios than the trained one.

Exploring the Robustness of AI-Driven Tools in Digital Forensics: A Preliminary Study

TL;DR

This paper assesses how AI-driven digital forensics tools perform under adversarial conditions, using Magnet AI and Excire Photo AI to label and search evidence including nude content and faces. By constructing a custom dataset with real/nude images, deepfakes, and simulated chats, it exposes robustness gaps and high false positive rates. The study highlights that current AI in DF cannot fully replace human analysts and proposes privacy-preserving, on-device processing with explainable AI to mitigate risks. It also discusses implications for anti-forensics and outlines directions for improving tool resilience and defend against adversarial manipulation.

Abstract

Nowadays, many tools are used to facilitate forensic tasks about data extraction and data analysis. In particular, some tools leverage Artificial Intelligence (AI) to automatically label examined data into specific categories (\ie, drugs, weapons, nudity). However, this raises a serious concern about the robustness of the employed AI algorithms against adversarial attacks. Indeed, some people may need to hide specific data to AI-based digital forensics tools, thus manipulating the content so that the AI system does not recognize the offensive/prohibited content and marks it at as suspicious to the analyst. This could be seen as an anti-forensics attack scenario. For this reason, we analyzed two of the most important forensics tools employing AI for data classification: Magnet AI, used by Magnet Axiom, and Excire Photo AI, used by X-Ways Forensics. We made preliminary tests using about images, other sent in chats about pornography and teenage nudity, drugs and weapons to understand how the tools label them. Moreover, we loaded some deepfake images (images generated by AI forging real ones) of some actors to understand if they would be classified in the same category as the original images. From our preliminary study, we saw that the AI algorithm is not robust enough, as we expected since these topics are still open research problems. For example, some sexual images were not categorized as nudity, and some deepfakes were categorized as the same real person, while the human eye can see the clear nudity image or catch the difference between the deepfakes. Building on these results and other state-of-the-art works, we provide some suggestions for improving how digital forensics analysis tool leverage AI and their robustness against adversarial attacks or different scenarios than the trained one.

Paper Structure

This paper contains 21 sections, 10 figures.

Figures (10)

  • Figure 1: The bar chart shows the percentage of nude images detected as nude by Magnet AI tool for each of the examined $4$ categories: t-shirt, real porn, manga and teenager, each one containing $50$ pictures.
  • Figure 2: The bar chart shows the percentage accuracy for the chat tests made with the Magnet AI analysis. The dataset is made of drug and nude pictures. For each analyzed dataset, the left green bar is the samples detected correctly (true positives), while the yellow right bar represents the other images labeled as those categories (false positives).
  • Figure 3: The bar chart shows the accuracy of the Excire Photo AI picture detection algorithm, in terms of percentage of correctly detected faces contained in the datasets described in Section \ref{['sec:experiments:subsec:dataset']} (true positives, green bar on the left) and the percentage of different pictures erroneously classified as portraying the queried person (false positives, violet bar on the right).
  • Figure 4: Dataset of nude t-shirts (top) and the detected results (bottom) by the Magnet AI tool for nude detection experiments on painted dresses.
  • Figure 5: The image shows the pictures detected as drugs but that clearly are not drugs (false positive rates) such as the grape (red circled) for the drug detection in chat done by Magnet AI tool.
  • ...and 5 more figures