Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

Georgia Channing; Juil Sock; Ronald Clark; Philip Torr; Christian Schroeder de Witt

Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

Georgia Channing, Juil Sock, Ronald Clark, Philip Torr, Christian Schroeder de Witt

TL;DR

Novel explainability methods for state-of-the-art transformer-based audio deepfake detectors are introduced and a novel benchmark for real-world generalizability is open-source for real-world generalizability.

Abstract

The rapid proliferation of AI-manipulated or generated audio deepfakes poses serious challenges to media integrity and election security. Current AI-driven detection solutions lack explainability and underperform in real-world settings. In this paper, we introduce novel explainability methods for state-of-the-art transformer-based audio deepfake detectors and open-source a novel benchmark for real-world generalizability. By narrowing the explainability gap between transformer-based audio deepfake detectors and traditional methods, our results not only build trust with human experts, but also pave the way for unlocking the potential of citizen intelligence to overcome the scalability issue in audio deepfake detection.

Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

TL;DR

Abstract

Toward Robust Real-World Audio Deepfake Detection: Closing the Explainability Gap

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)