Towards the Development of a Real-Time Deepfake Audio Detection System in Communication Platforms

Jonat John Mathew; Rakin Ahsan; Sae Furukawa; Jagdish Gautham Krishna Kumar; Huzaifa Pallan; Agamjeet Singh Padda; Sara Adamski; Madhu Reddiboina; Arjun Pankajakshan

Towards the Development of a Real-Time Deepfake Audio Detection System in Communication Platforms

Jonat John Mathew, Rakin Ahsan, Sae Furukawa, Jagdish Gautham Krishna Kumar, Huzaifa Pallan, Agamjeet Singh Padda, Sara Adamski, Madhu Reddiboina, Arjun Pankajakshan

TL;DR

This study assesses the viability of employing static deepfake audio detection models in real-time communication platforms, and proposes strategies and frameworks for enhancing these models, paving the way for real-time deepfake audio detection in communication platforms.

Abstract

Deepfake audio poses a rising threat in communication platforms, necessitating real-time detection for audio stream integrity. Unlike traditional non-real-time approaches, this study assesses the viability of employing static deepfake audio detection models in real-time communication platforms. An executable software is developed for cross-platform compatibility, enabling real-time execution. Two deepfake audio detection models based on Resnet and LCNN architectures are implemented using the ASVspoof 2019 dataset, achieving benchmark performances compared to ASVspoof 2019 challenge baselines. The study proposes strategies and frameworks for enhancing these models, paving the way for real-time deepfake audio detection in communication platforms. This work contributes to the advancement of audio stream security, ensuring robust detection capabilities in dynamic, real-time communication scenarios.

Towards the Development of a Real-Time Deepfake Audio Detection System in Communication Platforms

TL;DR

Abstract

Paper Structure (13 sections, 2 equations, 4 figures, 2 tables)

This paper contains 13 sections, 2 equations, 4 figures, 2 tables.

Introduction
Methodology
Dataset and Experimental Details
Model pipeline and Training
Software application development
Results and Discussion
ASVspoof 2019 Evaluation data
Teams Meeting dataset
Recommended Strategies
Training Data Variability
Data Augmentation
Generative Model Approaches
Conclusion

Figures (4)

Figure 1: A block diagram of various deepfake audio detection systems.
Figure 2: A block diagram showing the typical use-case of our deepfake audio detection system.
Figure 3: Pipeline of a deepfake audio detection system.
Figure 4: Flow diagram of the software application development process.

Towards the Development of a Real-Time Deepfake Audio Detection System in Communication Platforms

TL;DR

Abstract

Towards the Development of a Real-Time Deepfake Audio Detection System in Communication Platforms

Authors

TL;DR

Abstract

Table of Contents

Figures (4)