MagLive: Robust Voice Liveness Detection on Smartphones Using Magnetic Pattern Changes

Xiping Sun; Jing Chen; Cong Wu; Kun He; Haozhe Xu; Yebo Feng; Ruiying Du; Xianhao Chen

MagLive: Robust Voice Liveness Detection on Smartphones Using Magnetic Pattern Changes

Xiping Sun, Jing Chen, Cong Wu, Kun He, Haozhe Xu, Yebo Feng, Ruiying Du, Xianhao Chen

TL;DR

MagLive addresses the vulnerability of smartphone voice authentication to replay spoofing by leveraging magnetic pattern changes produced during speech. It introduces a magnetometer-based liveness detector that uses a TF-CNN-SAF feature extractor and supervised contrastive learning to produce user-, device-, and content-irrelevant representations. The approach achieves high security performance, with an average BAC of 99.01% and EER of 0.77% across diverse devices, environments, and attack scenarios, while requiring no active sensing or extra hardware. This work demonstrates a practical, on-device defense that strengthens voice authentication on smartphones with minimal user burden.

Abstract

Voice authentication has been widely used on smartphones. However, it remains vulnerable to spoofing attacks, where the attacker replays recorded voice samples from authentic humans using loudspeakers to bypass the voice authentication system. In this paper, we present MagLive, a robust voice liveness detection scheme designed for smartphones to mitigate such spoofing attacks. MagLive leverages the differences in magnetic pattern changes generated by different speakers (i.e., humans or loudspeakers) when speaking for liveness detection, which are captured by the built-in magnetometer on smartphones. To extract effective and robust magnetic features, MagLive utilizes a TF-CNN-SAF model as the feature extractor, which includes a time-frequency convolutional neural network (TF-CNN) combined with a self-attention-based fusion (SAF) model. Supervised contrastive learning is then employed to achieve user-irrelevance, device-irrelevance, and content-irrelevance. MagLive imposes no additional burden on users and does not rely on active sensing or specialized hardware. We conducted comprehensive experiments with various settings to evaluate the security and robustness of MagLive. Our results demonstrate that MagLive effectively distinguishes between humans and attackers (i.e., loudspeakers), achieving an average balanced accuracy (BAC) of 99.01% and an equal error rate (EER) of 0.77%.

MagLive: Robust Voice Liveness Detection on Smartphones Using Magnetic Pattern Changes

TL;DR

Abstract

Paper Structure (23 sections, 4 equations, 23 figures, 5 tables)

This paper contains 23 sections, 4 equations, 23 figures, 5 tables.

Introduction
Preliminaries
Magnetic Effect of Speakers
Motivating Examples
Overview of MagLive
System Overview
Threat Model
Design Goals
Design of MagLive
Data Capture
Data Preprocessing
Feature Extraction
Authentication
Evaluation
Experiment Setup
...and 8 more sections

Figures (23)

Figure 1: Illustration of MagLive. (a) It uses the built-in magnetometer and microphone on the smartphone for voice liveness detection. (b) It detects the liveness of the voice to determine whether it is from an authentic human or artificially reproduced.
Figure 2: An example of user 1 speaking digits from zero to four (human).
Figure 3: An example of user 2 speaking digits from zero to four (human).
Figure 4: An example of Pixel3a replaying the speech of User 1 (loudspeaker).
Figure 5: An example of P30 replaying the speech of user 1 (loudspeaker).
...and 18 more figures

MagLive: Robust Voice Liveness Detection on Smartphones Using Magnetic Pattern Changes

TL;DR

Abstract

MagLive: Robust Voice Liveness Detection on Smartphones Using Magnetic Pattern Changes

Authors

TL;DR

Abstract

Table of Contents

Figures (23)