Is My Data in Your AI? Membership Inference Test (MINT) applied to Face Biometrics
Daniel DeAlcala, Aythami Morales, Julian Fierrez, Gonzalo Mancera, Ruben Tolosana, Javier Ortega-Garcia
TL;DR
The paper presents the Membership Inference Test (MINT), an auditing framework to determine if a specific data sample was used to train an AI model, demonstrated on state-of-the-art Face Recognition systems. It proposes two architectures, Vanilla MINT (MLP with per-channel activation pooling) and CNN MINT (CNN over activation maps), which leverage Auxiliary Auditable Data (AAD) and model embeddings to distinguish training versus external data. Evaluations across three FR models and six databases show up to $90\%$ accuracy, outperforming adapted MIAs and highlighting the potential for privacy and regulatory compliance in AI systems. The work also discusses practical deployment challenges, legal implications, and future directions including gradients and unsupervised extensions across domains beyond face data.
Abstract
This article introduces the Membership Inference Test (MINT), a novel approach that aims to empirically assess if given data was used during the training of AI/ML models. Specifically, we propose two MINT architectures designed to learn the distinct activation patterns that emerge when an Audited Model is exposed to data used during its training process. These architectures are based on Multilayer Perceptrons (MLPs) and Convolutional Neural Networks (CNNs). The experimental framework focuses on the challenging task of Face Recognition, considering three state-of-the-art Face Recognition systems. Experiments are carried out using six publicly available databases, comprising over 22 million face images in total. Different experimental scenarios are considered depending on the context of the AI model to test. Our proposed MINT approach achieves promising results, with up to 90\% accuracy, indicating the potential to recognize if an AI model has been trained with specific data. The proposed MINT approach can serve to enforce privacy and fairness in several AI applications, e.g., revealing if sensitive or private data was used for training or tuning Large Language Models (LLMs).
