Active Fourier Auditor for Estimating Distributional Properties of ML Models
Ayoub Ajarra, Bishwamittra Ghosh, Debabrota Basu
TL;DR
The paper addresses auditing three distributional properties of black-box classifiers—robustness, individual fairness, and group fairness—by avoiding full model reconstruction and instead exploiting a Fourier expansion of the classifier. It introduces the Active Fourier Auditor (AFA), which adaptively queries inputs to recover significant Fourier coefficients via a Goldreich–Levin style procedure and then estimates the properties with PAC guarantees. Theoretical results establish sample-complexity bounds: robustness and IF require $\tilde{O}\big(\frac{1}{\epsilon}\sqrt{\log(1/\delta)}\big)$ samples, while group fairness requires $\tilde{O}\big(\frac{1}{\epsilon^2}\log(1/\delta)\big)$, plus a $\tilde{\Omega}\big(\frac{\delta}{\epsilon^2}\big)$ lower bound for GF; NP-hardness of exact Fourier-coefficient computation motivates the approach. Empirically, AFA yields lower estimation error and better sample efficiency than baselines across multiple datasets and model types, demonstrating practical utility for universal, model-agnostic auditing of distributional properties.
Abstract
With the pervasive deployment of Machine Learning (ML) models in real-world applications, verifying and auditing properties of ML models have become a central concern. In this work, we focus on three properties: robustness, individual fairness, and group fairness. We discuss two approaches for auditing ML model properties: estimation with and without reconstruction of the target model under audit. Though the first approach is studied in the literature, the second approach remains unexplored. For this purpose, we develop a new framework that quantifies different properties in terms of the Fourier coefficients of the ML model under audit but does not parametrically reconstruct it. We propose the Active Fourier Auditor (AFA), which queries sample points according to the Fourier coefficients of the ML model, and further estimates the properties. We derive high probability error bounds on AFA's estimates, along with the worst-case lower bounds on the sample complexity to audit them. Numerically we demonstrate on multiple datasets and models that AFA is more accurate and sample-efficient to estimate the properties of interest than the baselines.
