Privacy-Preserving Model and Preprocessing Verification for Machine Learning
Wenbiao Li, Anisa Halimi, Xiaoqian Jiang, Jaideep Vaidya, Erman Ayday
TL;DR
This work tackles the challenge of verifying preprocessing integrity for ML models trained on sensitive data under privacy constraints. It introduces a framework that combines Local Differential Privacy with interpretable explanations from LIME and SHAP, enabling verification without exposing raw data. Empirical results across three real-world datasets show that ML-based verification excels in binary tasks, while threshold-based methods perform comparably in multi-class settings, with verification accuracy depending on the privacy budget. The approach offers strong privacy guarantees and practical utility for safeguarding data, with future work focusing on improved verification metrics and adaptive privacy strategies.
Abstract
This paper presents a framework for privacy-preserving verification of machine learning models, focusing on models trained on sensitive data. Integrating Local Differential Privacy (LDP) with model explanations from LIME and SHAP, our framework enables robust verification without compromising individual privacy. It addresses two key tasks: binary classification, to verify if a target model was trained correctly by applying the appropriate preprocessing steps, and multi-class classification, to identify specific preprocessing errors. Evaluations on three real-world datasets-Diabetes, Adult, and Student Record-demonstrate that while the ML-based approach is particularly effective in binary tasks, the threshold-based method performs comparably in multi-class tasks. Results indicate that although verification accuracy varies across datasets and noise levels, the framework provides effective detection of preprocessing errors, strong privacy guarantees, and practical applicability for safeguarding sensitive data.
