MIBench: A Comprehensive Framework for Benchmarking Model Inversion Attack and Defense
Yixiang Qiu, Hongyao Yu, Hao Fang, Tianqu Zhuang, Wenbo Yu, Bin Chen, Xuan Wang, Shu-Tao Xia, Ke Xu
TL;DR
Model inversion attacks threaten privacy by reconstructing training data from model outputs. The paper introduces MIBench, a comprehensive benchmark with a modular toolbox that integrates 19 MI attack/defense methods and 9 evaluation protocols to enable reproducible, large-scale comparisons. Through extensive experiments across resolutions, model predictive power, defense strategies, and adversarial robustness, the authors reveal how attack effectiveness scales with target accuracy and how defenses perform under diverse conditions. The benchmark provides a foundation for fair evaluation and guides future research toward robust privacy-preserving MI defenses.
Abstract
Model Inversion (MI) attacks aim at leveraging the output information of target models to reconstruct privacy-sensitive training data, raising critical concerns regarding the privacy vulnerabilities of Deep Neural Networks (DNNs). Unfortunately, in tandem with the rapid evolution of MI attacks, the absence of a comprehensive benchmark with standardized metrics and reproducible implementations has emerged as a formidable challenge. This deficiency has hindered objective comparison of methodological advancements and reliable assessment of defense efficacy. To address this critical gap, we build the first practical benchmark named MIBench for systematic evaluation of model inversion attacks and defenses. This benchmark bases on an extensible and reproducible modular-based toolbox which currently integrates a total of 19 state-of-the-art attack and defense methods and encompasses 9 standardized evaluation protocols. Capitalizing on this foundation, we conduct extensive evaluation from multiple perspectives to holistically compare and analyze various methods across different scenarios, such as the impact of target resolution, model predictive power, defense performance and adversarial robustness.
