A Game-Theoretic Analysis of Auditing Differentially Private Algorithms with Epistemically Disparate Herd
Ya-Ting Yang, Tao Zhang, Quanyan Zhu
TL;DR
Addresses the challenge of auditing privacy-preserving algorithms when audits are costly; this work constructs a Stackelberg herd-audit framework with a leader (developer) and heterogeneous followers (auditors) subject to rational inattention linked to $\epsilon$-DP; derives Bayes rule and optimal information strategies, and characterizes equilibria showing that lower $\lambda$ and lower information costs improve audit confidence and deter deviant behavior; concludes that herd audits can credibly threaten developers and enhance the responsible deployment of privacy preserving AI, with future work on end-user incentives and distributed-central auditing.
Abstract
Privacy-preserving AI algorithms are widely adopted in various domains, but the lack of transparency might pose accountability issues. While auditing algorithms can address this issue, machine-based audit approaches are often costly and time-consuming. Herd audit, on the other hand, offers an alternative solution by harnessing collective intelligence. Nevertheless, the presence of epistemic disparity among auditors, resulting in varying levels of expertise and access to knowledge, may impact audit performance. An effective herd audit will establish a credible accountability threat for algorithm developers, incentivizing them to uphold their claims. In this study, our objective is to develop a systematic framework that examines the impact of herd audits on algorithm developers using the Stackelberg game approach. The optimal strategy for auditors emphasizes the importance of easy access to relevant information, as it increases the auditors' confidence in the audit process. Similarly, the optimal choice for developers suggests that herd audit is viable when auditors face lower costs in acquiring knowledge. By enhancing transparency and accountability, herd audit contributes to the responsible development of privacy-preserving algorithms.
