Inference Privacy: Properties and Mechanisms
Fengwei Tian, Ravi Tandon
TL;DR
The paper introduces Inference Privacy (IP), a framework for protecting user inputs during the inference phase by defining privacy within a radius $\\alpha$ under a metric $d$ and providing $(\\epsilon,\\delta)$-style guarantees. It shows that IP generalizes Local Differential Privacy (LDP) and develops two broad mechanism families: input perturbation and output perturbation, with concrete realizations such as Lap-Output and Gauss-Output (which use Lipschitz constants) and Gauss-Input. The work establishes key IP properties—post-processing invariance, basic and parallel composition, and a chaining property—and demonstrates their implications for privacy accounting. Empirical evaluations on CIFAR-10/100 with ResNet-18 and SDP-based Lipschitz layers illustrate the utility-privacy trade-offs and reveal when output perturbation provides advantages over input perturbation, informing practical deployment of inference-time privacy protections.
Abstract
Ensuring privacy during inference stage is crucial to prevent malicious third parties from reconstructing users' private inputs from outputs of public models. Despite a large body of literature on privacy preserving learning (which ensures privacy of training data), there is no existing systematic framework to ensure the privacy of users' data during inference. Motivated by this problem, we introduce the notion of Inference Privacy (IP), which can allow a user to interact with a model (for instance, a classifier, or an AI-assisted chat-bot) while providing a rigorous privacy guarantee for the users' data at inference. We establish fundamental properties of the IP privacy notion and also contrast it with the notion of Local Differential Privacy (LDP). We then present two types of mechanisms for achieving IP: namely, input perturbations and output perturbations which are customizable by the users and can allow them to navigate the trade-off between utility and privacy. We also demonstrate the usefulness of our framework via experiments and highlight the resulting trade-offs between utility and privacy during inference.
