LDPKiT: Superimposing Remote Queries for Privacy-Preserving Local Model Training
Kexin Li, Aastha Mehta, David Lie
TL;DR
LDPKiT addresses privacy concerns in leveraging proprietary remote models by injecting $oldsymbol{ ext{$bepsilon$-LDP}}$ noise into private data and augmenting it with a two-layer data augmentation strategy. It introduces LDPKiT-Rand and LDPKiT-Sup to generate a larger, privacy-protected inference set ${ m D_{infer}}$ that enables effective knowledge transfer to a local model while bounding leakage. Empirically, LDPKiT-Sup consistently recovers most of the utility lost to LDP noise across SVHN, Fashion-MNIST, and PathMNIST, with latent-space analyses showing that superimposed samples align better with target distributions than random noise. The work also demonstrates data reconstruction risks are mitigated under the proposed privacy regime and discusses ethical considerations and practical implications for real-world use, highlighting that the extracted local models are non-competitive and non-stealthy. Overall, LDPKiT offers a practical, privacy-preserving pathway to label and learn from sensitive data using remote models, with stronger privacy guarantees yielding meaningful utility gains at scale.
Abstract
Users of modern Machine Learning (ML) cloud services face a privacy conundrum -- on one hand, they may have concerns about sending private data to the service for inference, but on the other hand, for specialized models, there may be no alternative but to use the proprietary model of the ML service. In this work, we present LDPKiT, a framework for non-adversarial, privacy-preserving model extraction that leverages a user's private in-distribution data while bounding privacy leakage. LDPKiT introduces a novel superimposition technique that generates approximately in-distribution samples, enabling effective knowledge transfer under local differential privacy (LDP). Experiments on Fashion-MNIST, SVHN, and PathMNIST demonstrate that LDPKiT consistently improves utility while maintaining privacy, with benefits that become more pronounced at stronger noise levels. For example, on SVHN, LDPKiT achieves nearly the same inference accuracy at $ε=1.25$ as at $ε=2.0$, yielding stronger privacy guarantees with less than a 2% accuracy reduction. We further conduct sensitivity analyses to examine the effect of dataset size on performance and provide a systematic analysis of latent space representations, offering theoretical insights into the accuracy gains of LDPKiT.
