Data Poisoning Attacks to Locally Differentially Private Frequent Itemset Mining Protocols
Wei Tong, Haoyu Chen, Jiacheng Niu, Sheng Zhong
TL;DR
This work addresses the vulnerability of locally differentially private frequent itemset mining to data poisoning. It introduces Adaptive Orchestration Attack (AOA), a unified framework built from Attack Resource Estimation, Target Set Refinement, and Poisoned Data Generation, and shows how to compromise state-of-the-art protocols like LDPMiner, SVIM, SVSM, and FIML under various threat models, including limited knowledge and MITM scenarios. The authors provide extensive experiments across multiple real-world datasets, demonstrating that AOA can significantly degrade top‑k results, often outperforming baseline attacks. They also discuss defenses, including FO-level protections and protocol randomness, but find that robust, universally effective defenses remain an open challenge, underscoring the need for more resilient LDP mining systems in practice.
Abstract
Local differential privacy (LDP) provides a way for an untrusted data collector to aggregate users' data without violating their privacy. Various privacy-preserving data analysis tasks have been studied under the protection of LDP, such as frequency estimation, frequent itemset mining, and machine learning. Despite its privacy-preserving properties, recent research has demonstrated the vulnerability of certain LDP protocols to data poisoning attacks. However, existing data poisoning attacks are focused on basic statistics under LDP, such as frequency estimation and mean/variance estimation. As an important data analysis task, the security of LDP frequent itemset mining has yet to be thoroughly examined. In this paper, we aim to address this issue by presenting novel and practical data poisoning attacks against LDP frequent itemset mining protocols. By introducing a unified attack framework with composable attack operations, our data poisoning attack can successfully manipulate the state-of-the-art LDP frequent itemset mining protocols and has the potential to be adapted to other protocols with similar structures. We conduct extensive experiments on three datasets to compare the proposed attack with four baseline attacks. The results demonstrate the severity of the threat and the effectiveness of the proposed attack.
