K-stars LDP: A Novel Framework for (p, q)-clique Enumeration under Local Differential Privacy
Henan Sun, Zhengyu Wu, Rong-Hua Li, Guoren Wang, Zening Li
TL;DR
The paper addresses private counting of $(p,q)$-cliques in bipartite graphs under Local Differential Privacy, identifying high noise in edge-LDP approaches as a core limitation. It introduces a $k$-stars LDP framework that obfuscates the existence of $k$-stars rather than individual edges, leveraging two-round user-collector interaction, Warner's Randomized Response, absolute value correction, and $k$-stars sampling to reduce estimation error. The authors prove that $ ext{$ ext{epsilon}$-}k$-stars LDP implies a bounded leakage relative to edge LDP, establish unbiasedness of the estimators, and derive variance bounds demonstrating improved utility, especially for larger $(p,q)$. Empirical results on four public bipartite graphs show that $k$-stars LDP outperforms edge LDP across metrics and settings, with pronounced gains in sparse graphs and when enumerating more complex $(p,q)$-cliques, validating its practical impact for privacy-preserving subgraph analysis.
Abstract
(p,q)-clique enumeration on a bipartite graph is critical for calculating clustering coefficient and detecting densest subgraph. It is necessary to carry out subgraph enumeration while protecting users' privacy from any potential attacker as the count of subgraph may contain sensitive information. Most recent studies focus on the privacy protection algorithms based on edge LDP (Local Differential Privacy). However, these algorithms suffer a large estimation error due to the great amount of required noise. In this paper, we propose a novel idea of k-stars LDP and a novel k-stars LDP algorithm for (p, q)-clique enumeration with a small estimation error, where a k-stars is a star-shaped graph with k nodes connecting to one node. The effectiveness of edge LDP relies on its capacity to obfuscate the existence of an edge between the user and his one-hop neighbors. This is based on the premise that a user should be aware of the existence of his one-hop neighbors. Similarly, we can apply this premise to k-stars as well, where an edge is a specific genre of 1-stars. Based on this fact, we first propose the k-stars neighboring list to enable our algorithm to obfuscate the existence of k-stars with Warner' s RR. Then, we propose the absolute value correction technique and the k-stars sampling technique to further reduce the estimation error. Finally, with the two-round user-collector interaction mechanism, we propose our k-stars LDP algorithm to count the number of (p, q)-clique while successfully protecting users' privacy. Both the theoretical analysis and experiments have showed the superiority of our algorithm over the algorithms based on edge LDP.
