Unified Locational Differential Privacy Framework
Aman Priyanshu, Yash Maurya, Suriya Ganesh, Vy Tran
TL;DR
We address private geographical data analysis by proposing a unified locational differential privacy framework that supports diverse data types (one-hot vectors, booleans, integers, and floats) over geographic regions. The framework uses local DP mechanisms—randomized response, the exponential mechanism, and the Gaussian mechanism—implemented with Diffprivlib and tracked with Opacus, including shuffling for privacy amplification. It is evaluated on four simulated location-aggregation datasets, with findings showing that increasing the privacy budget $\epsilon$ improves utility (lower MSE), and Gaussian/Exponential mechanisms generally outperform randomized response for numerical data. The work offers a practical, extensible toolkit for privacy-preserving geographic analysis and may be released as open-source to facilitate adoption and further research.
Abstract
Aggregating statistics over geographical regions is important for many applications, such as analyzing income, election results, and disease spread. However, the sensitive nature of this data necessitates strong privacy protections to safeguard individuals. In this work, we present a unified locational differential privacy (DP) framework to enable private aggregation of various data types, including one-hot encoded, boolean, float, and integer arrays, over geographical regions. Our framework employs local DP mechanisms such as randomized response, the exponential mechanism, and the Gaussian mechanism. We evaluate our approach on four datasets representing significant location data aggregation scenarios. Results demonstrate the utility of our framework in providing formal DP guarantees while enabling geographical data analysis.
