LinkedIn's Audience Engagements API: A Privacy Preserving Data Analytics System at Scale
Ryan Rogers, Subbu Subramaniam, Sean Peng, David Durfee, Seunghyun Lee, Santosh Kumar Kancha, Shraddha Sahay, Parvez Ahammad
TL;DR
This paper presents a production-oriented differential privacy system for LinkedIn's Audience Engagement API, integrating a suite of DP algorithms with a cross-data-center privacy budget management service to support real-time, aggregated marketing analytics. It distinguishes between known/unknown data-domain and restricted/unrestricted sensitivity, mapping each setting to specific DP mechanisms (Laplace and Exponential via Gumbel) and leveraging modern BR composition bounds to tightly bound overall privacy loss. The authors implement the DP stack atop Apache Pinot and an Espresso-based budget store, enabling scalable, consistent DP results across analysts and data centers. Deployment is staged with careful parameter tuning, pseudorandom seeding for consistency, and thorough consideration of potential attacks, culminating in a reported monthly DP guarantee of about (34.9, 7e-9). The work demonstrates the practicality of delivering production-scale, privacy-preserving analytics in a real-time OLAP environment, balancing utility and privacy through explicit budgeting and adaptive query processing.
Abstract
We present a privacy system that leverages differential privacy to protect LinkedIn members' data while also providing audience engagement insights to enable marketing analytics related applications. We detail the differentially private algorithms and other privacy safeguards used to provide results that can be used with existing real-time data analytics platforms, specifically with the open sourced Pinot system. Our privacy system provides user-level privacy guarantees. As part of our privacy system, we include a budget management service that enforces a strict differential privacy budget on the returned results to the analyst. This budget management service brings together the latest research in differential privacy into a product to maintain utility given a fixed differential privacy budget.
