Collective Privacy Recovery: Data-sharing Coordination via Decentralized Artificial Intelligence

Evangelos Pournaras; Mark Christopher Ballandies; Stefano Bennati; Chien-fei Chen

Collective Privacy Recovery: Data-sharing Coordination via Decentralized Artificial Intelligence

Evangelos Pournaras, Mark Christopher Ballandies, Stefano Bennati, Chien-fei Chen

TL;DR

This work tackles collective privacy loss by modeling data-sharing as a scarce resource and enabling decentralized coordination to minimize data sharing while maintaining service quality. It formalizes a multi-criteria, personalized-valuation framework with $k$ criteria and $m=\prod_{u=1}^k l_u$ scenarios, assigning rewards via $\hat{r}_{i,j}$ and adjusting privacy through $p_i$, all under a structured recruitment and living-lab protocol. The authors validate the approach in a large, living-lab style experiment using causal inference and cluster analysis to identify data-sharing behaviors and five signal patterns for coordination, demonstrating that coordinated data sharing can yield privacy gains with manageable costs to service providers. They further contrast valuation schemes (absolute/relative, shared/sacrificed data) and show coordinated data sharing often outperforms intrinsic or rewarded sharing in terms of privacy recovery, supported by extensive conjoint analysis and ANOVA. Overall, the study provides a scalable, AI-driven pathway for collective privacy recovery with practical implications for privacy-aware data ecosystems.

Abstract

Collective privacy loss becomes a colossal problem, an emergency for personal freedoms and democracy. But, are we prepared to handle personal data as scarce resource and collectively share data under the doctrine: as little as possible, as much as necessary? We hypothesize a significant privacy recovery if a population of individuals, the data collective, coordinates to share minimum data for running online services with the required quality. Here we show how to automate and scale-up complex collective arrangements for privacy recovery using decentralized artificial intelligence. For this, we compare for first time attitudinal, intrinsic, rewarded and coordinated data sharing in a rigorous living-lab experiment of high realism involving >27,000 real data disclosures. Using causal inference and cluster analysis, we differentiate criteria predicting privacy and five key data-sharing behaviors. Strikingly, data-sharing coordination proves to be a win-win for all: remarkable privacy recovery for people with evident costs reduction for service providers.

Collective Privacy Recovery: Data-sharing Coordination via Decentralized Artificial Intelligence

TL;DR

criteria and

scenarios, assigning rewards via

and adjusting privacy through

, all under a structured recruitment and living-lab protocol. The authors validate the approach in a large, living-lab style experiment using causal inference and cluster analysis to identify data-sharing behaviors and five signal patterns for coordination, demonstrating that coordinated data sharing can yield privacy gains with manageable costs to service providers. They further contrast valuation schemes (absolute/relative, shared/sacrificed data) and show coordinated data sharing often outperforms intrinsic or rewarded sharing in terms of privacy recovery, supported by extensive conjoint analysis and ANOVA. Overall, the study provides a scalable, AI-driven pathway for collective privacy recovery with practical implications for privacy-aware data ecosystems.

Abstract

Paper Structure (23 sections, 13 equations, 16 figures, 19 tables)

This paper contains 23 sections, 13 equations, 16 figures, 19 tables.

General Data-Sharing Model
Data-sharing criteria
A weighting scheme for personalized privacy valuation
Calculating rewards and privacy
Recruitment Process
Recruitment sessions
E-mail invitation for recruitment
Experimental Design
Preparatory phase
Entry phase
Core phase
Exit phase
Compensation and monetary incentives
Implementation of the Technical Infrastructure
The Privacy and Rewards Gain of Data-sharing Scenarios
...and 8 more sections

Figures (16)

Figure 1: Instructions presented to the participants starting with the entry phase.
Figure 2: Information consent for participation in the designed experiment.
Figure 3: Screens of the Android app during the entry phase.
Figure 4: Data used from Question A.9, A.13 (yellow bars, intrusion) and A.6 (red bars, probability) of the preparatory phase for choosing the elements of the data-sharing criteria for the factorial experiment.
Figure 5: Instructions presented to the participants starting the core phase and after finishing the entry phase.
...and 11 more figures

Collective Privacy Recovery: Data-sharing Coordination via Decentralized Artificial Intelligence

TL;DR

Abstract

Collective Privacy Recovery: Data-sharing Coordination via Decentralized Artificial Intelligence

Authors

TL;DR

Abstract

Table of Contents

Figures (16)