Training Set Reconstruction from Differentially Private Forests: How Effective is DP?
Alice Gorgé, Julien Ferry, Sébastien Gambs, Thibaut Vidal
TL;DR
The paper investigates whether training data in differentially private random forests can be reconstructed. It introduces a constraint-programming–based reconstruction attack that leverages the forest structure and DP noise to recover a most likely training dataset, and it evaluates this approach on three tabular datasets across multiple DP budgets and forest configurations. The findings reveal that, although DP attenuates leakage, meaningful reconstruction of training data remains possible for non-trivial $\varepsilon$ values, with leakage often extending to dataset-specific details rather than mere distributional patterns; only extremely sparse, trivially predictive forests appear robust. The work highlights the need for careful DP mechanism design and hyperparameter tuning to balance privacy and utility, and it offers practical recommendations and a mathematical framework for privacy assessment of DP RFs in real-world deployments.
Abstract
Recent research has shown that structured machine learning models such as tree ensembles are vulnerable to privacy attacks targeting their training data. To mitigate these risks, differential privacy (DP) has become a widely adopted countermeasure, as it offers rigorous privacy protection. In this paper, we introduce a reconstruction attack targeting state-of-the-art $ε$-DP random forests. By leveraging a constraint programming model that incorporates knowledge of the forest's structure and DP mechanism characteristics, our approach formally reconstructs the most likely dataset that could have produced a given forest. Through extensive computational experiments, we examine the interplay between model utility, privacy guarantees and reconstruction accuracy across various configurations. Our results reveal that random forests trained with meaningful DP guarantees can still leak portions of their training data. Specifically, while DP reduces the success of reconstruction attacks, the only forests fully robust to our attack exhibit predictive performance no better than a constant classifier. Building on these insights, we also provide practical recommendations for the construction of DP random forests that are more resilient to reconstruction attacks while maintaining a non-trivial predictive performance.
