SynthEval: A Framework for Detailed Utility and Privacy Evaluation of Tabular Synthetic Data
Anton Danholt Lautrup, Tobias Hyrup, Arthur Zimek, Peter Schneider-Kamp
TL;DR
SynthEval addresses the lack of standardized, multi-metric evaluation for tabular synthetic data by providing an open-source, modular framework that treats numeric and categorical attributes uniformly. It offers a comprehensive library of utility and privacy metrics, configurable presets, and a multi-axis benchmark module that supports ranking of multiple synthetic datasets. A nearest-neighbour approach using Gower distance enables mixed-type data without one-hot encoding, facilitating scalable, unbiased comparisons. Through a real-world Hepatitis C dataset demonstration, the paper shows how SynthEval enables flexible benchmarking, model tuning for privacy-utility trade-offs, and clear identification of strengths and weaknesses across generative models, advancing reproducible evaluation of synthetic tabular data.
Abstract
With the growing demand for synthetic data to address contemporary issues in machine learning, such as data scarcity, data fairness, and data privacy, having robust tools for assessing the utility and potential privacy risks of such data becomes crucial. SynthEval, a novel open-source evaluation framework distinguishes itself from existing tools by treating categorical and numerical attributes with equal care, without assuming any special kind of preprocessing steps. This~makes it applicable to virtually any synthetic dataset of tabular records. Our tool leverages statistical and machine learning techniques to comprehensively evaluate synthetic data fidelity and privacy-preserving integrity. SynthEval integrates a wide selection of metrics that can be used independently or in highly customisable benchmark configurations, and can easily be extended with additional metrics. In this paper, we describe SynthEval and illustrate its versatility with examples. The framework facilitates better benchmarking and more consistent comparisons of model capabilities.
