An Interpretable Evaluation of Entropy-based Novelty of Generative Models
Jingwei Zhang, Cheuk Ting Li, Farzan Farnia
TL;DR
The paper presents a principled framework for evaluating novelty in generative models relative to a reference distribution, focusing on multi-modal data. It introduces the Kernel-based Entropic Novelty (KEN) score, computed from the spectral properties of the differential kernel covariance $\Lambda_{\mathbf{X}|\eta\mathbf{Y}} = C_{\mathbf{X}} - \eta C_{\mathbf{Y}}$, and provides a kernel-trick-based computation via the $\,K_{\mathbf{X}|\eta\mathbf{Y}}$ matrix. Theoretical results show that, for well-separated mixtures, the top eigenvalues approximate mode frequencies, and the positive eigenvalues of $\Lambda_{\mathbf{X}|\eta\mathbf{Y}}$ quantify novelty where X-expressed modes occur $\eta$-times more often than in Y. Empirically, the method detects novel modes in synthetic Gaussian mixtures and real image datasets, reveals interpretability through mode-level clustering, and offers a new distribution-based benchmark to complement existing quality/diversity metrics. Code is provided to enable reproducibility and deployment in model benchmarking tasks.
Abstract
The massive developments of generative model frameworks require principled methods for the evaluation of a model's novelty compared to a reference dataset. While the literature has extensively studied the evaluation of the quality, diversity, and generalizability of generative models, the assessment of a model's novelty compared to a reference model has not been adequately explored in the machine learning community. In this work, we focus on the novelty assessment for multi-modal distributions and attempt to address the following differential clustering task: Given samples of a generative model $P_\mathcal{G}$ and a reference model $P_\mathrm{ref}$, how can we discover the sample types expressed by $P_\mathcal{G}$ more frequently than in $P_\mathrm{ref}$? We introduce a spectral approach to the differential clustering task and propose the Kernel-based Entropic Novelty (KEN) score to quantify the mode-based novelty of $P_\mathcal{G}$ with respect to $P_\mathrm{ref}$. We analyze the KEN score for mixture distributions with well-separable components and develop a kernel-based method to compute the KEN score from empirical data. We support the KEN framework by presenting numerical results on synthetic and real image datasets, indicating the framework's effectiveness in detecting novel modes and comparing generative models. The paper's code is available at: www.github.com/buyeah1109/KEN
