Information Criteria for Selecting Parton Distribution Function Solutions
Aurore Courtoy, Arturo Ibsen
TL;DR
This work tackles the non-uniqueness of parton distribution function (PDF) solutions arising from ill-posed inverse problems in global QCD analyses. It introduces an absolute shape criterion based on Rényi entropy $H_2(X)$ to classify and select diverse PDF shapes, and complements it with relative criteria via Rényi divergence $D_2$ and Wasserstein distance $W_1$, using Pareto fronts to identify representative metamorphs. The methods are demonstrated on Fantômas pion PDFs, showing that Rényi-entropy-driven selection yields results qualitatively consistent with relative metrics and also supports clustering with UMAP. The approach provides a computationally efficient, minimalistic framework for capturing model diversity and guiding robust PDF ensembles, with clear paths for extension to proton PDFs and higher flavor-dimensional spaces.
Abstract
In data-driven determination of Parton Distribution Functions (PDFs) in global QCD analyses, uncovering the true underlying distributions is complicated by a highly convoluted inverse problem. The determination of PDFs can be understood as the inference of a function supported on $[0,1]$, a problem that admits multiple acceptable solutions. An ensemble of solutions exists that pass all standard goodness-of-fit criteria. In this paper, we propose algorithms for the classification, clustering, and selection of solutions to the determination of PDFs, or any functions on $[0,1]$, based on the characterization of their shape. We explore information-theoretic based (Rényi entropy and divergence) and optimal-transport based (Wasserstein distance) criteria. In particular, we advocate for the use of the Rényi entropy as an {\it absolute} estimator per solution, as opposed to {\it relative} estimators that compare solutions pairwise. We show that the Rényi entropy can characterize the space of solutions {\it w.r.t.} the PDF shapes. Paired with the identification of the optimal combination of solutions via Pareto fronts, it provides a plausible and minimalist selection algorithm. Moreover, Rényi entropy proves versatile for use in clustering applications.
