Uniformly consistent proportion estimation for composite hypotheses via integral equations: "the case of Gamma random variables"
Xiongzhi Chen
TL;DR
This work develops uniformly consistent estimators for the proportion of false null hypotheses under composite nulls for Gamma-distributed variables, using a strategy based on discriminant functions and matching kernels that solve Lebesgue-Stieltjes integral equations. Key innovations include Construction I (bounded null) and Construction II (one-sided null), both leveraging Dirichlet integrals and harmonic analysis to approximate indicator sets without relying on p-values, and an extension to functionals of a bounded null with bounded variation. The Gamma family’s scaling-invariance and separable-moments enable precise control of convergence speeds and uniform consistency classes under independence, with simulations showing competitive performance against MR and Storey methods. The approach broadens applicability to adaptive FDR/FNR procedures for composite-null testing and suggests extensions to other distributions and Lie-group-valued data via harmonic-analytic techniques.
Abstract
We consider estimating the proportion of random variables for two types of composite null hypotheses: (i) the means of the random variables belonging to a non-empty, bounded interval; (ii) the means of the random variables belonging to an unbounded interval that is not the whole real line. For each type of composite null hypotheses, uniformly consistent estimators of the proportion of false null hypotheses are constructed for random variables whose distributions are members of the Gamma family. Further, uniformly consistent estimators of certain functions of a bounded null on the means are provided for the random variables mentioned earlier. These functions are continuous and of bounded variation. The estimators are constructed via solutions to Lebesgue-Stieltjes integral equations and harmonic analysis, do not rely on a concept of p-value, and have various applications.ce via mixture models, and may be used to estimate the sparsity level in high-dimensional Gaussian linear models.
