Adaptive learning of density ratios in RKHS
Werner Zellinger, Stefan Kindermann, Sergei V. Pereverzyev
TL;DR
The paper addresses estimating the density ratio $\beta=\mathrm dP/\mathrm dQ$ from finite samples by learning a density-ratio model in an RKHS through a regularized Bregman-divergence objective. It establishes that the associated losses are generalized self-concordant, derives finite-sample error bounds that depend on a regularity parameter $r$ and a capacity parameter $\alpha$, and proves minimax-optimal rates for the square loss. A novel Lepski-type principle is introduced to adaptively select the regularization parameter $\lambda$ without knowing $r$, achieving the optimal rate in the square-loss case and near-optimal rates in general. The framework is validated by a numerical example and supported by a suite of detailed proofs for both a priori and empirical norm-based parameter choices, offering a practical, theory-backed method for density-ratio estimation in RKHSs.
Abstract
Estimating the ratio of two probability densities from finitely many observations of the densities is a central problem in machine learning and statistics with applications in two-sample testing, divergence estimation, generative modeling, covariate shift adaptation, conditional density estimation, and novelty detection. In this work, we analyze a large class of density ratio estimation methods that minimize a regularized Bregman divergence between the true density ratio and a model in a reproducing kernel Hilbert space (RKHS). We derive new finite-sample error bounds, and we propose a Lepskii type parameter choice principle that minimizes the bounds without knowledge of the regularity of the density ratio. In the special case of quadratic loss, our method adaptively achieves a minimax optimal error rate. A numerical illustration is provided.
