Overcoming Saturation in Density Ratio Estimation by Iterated Regularization
Lukas Gruber, Markus Holzleitner, Johannes Lehner, Sepp Hochreiter, Werner Zellinger
TL;DR
This work tackles the problem of estimating density ratios $\beta=\frac{dP}{dQ}$ from finite samples, where standard kernel methods suffer error saturation and fail to achieve fast convergence rates on highly regular problems. The authors introduce iterated regularization, updating $f^{\lambda,t+1}$ via a Bregman-divergence-based objective to counter saturation and prove non-saturating, fast-rate guarantees under source and capacity conditions; together with a practical optimization scheme using the Representer Theorem and conjugate gradient. Theoretical results show fast rates $\le C (m+n)^{-\frac{2s\alpha}{2s\alpha+1}}$ when the iteration level $t$ satisfies $t\ge r+\tfrac12$, extending saturation-free performance to density-ratio estimation; empirically, iterated methods outperform their non-iterated counterparts on synthetic benchmarks and large-scale unsupervised domain-adaptation ensembles. The practical impact is improved sample efficiency and ensemble performance in domain adaptation tasks, with a public code release enabling replication and broader adoption of iterated regularization in kernel-based density-ratio estimation.
Abstract
Estimating the ratio of two probability densities from finitely many samples, is a central task in machine learning and statistics. In this work, we show that a large class of kernel methods for density ratio estimation suffers from error saturation, which prevents algorithms from achieving fast error convergence rates on highly regular learning problems. To resolve saturation, we introduce iterated regularization in density ratio estimation to achieve fast error rates. Our methods outperform its non-iteratively regularized versions on benchmarks for density ratio estimation as well as on large-scale evaluations for importance-weighted ensembling of deep unsupervised domain adaptation models.
