Unbiased Estimating Equation on Inverse Divergence and Its Conditions
Masahiro Kobayashi, Kazuho Watanabe
TL;DR
The paper addresses when the estimating equation for the loss $L(oldsymbol{ heta}) = rac{1}{n} ext{+++} n f(d_oldsymbol{phi}(oldsymbol{x}_i,oldsymbol{ heta}))$ under inverse divergence is unbiased without a bias-correction term. It introduces two model families, Inverse Gaussian Type (IGT) and Generalized IGT (GIGT) mixtures, and derives explicit integral conditions on $f$ and the generating function $g$ (e.g., $I(1)< ty$ and $ int_0^ty g(t)f'(t) dt<6$) that guarantee unbiasedness, with corresponding results for multi-dimensional data via the MIGT distribution and a double-integral criterion. The main contributions are the precise necessary-and-sufficient conditions for unbiasedness in 1D and the multi-dimensional extension, clarifying how the choice of $f$ and the generating function $g$ interacts with the underlying distribution class. The findings illuminate when latent biases can be effectively eliminated in robust estimation using inverse divergence, linking these estimators to (continuous) Bregman distributions and the regular exponential family. Practically, the work provides rigorous criteria for selecting $f$ and model families to achieve unbiased estimation without computationally intractable bias corrections, even under outliers.
Abstract
This paper focuses on the Bregman divergence defined by the reciprocal function, called the inverse divergence. For the loss function defined by the monotonically increasing function $f$ and inverse divergence, the conditions for the statistical model and function $f$ under which the estimating equation is unbiased are clarified. Specifically, we characterize two types of statistical models, an inverse Gaussian type and a mixture of generalized inverse Gaussian type distributions, to show that the conditions for the function $f$ are different for each model. We also define Bregman divergence as a linear sum over the dimensions of the inverse divergence and extend the results to the multi-dimensional case.
