Semiparametric Efficient Empirical Higher Order Influence Function Estimators

Lin Liu; Rajarshi Mukherjee; Whitney K. Newey; James M. Robins

Semiparametric Efficient Empirical Higher Order Influence Function Estimators

Lin Liu, Rajarshi Mukherjee, Whitney K. Newey, James M. Robins

TL;DR

This work introduces empirical Higher Order Influence Function (HOIF) estimators for semiparametric efficient estimation of functionals such as the mean under MAR, notably removing the need to nonparametrically estimate the covariate density $g$. By substituting the population Gram matrix with its empirical counterpart, the estimator achieves $\sqrt{n}$-consistency and efficiency under minimal Hölder smoothness, without requiring $g$ to be smooth. The approach extends to the full class of doubly robust functionals and adapts to unknown smoothness levels, yielding adaptive efficiency via basis choices with optimal approximation properties. Simulations show finite-sample gains when $g$ is rough, with reductions in bias and faster computation compared to density-based HOIFs. Overall, the paper fills a theoretical gap by delivering density-free, adaptive, and efficient HOIF estimators for a broad set of causal functionals under minimal assumptions.

Abstract

Robins et al. (2008, 2017) applied the theory of higher order influence functions (HOIFs) to derive an estimator of the mean $ψ$ of an outcome Y in a missing data model with Y missing at random conditional on a vector X of continuous covariates; their estimator, in contrast to previous estimators, is semiparametric efficient under the minimal conditions of Robins et al. (2009b), together with an additional (non-minimal) smoothness condition on the density g of X, because the Robins et al. (2008, 2017) estimator depends on a nonparametric estimate of g. In this paper, we introduce a new HOIF estimator that has the same asymptotic properties as the original one, but does not impose any smoothness requirement on g. This is important for two reasons. First, one rarely has the knowledge about the properties of g. Second, even when g is smooth, if the dimension of X is even moderate, accurate nonparametric estimation of its density is not feasible at the sample sizes often encountered in applications. In fact, to the best of our knowledge, this new HOIF estimator remains the only semiparametric efficient estimator of $ψ$ under minimal conditions, despite the rapidly growing literature on causal effect estimation. We also show that our estimator can be generalized to the entire class of functionals considered by Robins et al. (2008) which include the average effect of a treatment on a response Y when a vector X suffices to control confounding and the expected conditional variance of a response Y given a vector X. Simulation experiments are also conducted, which demonstrate that our new estimator outperforms those of Robins et al. (2008, 2017) in finite samples, when g is not very smooth.

Semiparametric Efficient Empirical Higher Order Influence Function Estimators

TL;DR

. By substituting the population Gram matrix with its empirical counterpart, the estimator achieves

-consistency and efficiency under minimal Hölder smoothness, without requiring

to be smooth. The approach extends to the full class of doubly robust functionals and adapts to unknown smoothness levels, yielding adaptive efficiency via basis choices with optimal approximation properties. Simulations show finite-sample gains when

is rough, with reductions in bias and faster computation compared to density-based HOIFs. Overall, the paper fills a theoretical gap by delivering density-free, adaptive, and efficient HOIF estimators for a broad set of causal functionals under minimal assumptions.

Abstract

Robins et al. (2008, 2017) applied the theory of higher order influence functions (HOIFs) to derive an estimator of the mean

of an outcome Y in a missing data model with Y missing at random conditional on a vector X of continuous covariates; their estimator, in contrast to previous estimators, is semiparametric efficient under the minimal conditions of Robins et al. (2009b), together with an additional (non-minimal) smoothness condition on the density g of X, because the Robins et al. (2008, 2017) estimator depends on a nonparametric estimate of g. In this paper, we introduce a new HOIF estimator that has the same asymptotic properties as the original one, but does not impose any smoothness requirement on g. This is important for two reasons. First, one rarely has the knowledge about the properties of g. Second, even when g is smooth, if the dimension of X is even moderate, accurate nonparametric estimation of its density is not feasible at the sample sizes often encountered in applications. In fact, to the best of our knowledge, this new HOIF estimator remains the only semiparametric efficient estimator of

under minimal conditions, despite the rapidly growing literature on causal effect estimation. We also show that our estimator can be generalized to the entire class of functionals considered by Robins et al. (2008) which include the average effect of a treatment on a response Y when a vector X suffices to control confounding and the expected conditional variance of a response Y given a vector X. Simulation experiments are also conducted, which demonstrate that our new estimator outperforms those of Robins et al. (2008, 2017) in finite samples, when g is not very smooth.

Semiparametric Efficient Empirical Higher Order Influence Function Estimators

TL;DR

Abstract

Semiparametric Efficient Empirical Higher Order Influence Function Estimators

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (38)