Score-Preserving Targeted Maximum Likelihood Estimation
Noel Pimentel, Alejandro Schuler, Mark van der Laan
TL;DR
Score-Preserving TMLE (SP-TMLE) extends targeted maximum likelihood estimation by identifying and preserving the scores solved by an initial estimator and updating along a multi-dimensional submodel that spans both the efficient influence function and these scores. This approach aims to reduce the remainder term in the von Mises expansion, improving finite-sample bias and variance, and it is demonstrated via simulations using HAL as the initial fit, showing reduced bias relative to plug-in HAL and reduced variance with respect to vanilla TMLE, along with better standard error estimation and coverage. The method couples TMLE with explicit score equations, offering a practical best-of-both-worlds performance in small samples, while acknowledging reduced generality due to the need for explicit score structures. Overall, SP-TMLE blends the robustness of TMLE with enhanced efficiency from score-solving, providing a promising tool for finite-sample semiparametric estimation, particularly in treatment-specific mean settings.
Abstract
Targeted maximum likelihood estimators (TMLEs) are asymptotically optimal among regular, asymptotically linear estimators. In small samples, however, we may be far from "asymptopia" and not reap the benefits of optimality. Here we propose a variant (score-preserving TMLE; SP-TMLE) that leverages an initial estimator defined as the solution of a large number of possibly data-dependent score equations. Instead of targeting only the efficient influence function in the TMLE update to knock out the plug-in bias, we also target the already-solved scores. Solving additional scores reduces the remainder term in the von-Mises expansion of our estimator because these scores may come close to spanning higher-order influence functions. The result is an estimator with better finite-sample performance. We demonstrate our approach in simulation studies leveraging the (relaxed) highly adaptive lasso (HAL) as our initial estimator. These simulations show that in small samples SP-TMLE has reduced bias relative to plug-in HAL and reduced variance relative to vanilla TMLE, blending the advantages of the two approaches. We also observe improved estimation of standard errors in small samples.
