Perturbed Double Machine Learning: Nonstandard Inference Beyond the Parametric Length

Mengchu Zheng; Matteo Bonvini; Zijian Guo

Perturbed Double Machine Learning: Nonstandard Inference Beyond the Parametric Length

Mengchu Zheng, Matteo Bonvini, Zijian Guo

TL;DR

The proposal is to inject randomness into the nuisance estimation step to generate perturbed nuisance models, each yielding an estimate of $\beta$ and a Wald interval, and to filter out perturbations whose deviations from the original DML estimate exceed a threshold.

Abstract

We study inference on a low-dimensional functional $β$ in the presence of infinite-dimensional nuisance parameters. Classical inferential methods are typically based on Wald intervals, whose large-sample validity rests on asymptotic negligibility of nuisance error; for example, influence-curve based estimators (Double/Debiased Machine Learning, DML) are asymptotically Gaussian when nuisance estimators converge faster than $n^{-1/4}$. Although such negligibility can hold even in nonparametric classes, it can be restrictive. To relax this requirement, we propose Perturbed Double Machine Learning, which ensures valid inference even when nuisance estimators converge slower than $n^{-1/4}$. Our proposal is to (i) inject randomness into the nuisance estimation step to generate perturbed nuisance models, each yielding an estimate of $β$ and a Wald interval, and (ii) filter out perturbations whose deviations from the original DML estimate exceed a threshold. For Lasso nuisance learners, we show that, with high probability, at least one perturbation yields nuisance estimates sufficiently close to the truth, so the associated estimator of $β$ is close to an oracle with known nuisances. The union of retained intervals delivers valid coverage even when the DML estimator converges slower than $n^{-1/2}$. The framework extends to general machine-learning nuisance learners, and simulations show coverage when state-of-the-art methods fail.

Perturbed Double Machine Learning: Nonstandard Inference Beyond the Parametric Length

TL;DR

The proposal is to inject randomness into the nuisance estimation step to generate perturbed nuisance models, each yielding an estimate of

and a Wald interval, and to filter out perturbations whose deviations from the original DML estimate exceed a threshold.

Abstract

We study inference on a low-dimensional functional

in the presence of infinite-dimensional nuisance parameters. Classical inferential methods are typically based on Wald intervals, whose large-sample validity rests on asymptotic negligibility of nuisance error; for example, influence-curve based estimators (Double/Debiased Machine Learning, DML) are asymptotically Gaussian when nuisance estimators converge faster than

. Although such negligibility can hold even in nonparametric classes, it can be restrictive. To relax this requirement, we propose Perturbed Double Machine Learning, which ensures valid inference even when nuisance estimators converge slower than

. Our proposal is to (i) inject randomness into the nuisance estimation step to generate perturbed nuisance models, each yielding an estimate of

and a Wald interval, and (ii) filter out perturbations whose deviations from the original DML estimate exceed a threshold. For Lasso nuisance learners, we show that, with high probability, at least one perturbation yields nuisance estimates sufficiently close to the truth, so the associated estimator of

is close to an oracle with known nuisances. The union of retained intervals delivers valid coverage even when the DML estimator converges slower than

. The framework extends to general machine-learning nuisance learners, and simulations show coverage when state-of-the-art methods fail.

Perturbed Double Machine Learning: Nonstandard Inference Beyond the Parametric Length

TL;DR

Abstract

Perturbed Double Machine Learning: Nonstandard Inference Beyond the Parametric Length

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (16)

Theorems & Definitions (21)