Cross-Fitting-Free Debiased Machine Learning with Multiway Dependence

Kaicheng Chen; Harold D. Chiang

TL;DR

The paper develops a cross-fitting–free asymptotic theory for two-step debiased machine learning in GMM models with general multiway clustered dependence, enabling valid inference without sample splitting in the presence of arbitrarily many clustering dimensions. It combines Neyman orthogonality with localisation to control first-stage estimation effects, deriving both global and local maximal inequalities for separately exchangeable arrays to establish asymptotic linearity and normality. A central result provides an explicit linear representation and variance formula, with rate conditions that accommodate flexible, high-dimensional nuisance learners (e.g., sparse GLMs, regression trees, and deep networks). The work delivers a practical inference framework for complex clustered environments and contributes new probabilistic tools of independent interest for multiway dependence.

Abstract

This paper develops an asymptotic theory for two-step debiased machine learning (DML) estimators in generalised method of moments (GMM) models with general multiway clustered dependence, without relying on cross-fitting. While cross-fitting is commonly employed, it can be statistically inefficient and computationally burdensome when first-stage learners are complex and the effective sample size is governed by the number of independent clusters. We show that valid inference can be achieved without sample splitting by combining Neyman-orthogonal moment conditions with a localisation-based empirical process approach, allowing for an arbitrary number of clustering dimensions. The resulting DML-GMM estimators are shown to be asymptotically linear and asymptotically normal under multiway clustered dependence. A central technical contribution of the paper is the derivation of novel global and local maximal inequalities for general classes of functions of sums of separately exchangeable arrays, which underpin our theoretical arguments and are of independent interest.
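As a rough illustration of estimating nuisances on the full sample and then doing multiway-clustered inference, the sketch below works through a two-way clustered partially linear model, a special case chosen here only for concreteness. Everything in it is an assumption rather than the paper's construction: the simulated design, the polynomial least-squares learner standing in for a machine learner, the function names, and the Cameron-Gelbach-Miller-style two-way variance formula, which may differ from the paper's variance estimator.

```python
import numpy as np

def poly_fit_predict(x, y, degree=4):
    """Hypothetical stand-in for a first-stage learner:
    least-squares polynomial fit, returning in-sample predictions."""
    basis = np.vander(x, degree + 1)  # columns x^degree, ..., x, 1
    coef, *_ = np.linalg.lstsq(basis, y, rcond=None)
    return basis @ coef

def dml_plr_no_crossfit(Y, D, X):
    """Partially linear model Y = theta*D + g(X) + eps on an (N, M) array.
    Nuisances are fitted on the FULL sample (no sample splitting)."""
    N, M = Y.shape
    y, d, x = Y.ravel(), D.ravel(), X.ravel()
    res_y = y - poly_fit_predict(x, y)  # Y - l_hat(X), with l(X) = E[Y|X]
    res_d = d - poly_fit_predict(x, d)  # D - m_hat(X), with m(X) = E[D|X]
    # Solve the Neyman-orthogonal (partialling-out) moment for theta.
    theta = (res_d @ res_y) / (res_d @ res_d)
    psi = ((res_y - theta * res_d) * res_d).reshape(N, M)  # estimated scores
    jac = np.mean(res_d ** 2)  # sample Jacobian of the moment (up to sign)
    # Two-way clustered "meat": row-cluster sums + column-cluster sums
    # minus the double-counted cell terms (an assumed CGM-type formula).
    row_sums, col_sums = psi.sum(axis=1), psi.sum(axis=0)
    meat = row_sums @ row_sums + col_sums @ col_sums - np.sum(psi ** 2)
    se = np.sqrt(max(meat, 0.0)) / (N * M * jac)
    return theta, se

def simulate(N=40, M=40, theta=1.0, seed=0):
    """Two-way array built from row, column, and cell-level shocks,
    so observations sharing a row or a column are dependent."""
    rng = np.random.default_rng(seed)
    row = lambda: rng.normal(size=(N, 1))
    col = lambda: rng.normal(size=(1, M))
    cell = lambda: rng.normal(size=(N, M))
    X = np.tanh(row() + col() + cell())
    D = X ** 2 + row() + col() + cell()
    Y = theta * D + np.sin(2 * X) + row() + col() + cell()
    return Y, D, X

if __name__ == "__main__":
    theta_hat, se = dml_plr_no_crossfit(*simulate())
    print(f"theta_hat = {theta_hat:.3f}, two-way clustered se = {se:.3f}")
```

In this toy design the learner is a low-dimensional polynomial, so the complexity restrictions that make the full-sample fit harmless hold trivially; the rate and complexity conditions described in the abstract are what would need to be checked for richer learners.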

Paper Structure (16 sections, 13 theorems, 111 equations)

Introduction
Setups and Notations
Debiased Machine Learning for GMM
Main results for the DML GMM estimator
Verification of complexity and rate conditions: three examples
Variance estimation
Maximum Inequalities under separate exchangeability
Conclusion
Proofs of the main results
Proof of Theorem \ref{thm:gmm}
Proof of Theorem \ref{eg:vctype}
Proof of Theorem \ref{thm:var}
Proof of Theorem \ref{theorem:global_maximal_ineq}
Proof of Theorem \ref{theorem:local_max_ineq}
Proof of Corollary \ref{cor:local_maximal_ineq_VC}
...and 1 more sections

Key Result

Theorem 1

Let $\widehat{\theta}$ be a solution to \eqref{foc}. Suppose Assumption \ref{assu_gmm_reg} holds, and that $\widehat{\theta}\overset{p}{\to} \theta_0$ and $\widehat{\Upsilon}\overset{p}{\to} \Upsilon$ for some positive-definite limit $\Upsilon$ as $n\to\infty$. For each $n\in \mathbb N$, let $F_n$ be an envelope for ... Then (i) the following linear representation holds: ... (ii) If, additionally, (a) it holds that ...

Theorems & Definitions (22)

Remark 1: On $B$ in Assumption \ref{assu_gmm_reg}(iii)
Theorem 1: Asymptotic linearity and normality
Remark 2: Choosing the envelopes $F_n$
Remark 3: Intuition behind localisation approach
Remark 4: Alternative asymptotics for semiparametric estimation
Remark 5: Multiway-clustering stability
Theorem 2: Complexity/rate for machine learners
Theorem 3: Consistent variance estimation
Theorem 4: Global maximal inequality for SE processes
Theorem 5: Local maximal inequality for SE processes
...and 12 more
