Table of Contents
Fetching ...

Cross-Fitting-Free Debiased Machine Learning with Multiway Dependence

Kaicheng Chen, Harold D. Chiang

TL;DR

The paper develops a cross-fitting–free asymptotic theory for two-step debiased machine learning in GMM models with general multiway clustered dependence, enabling valid inference without sample splitting in the presence of arbitrarily many clustering dimensions. It combines Neyman orthogonality with localisation to control first-stage estimation effects, deriving both global and local maximal inequalities for separately exchangeable arrays to establish asymptotic linearity and normality. A central result provides an explicit linear representation and variance formula, with rate conditions that accommodate flexible, high-dimensional nuisance learners (e.g., sparse GLMs, regression trees, and deep networks). The work delivers a practical inference framework for complex clustered environments and contributes new probabilistic tools of independent interest for multiway dependence.

Abstract

This paper develops an asymptotic theory for two-step debiased machine learning (DML) estimators in generalised method of moments (GMM) models with general multiway clustered dependence, without relying on cross-fitting. While cross-fitting is commonly employed, it can be statistically inefficient and computationally burdensome when first-stage learners are complex and the effective sample size is governed by the number of independent clusters. We show that valid inference can be achieved without sample splitting by combining Neyman-orthogonal moment conditions with a localisation-based empirical process approach, allowing for an arbitrary number of clustering dimensions. The resulting DML-GMM estimators are shown to be asymptotically linear and asymptotically normal under multiway clustered dependence. A central technical contribution of the paper is the derivation of novel global and local maximal inequalities for general classes of functions of sums of separately exchangeable arrays, which underpin our theoretical arguments and are of independent interest.

Cross-Fitting-Free Debiased Machine Learning with Multiway Dependence

TL;DR

The paper develops a cross-fitting–free asymptotic theory for two-step debiased machine learning in GMM models with general multiway clustered dependence, enabling valid inference without sample splitting in the presence of arbitrarily many clustering dimensions. It combines Neyman orthogonality with localisation to control first-stage estimation effects, deriving both global and local maximal inequalities for separately exchangeable arrays to establish asymptotic linearity and normality. A central result provides an explicit linear representation and variance formula, with rate conditions that accommodate flexible, high-dimensional nuisance learners (e.g., sparse GLMs, regression trees, and deep networks). The work delivers a practical inference framework for complex clustered environments and contributes new probabilistic tools of independent interest for multiway dependence.

Abstract

This paper develops an asymptotic theory for two-step debiased machine learning (DML) estimators in generalised method of moments (GMM) models with general multiway clustered dependence, without relying on cross-fitting. While cross-fitting is commonly employed, it can be statistically inefficient and computationally burdensome when first-stage learners are complex and the effective sample size is governed by the number of independent clusters. We show that valid inference can be achieved without sample splitting by combining Neyman-orthogonal moment conditions with a localisation-based empirical process approach, allowing for an arbitrary number of clustering dimensions. The resulting DML-GMM estimators are shown to be asymptotically linear and asymptotically normal under multiway clustered dependence. A central technical contribution of the paper is the derivation of novel global and local maximal inequalities for general classes of functions of sums of separately exchangeable arrays, which underpin our theoretical arguments and are of independent interest.
Paper Structure (16 sections, 13 theorems, 111 equations)

This paper contains 16 sections, 13 theorems, 111 equations.

Key Result

Theorem 1

Let $\widehat{\theta}$ be a solution to $(foc)$. Suppose Assumption assu_gmm_reg holds, and $\widehat{\theta}\overset{p}{\to} \theta_0$, $\widehat{\Upsilon}\overset{p}{\to} \Upsilon$ for some positive-definite limit $\Upsilon$ as $n\to\infty$. For each $n\in \mathbb N$, let $F_n$ be an envelope for then we have (i) the following linear representation holds (ii) If, additionally, (a) it holds tha

Theorems & Definitions (22)

  • Remark 1: On $B$ in Assumption \ref{['assu_gmm_reg']}(iii)
  • Theorem 1: Asymptotic linearity and normality
  • Remark 2: Choosing the envelopes $F_n$
  • Remark 3: Intuition behind localisation approach
  • Remark 4: Alternative asymptotics for semiparametric estimation
  • Remark 5: Multiway-clustering stability
  • Theorem 2: Complexity/rate for machine learners
  • Theorem 3: Consistent variance estimation
  • Theorem 4: Global maximal inequality for SE processes
  • Theorem 5: Local maximal inequality for SE processes
  • ...and 12 more