Table of Contents
Fetching ...

Residual Balancing for Non-Linear Outcome Models in High Dimensions

Isaac Meza

TL;DR

The paper tackles high-dimensional causal inference with nonlinear outcomes by extending Approximate Residual Balancing (ARB) to generalized linear models. It shows that ignoring the link function's curvature induces bias in nonlinear settings and introduces a second-order balancing correction that weights covariates by $\psi'(\cdot)$ and their outer products by $\psi''(\cdot)$. A concrete optimization problem yields balancing weights that ensure $\sqrt{n}$-consistency and asymptotic normality for the ATT, without estimating a propensity score, and cross-fitting provides robust inference. The authors extend the framework to a smooth single-index model (SIM) with unknown link, detailing a three-fold splitting scheme and nonparametric derivative estimation. Overall, the work delivers a principled, practically implementable method for valid inference in high-dimensional nonlinear causal models, broadening the applicability of ARB to more complex outcome structures.

Abstract

We extend the approximate residual balancing (ARB) framework to nonlinear models, answering an open problem posed by Athey et al. (2018). Our approach addresses the challenge of estimating average treatment effects in high-dimensional settings where the outcome follows a generalized linear model. We derive a new bias decomposition for nonlinear models that reveals the need for a second-order correction to account for the curvature of the link function. Based on this insight, we construct balancing weights through an optimization problem that controls for both first and second-order sources of bias. We provide theoretical guarantees for our estimator, establishing its $\sqrt{n}$-consistency and asymptotic normality under standard high-dimensional assumptions.

Residual Balancing for Non-Linear Outcome Models in High Dimensions

TL;DR

The paper tackles high-dimensional causal inference with nonlinear outcomes by extending Approximate Residual Balancing (ARB) to generalized linear models. It shows that ignoring the link function's curvature induces bias in nonlinear settings and introduces a second-order balancing correction that weights covariates by and their outer products by . A concrete optimization problem yields balancing weights that ensure -consistency and asymptotic normality for the ATT, without estimating a propensity score, and cross-fitting provides robust inference. The authors extend the framework to a smooth single-index model (SIM) with unknown link, detailing a three-fold splitting scheme and nonparametric derivative estimation. Overall, the work delivers a principled, practically implementable method for valid inference in high-dimensional nonlinear causal models, broadening the applicability of ARB to more complex outcome structures.

Abstract

We extend the approximate residual balancing (ARB) framework to nonlinear models, answering an open problem posed by Athey et al. (2018). Our approach addresses the challenge of estimating average treatment effects in high-dimensional settings where the outcome follows a generalized linear model. We derive a new bias decomposition for nonlinear models that reveals the need for a second-order correction to account for the curvature of the link function. Based on this insight, we construct balancing weights through an optimization problem that controls for both first and second-order sources of bias. We provide theoretical guarantees for our estimator, establishing its -consistency and asymptotic normality under standard high-dimensional assumptions.

Paper Structure

This paper contains 36 sections, 5 theorems, 87 equations.

Key Result

Proposition 1

Under Assumption assu:linearity, then where $\psi^\prime(X_t\beta_c)\in\mathbb{R}^n$, $W_c(\beta_c) = \operatorname{diag}(\psi^{\prime}(X_c \beta_c))\in\mathbb{R}^{n_c\times n_c}$ , $\varepsilon_{i}=Y_{i}(0)-\psi(X_{i}^\top \beta_c)$ is the intrinsic noise, and $\|X_{(\cdot)}\|_\infty$ refers to the entry-wise maximum absolute value.

Theorems & Definitions (15)

  • Example 2.1
  • Proposition 1
  • Theorem 1
  • Remark 1
  • Theorem 2
  • Example 3.1: GLM/M-estimation negahban_2012vandeGeer
  • Example 3.2: Isotropic/whitened designs plan2015generalizedlassononlinearobservations
  • Theorem 3
  • Remark 2
  • Theorem 4
  • ...and 5 more