Table of Contents
Fetching ...

Improved Inference for CSDID Using the Cluster Jackknife

Sunny R. Karim, Morten Ørregaard Nielsen, James G. MacKinnon, Matthew D. Webb

TL;DR

This paper addresses unreliable cluster-robust inference for modern difference-in-differences, focusing on the CSDID estimator under staggered adoption. It introduces cluster jackknife CV$_3$ standard errors as a practical and robust alternative to RIF-based and multiplier bootstrap approaches, demonstrating via Monte Carlo simulations and empirical examples that it markedly improves finite-sample inference when cluster counts are small or treated clusters are few. The authors provide open-source implementations in Stata (csdidjack) and R (didjack) to facilitate adoption. Overall, the work offers a concrete fix to inferential shortcomings in modern DiD methods, with clear guidance on when and how to use cluster jackknife in applied settings.

Abstract

Obtaining reliable inferences with traditional difference-in-differences (DiD) methods can be difficult. Problems can arise when both outcomes and errors are serially correlated, when there are few clusters or few treated clusters, when cluster sizes vary greatly, and in various other cases. In recent years, recognition of the ``staggered adoption'' problem has shifted the focus away from inference towards consistent estimation of treatment effects. One of the most popular new estimators is the CSDID procedure of Callaway and Sant'Anna (2021). We find that the issues of over-rejection with few clusters and/or few treated clusters are at least as severe for CSDID as for traditional DiD methods. We also propose using a cluster jackknife for inference with CSDID, which simulations suggest greatly improves inference. We provide software packages in Stata csdidjack and R didjack to calculate cluster-jackknife standard errors easily.

Improved Inference for CSDID Using the Cluster Jackknife

TL;DR

This paper addresses unreliable cluster-robust inference for modern difference-in-differences, focusing on the CSDID estimator under staggered adoption. It introduces cluster jackknife CV standard errors as a practical and robust alternative to RIF-based and multiplier bootstrap approaches, demonstrating via Monte Carlo simulations and empirical examples that it markedly improves finite-sample inference when cluster counts are small or treated clusters are few. The authors provide open-source implementations in Stata (csdidjack) and R (didjack) to facilitate adoption. Overall, the work offers a concrete fix to inferential shortcomings in modern DiD methods, with clear guidance on when and how to use cluster jackknife in applied settings.

Abstract

Obtaining reliable inferences with traditional difference-in-differences (DiD) methods can be difficult. Problems can arise when both outcomes and errors are serially correlated, when there are few clusters or few treated clusters, when cluster sizes vary greatly, and in various other cases. In recent years, recognition of the ``staggered adoption'' problem has shifted the focus away from inference towards consistent estimation of treatment effects. One of the most popular new estimators is the CSDID procedure of Callaway and Sant'Anna (2021). We find that the issues of over-rejection with few clusters and/or few treated clusters are at least as severe for CSDID as for traditional DiD methods. We also propose using a cluster jackknife for inference with CSDID, which simulations suggest greatly improves inference. We provide software packages in Stata csdidjack and R didjack to calculate cluster-jackknife standard errors easily.
Paper Structure (17 sections, 25 equations, 1 figure, 5 tables)

This paper contains 17 sections, 25 equations, 1 figure, 5 tables.

Figures (1)

  • Figure 1: ATT$^{(h)}$ vs. cluster size for minimum wage example

Theorems & Definitions (4)

  • Example 1: Staggered Adoption, $3\times3$, Single Never-Treated Control Region
  • Example 2: Staggered Adoption, $3\times3$, Single Never-Treated Control Region, CSDID
  • Example 3: Staggered Adoption, $6\times3$, Two Never-Treated Control Regions
  • Example 4: Staggered Adoption, $3\times4$, All Regions Treated