Causal Inference for Experiments with Latent Outcomes: Key Results and Their Implications for Design and Analysis
Jiawei Fu, Donald P. Green
TL;DR
The paper tackles estimating causal effects when outcomes are latent and measured with error using multiple proxies. It introduces a design-based framework where latent outcomes are linked to observed measures via scaling parameters identified with instrumental variables, addressing study-specific noncomparability by fixing a reference scale. It develops two estimation paths—the optimally weighted scaled index (WSI) and structural equation modeling (SEM)—demonstrating that, with proper scaling, the ALTE can be efficiently and robustly estimated and compared across studies. The analysis provides practical guidance on when to collect more outcome measures versus more subjects, and emphasizes robustness checks, nonparametric extensions, and measurement-equivalence tests. An empirical application shows substantial gains in precision and robustness when using multiple latent measures, supporting broader adoption of design-based latent-outcome inference in experimental settings.
Abstract
How should researchers analyze randomized experiments in which the main outcome is latent and measured in multiple ways but each measure contains some degree of error? We first identify a critical study-specific noncomparability problem in existing methods for handling multiple measurements, which often rely on strong modeling assumptions or arbitrary standardization. Such approaches render the resulting estimands noncomparable across studies. To address the problem, we describe design-based approaches that enable researchers to identify causal parameters of interest, suggest ways that experimental designs can be augmented so as to make assumptions more credible, and discuss empirical tests of key assumptions. We show that when experimental researchers invest appropriately in multiple outcome measures, an optimally weighted scaled index of these measures enables researchers to obtain efficient and interpretable estimates of causal parameters by applying standard regression. An empirical application illustrates the gains in precision and robustness that multiple outcome measures can provide.
