Outcome-Aware Spectral Feature Learning for Instrumental Variable Regression
Dimitri Meunier, Jakub Wornbard, Vladimir R. Kostic, Antoine Moulin, Alek Fröhlich, Karim Lounici, Massimiliano Pontil, Arthur Gretton
TL;DR
The paper tackles causal effect estimation in NPIV under hidden confounding by noting that conventional spectral-feature learning is outcome-agnostic and can misalign with the true causal function. It introduces Augmented Spectral Feature Learning, which augments the operator to include outcome information via a rank-1 perturbation $\mathcal{T}_{\delta}$ and a contrastive loss, yielding task-specific spectral features. The authors provide non-asymptotic, high-probability guarantees for the resulting 2SLS estimator and demonstrate robustness to spectral misalignment through synthetic data and challenging dSprites benchmarks, as well as an Off-Policy Evaluation case in reinforcement learning. The results show that a small positive augmentation parameter $\delta$ often improves performance and broadens the applicability of spectral NPIV methods beyond well-aligned settings, with discussions on higher-rank extensions and practical delta-selection strategies.
Abstract
We address the problem of causal effect estimation in the presence of hidden confounders using nonparametric instrumental variable (IV) regression. An established approach is to use estimators based on learned spectral features, that is, features spanning the top singular subspaces of the operator linking treatments to instruments. While powerful, such features are agnostic to the outcome variable. Consequently, the method can fail when the true causal function is poorly represented by these dominant singular functions. To mitigate, we introduce Augmented Spectral Feature Learning, a framework that makes the feature learning process outcome-aware. Our method learns features by minimizing a novel contrastive loss derived from an augmented operator that incorporates information from the outcome. By learning these task-specific features, our approach remains effective even under spectral misalignment. We provide a theoretical analysis of this framework and validate our approach on challenging benchmarks.
