Fundamental Computational Limits in Pursuing Invariant Causal Prediction and Invariance-Guided Regularization
Yihong Gu, Cong Fang, Yang Xu, Zijian Guo, Jianqing Fan
TL;DR
This work tackles the fundamental computational limits of learning invariant causal predictors across heterogeneous environments, revealing that the decision problem ExistsLIS-2 is NP-hard even for the linear model with two environments, implying that no polynomial-time method can achieve fast estimation rates in the worst case if $\mathsf{P}\neq\mathsf{NP}$. To address this, the authors introduce invariance-guided regularization (IGR), a simple, computation-budgeted estimator with an ellipse-shaped uncertainty set that downweights spurious directions and yields a smooth interpolation between the most predictive solution and the causal solution via a tunable invariance parameter $\gamma$. They establish a distributionally robust interpretation of the method, provide non-asymptotic analysis, and demonstrate empirical efficacy on real data (stock returns and climate dynamics), showing improved worst-case robustness across environments. Overall, the paper clarifies a fundamental trade-off between computational tractability and causal identifiability in invariance pursuit and offers a practical, theoretically-grounded approach when mild structure is available.
Abstract
Pursuing invariant prediction from heterogeneous environments opens the door to learning causality in a purely data-driven way and has several applications in causal discovery and robust transfer learning. However, existing methods such as ICP [Peters et al., 2016] and EILLS [Fan et al., 2024] that can attain sample-efficient estimation are based on exponential time algorithms. In this paper, we show that such a problem is intrinsically hard in computation: the decision problem, testing whether a non-trivial prediction-invariant solution exists across two environments, is NP-hard even for the linear causal relationship. In the world where P$\neq$NP, our results imply that the estimation error rate can be arbitrarily slow using any computationally efficient algorithm. This suggests that pursuing causality is fundamentally harder than detecting associations when no prior assumption is pre-offered. Given there is almost no hope of computational improvement under the worst case, this paper proposes a method capable of attaining both computationally and statistically efficient estimation under additional conditions. Furthermore, our estimator is a distributionally robust estimator with an ellipse-shaped uncertain set where more uncertainty is placed on spurious directions than invariant directions, resulting in a smooth interpolation between the most predictive solution and the causal solution by varying the invariance hyper-parameter. Non-asymptotic results and empirical applications support the claim.
