A robust assessment for invariant representations
Wenlu Tang, Zicheng Liu
TL;DR
The paper addresses evaluating invariant representations under covariate-shifted environments in invariant risk minimization (IRM). It introduces the Covariate-shift Representation Invariance Criterion (CRIC), defined via a likelihood-ratio weighted invariance condition that compares environment-specific expectations and summarizes this with a normalized statistic Q_Phi. The authors provide an empirical estimator for CRIC, establish consistency results, and validate the approach through synthetic SEM experiments and real financial data, showing IRM-based methods achieve lower CRIC than ERM, with REx-V often outperforming IRMv1 in complex settings. They also discuss integrating CRIC with prediction accuracy as a multi-objective framework, highlighting CRIC as a robust, complementary criterion for evaluating invariant representations and guiding domain-generalization research.
Abstract
The performance of machine learning models can be impacted by changes in data over time. A promising approach to address this challenge is invariant learning, with a particular focus on a method known as invariant risk minimization (IRM). This technique aims to identify a stable data representation that remains effective with out-of-distribution (OOD) data. While numerous studies have developed IRM-based methods adaptive to data augmentation scenarios, there has been limited attention on directly assessing how well these representations preserve their invariant performance under varying conditions. In our paper, we propose a novel method to evaluate invariant performance, specifically tailored for IRM-based methods. We establish a bridge between the conditional expectation of an invariant predictor across different environments through the likelihood ratio. Our proposed criterion offers a robust basis for evaluating invariant performance. We validate our approach with theoretical support and demonstrate its effectiveness through extensive numerical studies.These experiments illustrate how our method can assess the invariant performance of various representation techniques.
