Table of Contents
Fetching ...

Synthetic Combinations: A Causal Inference Framework for Combinatorial Interventions

Abhineet Agarwal, Anish Agarwal, Suhas Vijaykumar

TL;DR

This work introduces Synthetic Combinations, a causal-inference framework for combinatorial interventions that jointly leverages across-unit low-rank structure and across-combination sparsity in a Fourier representation. Potentials are modeled as $Y_n^{(oldsymbol{ au})} = \langle \boldsymbol{\alpha}_n, \boldsymbol{\chi}^{\boldsymbol{\tau}} \rangle + \epsilon_n^{\boldsymbol{\tau}}$, with the Fourier matrix $\mathcal{A}$ of rank $r$ and unit-specific $s$-sparse coefficients, enabling identification under unobserved confounding via a donor-set mechanism. The two-step Synthetic Combinations estimator uses horizontal regression (Lasso or CART) to learn donor outcomes and vertical regression (PCR) to transport these estimates to non-donor units, with finite-sample consistency and asymptotic normality established under precise sampling and incoherence conditions. An experimental-design mechanism guarantees the key assumptions hold with high probability, yielding a data-efficient path to estimating all $N \times 2^p$ potential outcomes and enabling ranking-based extensions. Empirically, the method outperforms baselines on movie-rating data and synthetic simulations, highlighting its potential for factorial designs, recommender systems, and basket therapies where many interventions interact in diverse ways.

Abstract

Consider a setting where there are $N$ heterogeneous units and $p$ interventions. Our goal is to learn unit-specific potential outcomes for any combination of these $p$ interventions, i.e., $N \times 2^p$ causal parameters. Choosing a combination of interventions is a problem that naturally arises in a variety of applications such as factorial design experiments, recommendation engines, combination therapies in medicine, conjoint analysis, etc. Running $N \times 2^p$ experiments to estimate the various parameters is likely expensive and/or infeasible as $N$ and $p$ grow. Further, with observational data there is likely confounding, i.e., whether or not a unit is seen under a combination is correlated with its potential outcome under that combination. To address these challenges, we propose a novel latent factor model that imposes structure across units (i.e., the matrix of potential outcomes is approximately rank $r$), and combinations of interventions (i.e., the coefficients in the Fourier expansion of the potential outcomes is approximately $s$ sparse). We establish identification for all $N \times 2^p$ parameters despite unobserved confounding. We propose an estimation procedure, Synthetic Combinations, and establish it is finite-sample consistent and asymptotically normal under precise conditions on the observation pattern. Our results imply consistent estimation given $\text{poly}(r) \times \left( N + s^2p\right)$ observations, while previous methods have sample complexity scaling as $\min(N \times s^2p, \ \ \text{poly(r)} \times (N + 2^p))$. We use Synthetic Combinations to propose a data-efficient experimental design. Empirically, Synthetic Combinations outperforms competing approaches on a real-world dataset on movie recommendations. Lastly, we extend our analysis to do causal inference where the intervention is a permutation over $p$ items (e.g., rankings).

Synthetic Combinations: A Causal Inference Framework for Combinatorial Interventions

TL;DR

This work introduces Synthetic Combinations, a causal-inference framework for combinatorial interventions that jointly leverages across-unit low-rank structure and across-combination sparsity in a Fourier representation. Potentials are modeled as , with the Fourier matrix of rank and unit-specific -sparse coefficients, enabling identification under unobserved confounding via a donor-set mechanism. The two-step Synthetic Combinations estimator uses horizontal regression (Lasso or CART) to learn donor outcomes and vertical regression (PCR) to transport these estimates to non-donor units, with finite-sample consistency and asymptotic normality established under precise sampling and incoherence conditions. An experimental-design mechanism guarantees the key assumptions hold with high probability, yielding a data-efficient path to estimating all potential outcomes and enabling ranking-based extensions. Empirically, the method outperforms baselines on movie-rating data and synthetic simulations, highlighting its potential for factorial designs, recommender systems, and basket therapies where many interventions interact in diverse ways.

Abstract

Consider a setting where there are heterogeneous units and interventions. Our goal is to learn unit-specific potential outcomes for any combination of these interventions, i.e., causal parameters. Choosing a combination of interventions is a problem that naturally arises in a variety of applications such as factorial design experiments, recommendation engines, combination therapies in medicine, conjoint analysis, etc. Running experiments to estimate the various parameters is likely expensive and/or infeasible as and grow. Further, with observational data there is likely confounding, i.e., whether or not a unit is seen under a combination is correlated with its potential outcome under that combination. To address these challenges, we propose a novel latent factor model that imposes structure across units (i.e., the matrix of potential outcomes is approximately rank ), and combinations of interventions (i.e., the coefficients in the Fourier expansion of the potential outcomes is approximately sparse). We establish identification for all parameters despite unobserved confounding. We propose an estimation procedure, Synthetic Combinations, and establish it is finite-sample consistent and asymptotically normal under precise conditions on the observation pattern. Our results imply consistent estimation given observations, while previous methods have sample complexity scaling as . We use Synthetic Combinations to propose a data-efficient experimental design. Empirically, Synthetic Combinations outperforms competing approaches on a real-world dataset on movie recommendations. Lastly, we extend our analysis to do causal inference where the intervention is a permutation over items (e.g., rankings).
Paper Structure (73 sections, 39 theorems, 167 equations, 6 figures, 4 tables)

This paper contains 73 sections, 39 theorems, 167 equations, 6 figures, 4 tables.

Key Result

Proposition 4.2

Let Assumptions ass:observation_model and Assumptions ass:selection_on_fourier hold. Moreover, suppose that for some $c > 0$, there are at least $cN/r$ units of each type. Then, under the proposed sampling scheme described above, Assumption ass:donor_set_identification is satisfied with probability

Figures (6)

  • Figure 1: A visual description of Synthetic Combinations. Figure \ref{['fig:estimator']}(a) depicts an example of a particular observation pattern with outcome for unit-combination pair $(n,\pi)$ missing. Figure \ref{['fig:estimator']}(b) demonstrates horizontal regression for donor unit $u$ to estimate all potential outcomes $\mathbb{E}[Y_u^{(\Pi)}]$. Figure \ref{['fig:estimator']}(c) displays repeating the horizontal regression for the entire donor set $\mathcal{I}$. Figure \ref{['fig:estimator']}(d) visualizes vertical regression the estimated outcomes from the donor set $\mathcal{I}$ are extrapolated to estimate outcomes for the unit-combination pair $(n,\pi)$.
  • Figure 2: Observation pattern induced by experiment design mechanism.
  • Figure 4: Singular value spectrum of user ratings on sets of movies. Inspecting the singular spectrum shows that the matrix of observed ratings is low-rank.
  • Figure : (a) Observational setting simulations.
  • Figure : (a) Observational setting simulations.
  • ...and 1 more figures

Theorems & Definitions (43)

  • Proposition 4.2
  • Theorem 4.3
  • Theorem 6.6: Finite Sample Consistency of Synthetic Combinations
  • Corollary 6.7
  • Theorem 6.8: Informal
  • Theorem 7.2
  • Corollary 7.3
  • Proposition 8.1
  • Theorem 8.2
  • Theorem B.1: Theorem 2.18 in rigollet2015high
  • ...and 33 more