Orthogonal Uplift Learning with Permutation-Invariant Representations for Combinatorial Treatments

Xinyan Su; Jiacan Gao; Mingyuan Ma; Xiao Xu; Xinrui Wan; Tianqi Gu; Enyun Yu; Jiecheng Guo; Zhiheng Zhang

Orthogonal Uplift Learning with Permutation-Invariant Representations for Combinatorial Treatments

Xinyan Su, Jiacan Gao, Mingyuan Ma, Xiao Xu, Xinrui Wan, Tianqi Gu, Enyun Yu, Jiecheng Guo, Zhiheng Zhang

TL;DR

An uplift estimation framework that aligns treatment representation with causal semantics is proposed, extending Robinson-style decompositions to learned, vector-valued treatments and is shown to be expressive for policy-induced causal effects, orthogonally robust to nuisance estimation errors, and stable under small policy perturbations.

Abstract

We study uplift estimation for combinatorial treatments. Uplift measures the pure incremental causal effect of an intervention (e.g., sending a coupon or a marketing message) on user behavior, modeled as a conditional individual treatment effect. Many real-world interventions are combinatorial: a treatment is a policy that specifies context-dependent action distributions rather than a single atomic label. Although recent work considers structured treatments, most methods rely on categorical or opaque encodings, limiting robustness and generalization to rare or newly deployed policies. We propose an uplift estimation framework that aligns treatment representation with causal semantics. Each policy is represented by the mixture it induces over contextaction components and embedded via a permutation-invariant aggregation. This representation is integrated into an orthogonalized low-rank uplift model, extending Robinson-style decompositions to learned, vector-valued treatments. We show that the resulting estimator is expressive for policy-induced causal effects, orthogonally robust to nuisance estimation errors, and stable under small policy perturbations. Experiments on large-scale randomized platform data demonstrate improved uplift accuracy and stability in long-tailed policy regimes

Orthogonal Uplift Learning with Permutation-Invariant Representations for Combinatorial Treatments

TL;DR

Abstract

Paper Structure (46 sections, 6 theorems, 69 equations, 1 figure, 3 tables, 2 algorithms)

This paper contains 46 sections, 6 theorems, 69 equations, 1 figure, 3 tables, 2 algorithms.

Introduction
Problem Setup and Method
Structured Treatment Representation
Treatment-side contexts and atomic actions.
Permutation-invariant treatment embedding.
Implementation of the treatment network.
Orthogonalized Factorized Uplift Model
Model class.
Learning objective.
Computing $\hat{e}_h(x)$.
Training Procedure
Technical Challenges
Theoretical Analysis
Expressiveness of the Permutation-Invariant Policy Embedding
Orthogonal Robustness
...and 31 more sections

Key Result

Proposition 3.2

Under Assumption ass:mixture_suff and consistency $Y=Y(T)$, define the oracle treatment representation $h_0(t):=F(\mu_t)$ and $e_0(x):=\mathbb{E}[h_0(T)\mid X=x]$. Then there exists a function $m(\cdot)$ such that the conditional mean satisfies and the CATE defined in eq:cate_def obeys

Figures (1)

Figure 1: Treatment Net: permutation-invariant embedding of combinatorial treatments. A treatment $T$ is observed through its policy specification: for each context $s\in S$ (instantiated here as $\{s_1, s_2\}$), the treatment induces a distribution over atomic actions $a\in \mathcal{A}$ (instantiated as $\{a_0, a_1, a_2\}$). The network first learns embeddings for contexts and actions to form atom representations $\phi(s,a)$. These atoms are then reweighted by policy probabilities and aggregated via a permutation-invariant sum $z(T)$, ensuring the representation depends only on the induced mixture. Finally, a learnable map produces the treatment embedding $h(T)$, which is used by the orthogonalized uplift model (left panel) to estimate the causal effect $\tau(x)$.

Theorems & Definitions (12)

Proposition 3.2: Well-specified orthogonalized low-rank representation
Lemma 3.3: Expressiveness of permutation-invariant embeddings
Lemma 3.4: Neyman orthogonality w.r.t. nuisance $(m,e)$
Theorem 3.6: Orthogonal robustness: nuisance errors enter at second order
Proposition 3.7: Stability of the embedding and uplift
proof : Proof of Proposition \ref{['prop:olr_wellspec']}
proof : Proof of Lemma \ref{['thm:expressiveness']}
proof : Proof of Lemma \ref{['lem:neyman_orth']}
Lemma 2.1: Second-order control of score perturbation
proof : Proof of Lemma \ref{['lem:score_perturb']}
...and 2 more

Orthogonal Uplift Learning with Permutation-Invariant Representations for Combinatorial Treatments

TL;DR

Abstract

Orthogonal Uplift Learning with Permutation-Invariant Representations for Combinatorial Treatments

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (12)