Causal Identification in Multi-Task Demand Learning with Confounding

Varun Gupta; Vijay Kamble

Causal Identification in Multi-Task Demand Learning with Confounding

Varun Gupta, Vijay Kamble

TL;DR

This paper tackles causal identification in multi-task demand learning when historical prices are endogenously set and potentially correlated with unobserved demand determinants. It shows that standard pooled or meta-learning approaches can converge to policy-dependent estimands and may fail to identify true causal price effects. The authors propose Decision-Conditioned Masked-Outcome Meta-Learning (DCMOML), an information-design framework that conditions on the price path to absorb latent confounding while masking two candidate query outcomes to prevent identifiability collapse; under a mild exogeneity condition on the final two-price block, DCMOML identifies the conditional mean of task-specific causal parameters and is statistically consistent. Empirically, DCMOML outperforms baselines in synthetic experiments with varying confounding and on real UK online-retail data, showing practical gains in large-scale pricing settings where randomized experiments are infeasible. The work bridges demand estimation under endogeneity with cross-task transfer and provides a scalable, identification-based approach for causal pricing in operational environments.

Abstract

We study a canonical multi-task demand learning problem motivated by retail pricing, in which a firm seeks to estimate heterogeneous linear price-response functions across a large collection of decision contexts. Each context is characterized by rich observable covariates yet typically exhibits only limited historical price variation, motivating the use of multi-task learning to borrow strength across tasks. A central challenge in this setting is endogeneity: historical prices are chosen by managers or algorithms and may be arbitrarily correlated with unobserved, task-level demand determinants. Under such confounding by latent fundamentals, commonly used approaches, such as pooled regression and meta-learning, fail to identify causal price effects. We propose a new estimation framework that achieves causal identification despite arbitrary dependence between prices and latent task structure. Our approach, Decision-Conditioned Masked-Outcome Meta-Learning (DCMOML), involves carefully designing the information set of a meta-learner to leverage cross-task heterogeneity while accounting for endogenous decision histories. Under a mild restriction on price adaptivity in each task, we establish that this method identifies the conditional mean of the task-specific causal parameters given the designed information set. Our results provide guarantees for large-scale demand estimation with endogenous prices and small per-task samples, offering a principled foundation for deploying causal, data-driven pricing models in operational environments.

Causal Identification in Multi-Task Demand Learning with Confounding

TL;DR

Abstract

Paper Structure (47 sections, 2 theorems, 90 equations, 5 figures, 1 table)

This paper contains 47 sections, 2 theorems, 90 equations, 5 figures, 1 table.

Introduction
Literature
Demand estimation and price endogeneity in empirical IO and marketing.
Learning and optimization in data-scarce pricing environments.
Partial pooling, Empirical Bayes, and hierarchical modeling.
Multi-task learning and meta-learning under endogenous decisions.
Causal machine learning and heterogeneous treatment effects.
Model
Tasks, Data, and Notation
Structural Demand Model
Heterogeneity and Shared Structure Across Tasks
Price Assignment and Confounding
Causal Estimand and Objective
Baseline Approaches and Challenges
Learning the Shared Model
...and 32 more sections

Key Result

Theorem 1

Consider the model of Section sec:model and suppose that Assumption ass:eps_cond_mean0 holds. Consider the two-point query design $k_i^1\sim\mathrm{Unif}\{K_i^*,K\}$ and masked-outcome information sets $X_i^{-(K_i^*,K)}$ defined in eq:info_drop2_twopoint. Consider the population risk over measurable where the expectation is taken over the joint law of $(Z_i,p_{i1:K},D_{i1:K},k_i^1)$ induced by the

Figures (5)

Figure 1: Illustration of confounding in the multi-task pricing setting ($N=2000$, $K=2$). Left: True (causal) demand curves all have negative slopes. Right: Pooled data exhibit a positive price--demand relationship due to optimal pricing responding to store-specific intercepts.
Figure 2: Estimation performance of the simple outcome-based meta-estimator $\hat{\theta}_{ij} = a_j p_{i1} + b_j D_{i1} + c_j$ for $j\in\{0,1\}$ with $K=2$. The left panel shows slope estimates while the right panel shows intercept estimates.
Figure 3: Estimation performance of the DCMOML meta-learner $\hat{\theta}_{ij} = a_j (p_{i1} + p_{i2}) + c_j$ for $j\in\{0,1\}$ with $K=2$. The estimator well approximates the demand parameters across all tasks.
Figure 4: Estimation error across confounding levels for $K=2$. Error bars show $\pm 1$ SE.
Figure 5: Held-out RMSE with 95% confidence intervals. DCMOML is highlighted in red.

Theorems & Definitions (11)

Remark 1: Relation to Double Machine Learning and the R-Learner
Example 1: Shared-Model Estimation Can Fail under Near-Optimal Local Pricing behavior
Example 2: Failure of Meta-Learning
Remark 2
Definition 1: Decision-Conditioned Masked-Outcome Meta-Learning (DCMOML)
Remark 3
Theorem 1: Identifiability
Theorem 2: Consistency of DCMOML under realizability
Example 3: Continuation of Example \ref{['ex:sign_flip']}
proof : Proof of Theorem \ref{['thm:identify']}
...and 1 more

Causal Identification in Multi-Task Demand Learning with Confounding

TL;DR

Abstract

Causal Identification in Multi-Task Demand Learning with Confounding

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (11)