Table of Contents
Fetching ...

Causal Identification in Multi-Task Demand Learning with Confounding

Varun Gupta, Vijay Kamble

TL;DR

This paper tackles causal identification in multi-task demand learning when historical prices are endogenously set and potentially correlated with unobserved demand determinants. It shows that standard pooled or meta-learning approaches can converge to policy-dependent estimands and may fail to identify true causal price effects. The authors propose Decision-Conditioned Masked-Outcome Meta-Learning (DCMOML), an information-design framework that conditions on the price path to absorb latent confounding while masking two candidate query outcomes to prevent identifiability collapse; under a mild exogeneity condition on the final two-price block, DCMOML identifies the conditional mean of task-specific causal parameters and is statistically consistent. Empirically, DCMOML outperforms baselines in synthetic experiments with varying confounding and on real UK online-retail data, showing practical gains in large-scale pricing settings where randomized experiments are infeasible. The work bridges demand estimation under endogeneity with cross-task transfer and provides a scalable, identification-based approach for causal pricing in operational environments.

Abstract

We study a canonical multi-task demand learning problem motivated by retail pricing, in which a firm seeks to estimate heterogeneous linear price-response functions across a large collection of decision contexts. Each context is characterized by rich observable covariates yet typically exhibits only limited historical price variation, motivating the use of multi-task learning to borrow strength across tasks. A central challenge in this setting is endogeneity: historical prices are chosen by managers or algorithms and may be arbitrarily correlated with unobserved, task-level demand determinants. Under such confounding by latent fundamentals, commonly used approaches, such as pooled regression and meta-learning, fail to identify causal price effects. We propose a new estimation framework that achieves causal identification despite arbitrary dependence between prices and latent task structure. Our approach, Decision-Conditioned Masked-Outcome Meta-Learning (DCMOML), involves carefully designing the information set of a meta-learner to leverage cross-task heterogeneity while accounting for endogenous decision histories. Under a mild restriction on price adaptivity in each task, we establish that this method identifies the conditional mean of the task-specific causal parameters given the designed information set. Our results provide guarantees for large-scale demand estimation with endogenous prices and small per-task samples, offering a principled foundation for deploying causal, data-driven pricing models in operational environments.

Causal Identification in Multi-Task Demand Learning with Confounding

TL;DR

This paper tackles causal identification in multi-task demand learning when historical prices are endogenously set and potentially correlated with unobserved demand determinants. It shows that standard pooled or meta-learning approaches can converge to policy-dependent estimands and may fail to identify true causal price effects. The authors propose Decision-Conditioned Masked-Outcome Meta-Learning (DCMOML), an information-design framework that conditions on the price path to absorb latent confounding while masking two candidate query outcomes to prevent identifiability collapse; under a mild exogeneity condition on the final two-price block, DCMOML identifies the conditional mean of task-specific causal parameters and is statistically consistent. Empirically, DCMOML outperforms baselines in synthetic experiments with varying confounding and on real UK online-retail data, showing practical gains in large-scale pricing settings where randomized experiments are infeasible. The work bridges demand estimation under endogeneity with cross-task transfer and provides a scalable, identification-based approach for causal pricing in operational environments.

Abstract

We study a canonical multi-task demand learning problem motivated by retail pricing, in which a firm seeks to estimate heterogeneous linear price-response functions across a large collection of decision contexts. Each context is characterized by rich observable covariates yet typically exhibits only limited historical price variation, motivating the use of multi-task learning to borrow strength across tasks. A central challenge in this setting is endogeneity: historical prices are chosen by managers or algorithms and may be arbitrarily correlated with unobserved, task-level demand determinants. Under such confounding by latent fundamentals, commonly used approaches, such as pooled regression and meta-learning, fail to identify causal price effects. We propose a new estimation framework that achieves causal identification despite arbitrary dependence between prices and latent task structure. Our approach, Decision-Conditioned Masked-Outcome Meta-Learning (DCMOML), involves carefully designing the information set of a meta-learner to leverage cross-task heterogeneity while accounting for endogenous decision histories. Under a mild restriction on price adaptivity in each task, we establish that this method identifies the conditional mean of the task-specific causal parameters given the designed information set. Our results provide guarantees for large-scale demand estimation with endogenous prices and small per-task samples, offering a principled foundation for deploying causal, data-driven pricing models in operational environments.
Paper Structure (47 sections, 2 theorems, 90 equations, 5 figures, 1 table)

This paper contains 47 sections, 2 theorems, 90 equations, 5 figures, 1 table.

Key Result

Theorem 1

Consider the model of Section sec:model and suppose that Assumption ass:eps_cond_mean0 holds. Consider the two-point query design $k_i^1\sim\mathrm{Unif}\{K_i^*,K\}$ and masked-outcome information sets $X_i^{-(K_i^*,K)}$ defined in eq:info_drop2_twopoint. Consider the population risk over measurable where the expectation is taken over the joint law of $(Z_i,p_{i1:K},D_{i1:K},k_i^1)$ induced by the

Figures (5)

  • Figure 1: Illustration of confounding in the multi-task pricing setting ($N=2000$, $K=2$). Left: True (causal) demand curves all have negative slopes. Right: Pooled data exhibit a positive price--demand relationship due to optimal pricing responding to store-specific intercepts.
  • Figure 2: Estimation performance of the simple outcome-based meta-estimator $\hat{\theta}_{ij} = a_j p_{i1} + b_j D_{i1} + c_j$ for $j\in\{0,1\}$ with $K=2$. The left panel shows slope estimates while the right panel shows intercept estimates.
  • Figure 3: Estimation performance of the DCMOML meta-learner $\hat{\theta}_{ij} = a_j (p_{i1} + p_{i2}) + c_j$ for $j\in\{0,1\}$ with $K=2$. The estimator well approximates the demand parameters across all tasks.
  • Figure 4: Estimation error across confounding levels for $K=2$. Error bars show $\pm 1$ SE.
  • Figure 5: Held-out RMSE with 95% confidence intervals. DCMOML is highlighted in red.

Theorems & Definitions (11)

  • Remark 1: Relation to Double Machine Learning and the R-Learner
  • Example 1: Shared-Model Estimation Can Fail under Near-Optimal Local Pricing behavior
  • Example 2: Failure of Meta-Learning
  • Remark 2
  • Definition 1: Decision-Conditioned Masked-Outcome Meta-Learning (DCMOML)
  • Remark 3
  • Theorem 1: Identifiability
  • Theorem 2: Consistency of DCMOML under realizability
  • Example 3: Continuation of Example \ref{['ex:sign_flip']}
  • proof : Proof of Theorem \ref{['thm:identify']}
  • ...and 1 more