Interpolated stochastic interventions based on propensity scores, target policies and treatment-specific costs

Johan de Aguas

Interpolated stochastic interventions based on propensity scores, target policies and treatment-specific costs

Johan de Aguas

TL;DR

This work develops cost-aware stochastic interventions for discrete treatments by formulating a cost-penalized information projection that yields Boltzmann-Gibbs couplings and tilted marginals. The two delta-indexed families interpolate between input policies and a product-of-experts limit under positive costs, controlled by a single tilt parameter. Efficient semiparametric estimators based on influence functions and one-step corrections are derived, with cross-fitting and uniform confidence bands for inference. The framework supports graded hypothesis testing and policy design under budgets, enabling prospective policy prototyping from observational data prior to experiments.

Abstract

We introduce two families of stochastic interventions with discrete treatments that connect causal modeling to cost-sensitive decision making. The interventions arise from a cost-penalized information projection of the independent product of the organic propensity scores and a reference policy, yielding closed-form Boltzmann-Gibbs couplings. The induced marginals define modified stochastic policies that interpolate smoothly, via a tilt parameter, from the organic law or from the reference law toward a product-of-experts limit when all destination costs are strictly positive. The first family recovers and extends incremental propensity score interventions, retaining identification without global positivity. For inference on the expected outcomes after these policies, we derive the efficient influence functions under a nonparametric model and construct one-step estimators. In simulations, the proposed estimators improve stability and robustness to nuisance misspecification relative to plug-in baselines. The framework can operationalize graded scientific hypotheses under realistic constraints. Because inputs are modular, analysts can sweep feasible policy spaces, prototype candidates, and align interventions with budgets and logistics before committing experimental resources.

Interpolated stochastic interventions based on propensity scores, target policies and treatment-specific costs

TL;DR

Abstract

Interpolated stochastic interventions based on propensity scores, target policies and treatment-specific costs

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (2)

Theorems & Definitions (5)