Targeted Sequential Indirect Experiment Design

Elisabeth Ailer; Niclas Dern; Jason Hartford; Niki Kilbertus

Targeted Sequential Indirect Experiment Design

Elisabeth Ailer, Niclas Dern, Jason Hartford, Niki Kilbertus

TL;DR

This paper tackles the challenge of answering targeted causal questions when direct interventions are impractical and confounding obscures identification. It casts indirect experimentation as a sequential minimax instrumental-variable problem, aiming to bound a query $Q[f_0]$ of a nonlinear, multivariate mechanism by maintaining $Q^+(\pi)$ and $Q^-(\u0003\pi)$ and iteratively reducing the gap $\Delta(\pi)$. The authors derive closed-form estimators for these bounds under RKHS assumptions, and propose adaptive strategies to select experiments that best shrink the bound interval, including a gradient-based adaptive policy with a Gaussian mixture over instrument settings. The approach enables robust, data-efficient narrowing of uncertainty about targeted causal effects in settings with unobserved confounding and complex dependencies, with demonstrated success in synthetic experiments and clear pathways to real-world deployment with further theoretical and practical development. Overall, the work advances adaptive, indirect experimentation by combining minimax IV estimation, RKHS-based closed-form solutions, and sequential policy optimization to identify or tightly bound specific aspects of a system’s mechanism.

Abstract

Scientific hypotheses typically concern specific aspects of complex, imperfectly understood or entirely unknown mechanisms, such as the effect of gene expression levels on phenotypes or how microbial communities influence environmental health. Such queries are inherently causal (rather than purely associational), but in many settings, experiments can not be conducted directly on the target variables of interest, but are indirect. Therefore, they perturb the target variable, but do not remove potential confounding factors. If, additionally, the resulting experimental measurements are multi-dimensional and the studied mechanisms nonlinear, the query of interest is generally not identified. We develop an adaptive strategy to design indirect experiments that optimally inform a targeted query about the ground truth mechanism in terms of sequentially narrowing the gap between an upper and lower bound on the query. While the general formulation consists of a bi-level optimization procedure, we derive an efficiently estimable analytical kernel-based estimator of the bounds for the causal effect, a query of key interest, and demonstrate the efficacy of our approach in confounded, multivariate, nonlinear synthetic settings.

Targeted Sequential Indirect Experiment Design

TL;DR

of a nonlinear, multivariate mechanism by maintaining

and

and iteratively reducing the gap

. The authors derive closed-form estimators for these bounds under RKHS assumptions, and propose adaptive strategies to select experiments that best shrink the bound interval, including a gradient-based adaptive policy with a Gaussian mixture over instrument settings. The approach enables robust, data-efficient narrowing of uncertainty about targeted causal effects in settings with unobserved confounding and complex dependencies, with demonstrated success in synthetic experiments and clear pathways to real-world deployment with further theoretical and practical development. Overall, the work advances adaptive, indirect experimentation by combining minimax IV estimation, RKHS-based closed-form solutions, and sequential policy optimization to identify or tightly bound specific aspects of a system’s mechanism.

Abstract

Paper Structure (18 sections, 7 theorems, 32 equations, 5 figures, 3 algorithms)

This paper contains 18 sections, 7 theorems, 32 equations, 5 figures, 3 algorithms.

Introduction
Problem Setting and Related Work
Problem Setting
Related Work
Methodology
Minimax Instrumental Variable Estimation
Sequential Policy Learning for Bounding Functionals
Solving the Minimax Optimization Problem
Sequential Experiment Selection
Experiments
Discussion and Conclusion
Proofs
Details on Adaptive Policies
Hyperparameter Tuning
Runtimes
...and 3 more sections

Key Result

Theorem 1

If $\mathop{\mathrm{\mathbb{E}}}\nolimits[(Y - f(X))^2 k_Z(Z,Z)] < \infty$, consistently estimates $\sup_{g\in \mathcal{H}_Z, \|g\|\le 1} \langle r_0- Tf, g\rangle_{ \mathcal{H}_Z}$ from data $\mathcal{D}$.

Figures (5)

Figure 1: We compare the different strategies in our synthetic setting. Left and right only differ in the range of the y-axis. The black constant line represents the true value of $Q[f_0]$. Top: Estimated upper and lower bounds $Q^{\pm}_t$ over $t \in [T]$ for $n_{\mathrm{seeds}} = 50$ and two different 'zoom levels' on the y-axis. Lines are means and shaded regions are $(10,90)$-percentiles. Bottom: The final estimated bounds $Q^{\pm}_T$ at $T=16$. The dotted line is $y=0$. Both locally guided heuristic (explore-then-exploit, alternating explore exploit) confidently bound the target query away from zero with a relatively narrow gap between them. Our targeted adaptive strategy is even better and essentially identifies the target query $Q[f_0]=20$ after $T=16$ rounds.
Figure 2: Adaptive Method for different hyperparameter settings. The x-axis shows them in the following order: $(\lambda_c, \lambda_s, \alpha_t)$
Figure 3: Wallclock runtimes of the different methods.
Figure 4: We compare the different strategies in our synthetic setting. The black constant line represents the true value of $Q[f_0]$. Both plots show the estimated upper and lower bounds $Q^{\pm}_t$ at $T = 16$ for $n_{\mathrm{seeds}} = 50$. Lines are means and shaded regions are $(10,90)$-percentiles. Left:${d_z} = 5, {d_x} = 20$, Right:${d_z} = {d_x} = 20$.
Figure 5: We compare the wallclock time in a higher dimensional setting. The time increase is still mild. The main driver is the number of samples in each experiment, instead of the dimensionality itself. Left:${d_z} = 5, {d_x} = 20$, Right:${d_z} = {d_x} = 20$.

Theorems & Definitions (10)

Theorem 1: zhang2020maximum
Theorem 2
Theorem 3: Proposition 10 of dikkala2020minimax
proof
Theorem 3
proof
proof
Theorem 4
Lemma 5: severini2006identification
Lemma 6: severini2012efficiencybounds

Targeted Sequential Indirect Experiment Design

TL;DR

Abstract

Targeted Sequential Indirect Experiment Design

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (10)