Table of Contents
Fetching ...

Causal Strategic Learning with Competitive Selection

Kiet Q. H. Vo, Muneeb Aadil, Siu Lun Chau, Krikamol Muandet

TL;DR

This work studies causal strategic learning with selection in a sequential, multi-DM setting, showing that optimal selection rules must trade off between selecting the best agents and incentivising their improvement, while often relying on noncausal outcome predictions. It derives an explicit AO selection rule $\bm{\theta}^{AO}=\frac{\bm{\alpha}+\gamma\bm{\mathcal{E}}\bm{\mathcal{E}}^{\top}\bm{\theta}^{*}}{\|\bm{\alpha}+\gamma\bm{\mathcal{E}}\bm{\mathcal{E}}^{\top}\bm{\theta}^{*}\|_2}$ under linearity and boundedness assumptions, and identifies welfare safeguards via a positive cosine between the AO direction and the causal-improvement direction. To recover true causal parameters from observational data under selection, the paper introduces ranking-based strategies and Mean-shift Linear Regression (MSLR), along with a cooperative protocol that enables unbiased estimation when multiple DMs are involved. The experimental results on synthetic admissions data illustrate that AO decisions improve utility and that MSLR yields consistent estimates of $\bm{\theta}^{*}$, even under competitive selection, highlighting the importance of causal modeling and regulatory coordination to mitigate gaming and preserve agent welfare.

Abstract

We study the problem of agent selection in causal strategic learning under multiple decision makers and address two key challenges that come with it. Firstly, while much of prior work focuses on studying a fixed pool of agents that remains static regardless of their evaluations, we consider the impact of selection procedure by which agents are not only evaluated, but also selected. When each decision maker unilaterally selects agents by maximising their own utility, we show that the optimal selection rule is a trade-off between selecting the best agents and providing incentives to maximise the agents' improvement. Furthermore, this optimal selection rule relies on incorrect predictions of agents' outcomes. Hence, we study the conditions under which a decision maker's optimal selection rule will not lead to deterioration of agents' outcome nor cause unjust reduction in agents' selection chance. To that end, we provide an analytical form of the optimal selection rule and a mechanism to retrieve the causal parameters from observational data, under certain assumptions on agents' behaviour. Secondly, when there are multiple decision makers, the interference between selection rules introduces another source of biases in estimating the underlying causal parameters. To address this problem, we provide a cooperative protocol which all decision makers must collectively adopt to recover the true causal parameters. Lastly, we complement our theoretical results with simulation studies. Our results highlight not only the importance of causal modeling as a strategy to mitigate the effect of gaming, as suggested by previous work, but also the need of a benevolent regulator to enable it.

Causal Strategic Learning with Competitive Selection

TL;DR

This work studies causal strategic learning with selection in a sequential, multi-DM setting, showing that optimal selection rules must trade off between selecting the best agents and incentivising their improvement, while often relying on noncausal outcome predictions. It derives an explicit AO selection rule under linearity and boundedness assumptions, and identifies welfare safeguards via a positive cosine between the AO direction and the causal-improvement direction. To recover true causal parameters from observational data under selection, the paper introduces ranking-based strategies and Mean-shift Linear Regression (MSLR), along with a cooperative protocol that enables unbiased estimation when multiple DMs are involved. The experimental results on synthetic admissions data illustrate that AO decisions improve utility and that MSLR yields consistent estimates of , even under competitive selection, highlighting the importance of causal modeling and regulatory coordination to mitigate gaming and preserve agent welfare.

Abstract

We study the problem of agent selection in causal strategic learning under multiple decision makers and address two key challenges that come with it. Firstly, while much of prior work focuses on studying a fixed pool of agents that remains static regardless of their evaluations, we consider the impact of selection procedure by which agents are not only evaluated, but also selected. When each decision maker unilaterally selects agents by maximising their own utility, we show that the optimal selection rule is a trade-off between selecting the best agents and providing incentives to maximise the agents' improvement. Furthermore, this optimal selection rule relies on incorrect predictions of agents' outcomes. Hence, we study the conditions under which a decision maker's optimal selection rule will not lead to deterioration of agents' outcome nor cause unjust reduction in agents' selection chance. To that end, we provide an analytical form of the optimal selection rule and a mechanism to retrieve the causal parameters from observational data, under certain assumptions on agents' behaviour. Secondly, when there are multiple decision makers, the interference between selection rules introduces another source of biases in estimating the underlying causal parameters. To address this problem, we provide a cooperative protocol which all decision makers must collectively adopt to recover the true causal parameters. Lastly, we complement our theoretical results with simulation studies. Our results highlight not only the importance of causal modeling as a strategy to mitigate the effect of gaming, as suggested by previous work, but also the need of a benevolent regulator to enable it.
Paper Structure (27 sections, 20 theorems, 92 equations, 12 figures, 2 tables, 1 algorithm)

This paper contains 27 sections, 20 theorems, 92 equations, 12 figures, 2 tables, 1 algorithm.

Key Result

Theorem 3.1

Suppose Assumptions (H1), (H2), (S1), and (S2) hold. Then, the optimal parameter vector for the DM can be expressed as

Figures (12)

  • Figure 1: Causal strategic learning with two decision makers. The statistics (STAT) and cyber-security (CySec) departments, each applies selection rules $\bm{\delta}_{\theta_S}$ and $\bm{\delta}_{\theta_C}$ to maximise their enrolled students' expected outcomes (future GPA). The parameters $\theta_S$ and $\theta_C$ are made public, prompting future students to strategically alter their attributes ($\mathbf{X}$ to $\tilde{\mathbf{X}}$) to boost admission chances.
  • Figure 2: Causal graphs for our settings with selection variable $W_t$ (left) and multiple decision makers (right). The patterned nodes $Y_t$ and $Y_{it}$ represent the partial observability nature of these variables.
  • Figure 3: Estimation of the causal parameters $\bm{\theta}^*$ under various scenarios. Error bars show 95% confidence interval.
  • Figure 4: A scatter plot showing different pairs of values for cPI$(\bm{\theta}_t)$ and cBP$(\bm{\theta}_t)$ by varying $\bm{\theta}_{t}$. The pair that maximises $\mathcal{Q}(\bm{\theta}_{t})$ is shown in orange. Dashed and dotted lines show the respective maximum values of cPI and cBP. In this case, cPI$(\bm{\theta}_t)$ and cBP$(\bm{\theta}_t)$ cannot be maximised simultaneously.
  • Figure 5: Distribution of $X_t$, $B_t$ and $Y_{it}$ in our synthetic dataset, for $n=1$. We refer the readers to the text for details.
  • ...and 7 more figures

Theorems & Definitions (43)

  • Theorem 3.1: Bounded optimum
  • Corollary 3.2: Maximum improvement
  • Corollary 3.3: Bounded reduction
  • Definition 1: Ranking selection
  • Theorem 3.4: Local exogeneity
  • Proposition 1: Dominant strategy
  • Theorem 3.5: Bounded optimum, extended
  • Corollary 3.6: Maximum improvement, extended
  • Corollary 3.7: Bounded reduction, extended
  • Definition 2: Cooperative protocol
  • ...and 33 more