Strategically Linked Decisions in Long-Term Planning and Reinforcement Learning
Alihan Hüyük, Finale Doshi-Velez
TL;DR
This paper introduces strategic link scores, defined as the drop in the likelihood of a setup decision when a necessary pay-off decision is blocked, formalized as $\mathfrak{S}^{\dagger}_{(s,a)\to(\tilde{s},\tilde{a})} = \pi^{\dagger}(a|s) - \pi^{\dagger:\{\pi(\tilde{a}|\tilde{s})=0\}}(a|s)$, to capture interdependencies in long-horizon planning. It demonstrates three main applications: planning-level explanations for RL by identifying strategically linked decisions, safe policy improvement through strategy-aware recommendations, and interventions-based characterization of planning behavior in both RL and non-RL agents (including a traffic routing scenario). The approach is illustrated with GridWorld experiments, a Shortcuts environment, and a realistic traffic simulator, showing that strategic links can be inferred from demonstrations via inverse reinforcement learning or from known planners. The results highlight improved explainability, safer incremental improvements, and the ability to quantify planning horizons through interventions, offering a broadly applicable tool for understanding and guiding strategic decision-making under constraints.
Abstract
Long-term planning, as in reinforcement learning (RL), involves finding strategies: actions that collectively work toward a goal rather than individually optimizing their immediate outcomes. As part of a strategy, some actions are taken at the expense of short-term benefit to enable future actions with even greater returns. These actions are only advantageous if followed up by the actions they facilitate, consequently, they would not have been taken if those follow-ups were not available. In this paper, we quantify such dependencies between planned actions with strategic link scores: the drop in the likelihood of one decision under the constraint that a follow-up decision is no longer available. We demonstrate the utility of strategic link scores through three practical applications: (i) explaining black-box RL agents by identifying strategically linked pairs among decisions they make, (ii) improving the worst-case performance of decision support systems by distinguishing whether recommended actions can be adopted as standalone improvements or whether they are strategically linked hence requiring a commitment to a broader strategy to be effective, and (iii) characterizing the planning processes of non-RL agents purely through interventions aimed at measuring strategic link scores - as an example, we consider a realistic traffic simulator and analyze through road closures the effective planning horizon of the emergent routing behavior of many drivers.
