Monotone Causality in Opportunistically Stochastic Shortest Path Problems

Mallory E. Gaspard; Alexander Vladimirsky

Monotone Causality in Opportunistically Stochastic Shortest Path Problems

Mallory E. Gaspard, Alexander Vladimirsky

TL;DR

This work defines a class of Opportunistically Stochastic Shortest Path (OSSP) problems and deriving sufficient conditions for applicability of noniterative label-setting methods, and uses OSSPs to derive causality conditions for semi-Lagrangian discretizations of anisotropic Hamilton-Jacobi equations.

Abstract

When traveling through a graph with an accessible deterministic path to a target, is it ever preferable to resort to stochastic node-to-node transitions instead? And if so, what are the conditions guaranteeing that such a stochastic optimal routing policy can be computed efficiently? We aim to answer these questions here by defining a class of Opportunistically Stochastic Shortest Path (OSSP) problems and deriving sufficient conditions for applicability of non-iterative label-setting methods. The usefulness of this framework is demonstrated in two very different contexts: numerical analysis and autonomous vehicle routing. We use OSSPs to derive causality conditions for semi-Lagrangian discretizations of anisotropic Hamilton-Jacobi equations. We also use a Dijkstra-like method to solve OSSPs optimizing the timing and urgency of lane change maneuvers for an autonomous vehicle navigating road networks with a heterogeneous traffic load.

Monotone Causality in Opportunistically Stochastic Shortest Path Problems

TL;DR

Abstract

Paper Structure (17 sections, 11 theorems, 57 equations, 13 figures)

This paper contains 17 sections, 11 theorems, 57 equations, 13 figures.

INTRODUCTION
SSPs and OSSPs
General Stochastic Shortest Path Problems
Randomized Policies, Dominated Actions, and Convexified Cost
Opportunistically Stochastic Shortest Path Problems
Label-Setting and Monotone Causality in OSSPs
Label-Setting Methods for Deterministic SP Problems
Applicability of Noniterative Methods to general SSPs
Guaranteeing Monotone Causality in OSSPs
OSSPs and Continuous Optimal Control Problems
Background, prior work, and notation
Monotone Causality in $\mathbb{R}^2$
Monotone Causality in $\mathbb{R}^{3}$
Monotone $\delta$-Causality in $\mathbb{R}^2$
OSSPs and Autonomous Vehicle Routing
...and 2 more sections

Key Result

Theorem 1

\newlabeltheorem:gen_caus_condition_improved0 Suppose there exists a $\delta \ge 0$ such that, for all $\bm{x}_i \in X \backslash \{\bm{t}\}$ and $\bm{a} \in A_i^u,$ If these conditions are satisfied, this OSSP is monotone causal and Dijkstra's method is applicable. If $\delta >0,$ the OSSP is monotone $\delta$-causal and Dial's method with buckets of width $\delta$ is also applicable.

Figures (13)

Figure 1: Transition digraphs for two simple SSP examples. The target node is marked by $\bm{t}$, and the possibility of a transition from node $\bm{x}_i$ to a successor node $\bm{x}_j$ is indicated by a dashed arrow.
Figure 1: The geometric interpretation of condition \ref{['eq:original_MC_cond']} for $m = 2$ with $\delta = 0$ (panel (a)) and $\delta = 0.3$ (panel (b)) for several examples of transition cost functions. The solid purple and solid gold curves are smooth and convex. The solid green curve is smooth and nonconvex, and the green-dashed curve is its convexified version. In all cases, $K(p)$ is monotone ($\delta$-)causal provided that the curve stays entirely above the two restriction lines on the interval $[0, 1]$. In (a), the purple curve violates the condition \ref{['eq:original_MC_cond']} for $r = 1$ even with $\delta = 0$; so, it cannot be monotone causal. While the other two curves satisfy \ref{['eq:original_MC_cond']} with $\delta = 0$, only the nonconvex curve (along with its convexified version) is monotone $\delta$-causal for $\delta \leq 0.3$. The smooth gold convex curve can only be monotone causal (with $\delta = 0$), as the restriction lines imposed by \ref{['eq:original_MC_cond']} coincide with its tangent lines at $p = 0$ and $p = 1$. \newlabelfig:mc_geometric_illustration0
Figure 1: Two computational stencils based on a uniform Cartesian grid in $\mathbb{R}^2$. The value function is computed at $\bm{x}$, and $\tilde{\bm{x}}_{\bm{\xi}}$ is the new position after traveling in a straight line along the chosen direction of motion $\bm{a}_{\bm{\xi}}$ with speed $f(\bm{x}, \bm{a}_{\bm{\xi}})$. In panel (a), the modes are essentially quadrants: $\mathcal{S}(\bm{x}) = \{ (\bm{x}_1, \bm{x}_3), (\bm{x}_3, \bm{x}_5), (\bm{x}_5, \bm{x}_7), (\bm{x}_7, \bm{x}_1) \}.$ In panel (b), each quadrant is split into two modes: $\mathcal{S}(\bm{x}) = \{ (\bm{x}_1, \bm{x}_2), (\bm{x}_2, \bm{x}_3), (\bm{x}_3, \bm{x}_4), (\bm{x}_4, \bm{x}_5), (\bm{x}_5, \bm{x}_6), (\bm{x}_6, \bm{x}_7), (\bm{x}_7, \bm{x}_8), (\bm{x}_8, \bm{x}_1) \}.$ \newlabelfig:4pt_and_8pt_stencils0
Figure 1: Panel (a): Example lane-level road network representation of a three-lane highway. Lanes are discretized into cells of length $D$ meters, and each node marks the center of a cell. The vehicle travels from the starting point $\bm{s}$ to the destination $\bm{t}$ via a series of planned LSMs. Panel (b): Actions available at node $\bm{x}$ in mode $s_1 = \{\bm{x}_j, \bm{x}_k\} \in \mathcal{S}(\bm{x})$. The vehicle may continue driving in the current lane and directly transition to $\bm{x}_j$ (solid purple arrow), forcefully switch lanes and directly transition to $\bm{x}_k$ (solid red arrow), or attempt a tentative lane change (dashed blue arrow). The other mode available at $\bm{x}$ is $s_2 = \{\bm{x}_j, \bm{x}_i\}$, encoding a possible switch to another lane. \newlabelfig:lane_level_network0
Figure 2: Example of deterministic (panel (a)) and probabilistic (panel (b)) pruning of the action set when $m = 2$. Depicted action choices at the node $\bm{x}_1$ in Figure \ref{['fig:ssp_examples']}(b), with transition probabilities $\mathbb{P}(\bm{x}_1 \rightarrow \bm{x}_5 \mid \bm{a}) = p(\bm{a})$ and $\mathbb{P}(\bm{x}_1 \rightarrow \bm{x}_2 \mid \bm{a}) = 1-p(\bm{a}).$ The specific cost function $K(p(\bm{a}))$ is chosen for the sake of illustration only. Many application-motivated examples of cost functions are considered in §\ref{['section:ossp_and_ahj']} and §\ref{['section:lane_change_formulation']}. Panel (a): The transition probabilities associated with each action in $A_1 = [\underline{\bm{a}}, \bar{\bm{a}}] \cup \{\bm{a}_1, \ldots, \bm{a}_4\}$ are indicated in green on the $p$-axis. During the deterministic pruning process, all actions $\bm{a} \in A_1$ that are transition-equivalent to another action $\tilde{\bm{a}} \in A_1$ with $K(p(\bm{a})) \ge K(p(\tilde{\bm{a}}))$ are removed along with the corresponding portion of the transition cost curve (crossed in red). Panel (b): Following the deterministic pruning, we obtain $\check{K}(p)$ by taking the lower convex envelope of $K$ (solid and dashed purple curve), and the resulting $A^{co}_1$ is indicated in orange above the $p$-axis. Replaceable actions $\bm{a}'$ and $\bm{a}_1$ are also removed at this stage. The transition probabilities associated with the remaining useful pure actions $A_1^u$ are indicated on the $p$-axis in solid purple, and their corresponding transition cost values are also marked in solid purple along $\check{K}$. \newlabelfig:cost_pruning0
...and 8 more figures

Theorems & Definitions (30)

Definition 1: Transition-equivalent Relaxed Actions
Definition 2: Dominated, Replaceable, and Useful Actions
proof 1
Definition 3: OSSP
Definition 4
Theorem 1: Sufficient Monotone $\delta$-Causality Condition in OSSPs
proof 2
Theorem 2: Simplified Monotone $\delta$-Causality Condition in OSSPs
proof 3
Remark 1
...and 20 more

Monotone Causality in Opportunistically Stochastic Shortest Path Problems

TL;DR

Abstract

Monotone Causality in Opportunistically Stochastic Shortest Path Problems

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (30)