Utilizing Priors in Sampling-based Cost Minimization
Yuan-Yao Lou, Jonathan Spencer, Kwang Taik Kim, Mung Chiang
TL;DR
This work considers an autonomous vehicle agent performing a long-term cost-minimization problem in the elapsed time T over sequences of states and actions for some fixed, known cost function, approximate system dynamics, and distribution over initial states.
Abstract
We consider an autonomous vehicle (AV) agent performing a long-term cost-minimization problem in the elapsed time $T$ over sequences of states $s_{1:T}$ and actions $a_{1:T}$ for some fixed, known (though potentially learned) cost function $C(s_t,a_t)$, approximate system dynamics $P$, and distribution over initial states $d_0$. The goal is to minimize the expected cost-to-go of the driving trajectory $τ= s_1, a_1, ..., s_T, a_T$ from the initial state.
