On the grid-sampling limit SDE
Christian Bender, Nguyen Tran Thuan
TL;DR
The grid-sampling SDE is introduced as a proxy for modeling exploration in continuous-time reinforcement learning and its wellposedness in the presence of jumps is discussed.
Abstract
In our recent work [3] we introduced the grid-sampling SDE as a proxy for modeling exploration in continuous-time reinforcement learning. In this note, we provide further motivation for the use of this SDE and discuss its wellposedness in the presence of jumps.
