Controlled rough SDEs, pathwise stochastic control and dynamic programming principles
Peter K. Friz, Khoa Lê, Huilin Zhang
TL;DR
This work develops a unifying rough-path framework for stochastic optimal control of RSDEs by replacing conditioning noise with a deterministic rough input $\mathbf{X}$ and studying the resulting rough value function $\mathcal{V}(s,y;\mathbf{X})$. It proves regularity and a rough dynamic programming principle, and provides a measurable-selection approach to connect RSDEs with doubly stochastic SDEs under conditioning. Through two-stage randomization, it shows that different admissible control classes yield the same stochastic value function in a precise sense and derives a stochastic DPP for the randomized problems. The results extend to Brownian noise, where causal controls recover classical Itô SDEs and reveal a clean connection to doubly stochastic SDEs, thereby unifying pathwise and probabilistic control perspectives without requiring conditioning noise statistics. Overall, the paper offers a coherent, robust treatment of pathwise stochastic control under rough inputs with implications for filtering, SPDEs, and reinforcement learning.
Abstract
We study stochastic optimal control of rough stochastic differential equations (RSDEs). This is in the spirit of the pathwise control problem (Lions--Souganidis 1998, Buckdahn--Ma 2007; also Davis--Burstein 1992), with renewed interest and recent works drawing motivation from filtering, SPDEs, and reinforcement learning. Results include regularity of rough value functions, validity of a rough dynamic programming principles and new rough stability results for HJB equations, removing excessive regularity demands previously imposed by flow transformation methods. Measurable selection is used to relate RSDEs to "doubly stochastic" SDEs under conditioning. In contrast to previous works, Brownian statistics for the to-be-conditioned-on noise are not required, aligned with the "pathwise" intuition that these should not matter upon conditioning. Depending on the chosen class of admissible controls, the involved processes may also be anticipating. The resulting stochastic value functions coincide in great generality for different classes of controls. RSDE theory offers a powerful and unified perspective on this problem class.
