Deep multitask neural networks for solving some stochastic optimal control problems

Christian Yeo


TL;DR

Addresses stochastic optimal control (SOC) problems solved through the backward dynamic programming principle (BDPP) when the state distribution is unknown and the state cannot be simulated. Proposes one deep multitask neural network per date, with a shared feature extractor and task-specific heads, so that all bang-bang decisions are learned simultaneously. Training relies on a novel loss weighting scheme, Sigmoid-Moving Average GradNorm (S-MAG), which balances learning across many tasks. The approach is validated on commodity derivative problems (Take-or-Pay and swing contracts) in one- and three-factor models, where it outperforms state-of-the-art BDPP-based methods and the Longstaff-Schwartz approach. The result is a scalable, data-efficient framework for BDPP-type SOC problems, with potential applicability beyond finance to other domains that require dynamic programming under uncertainty.

Abstract

Most existing neural network-based approaches for solving stochastic optimal control problems using the associated backward dynamic programming principle rely on the ability to simulate the underlying state variables. However, in some problems this simulation is infeasible, which forces a discretization of the state variable space and requires training one neural network for each data point. This approach becomes computationally inefficient when dealing with large state variable spaces. In this paper, we consider a class of such stochastic optimal control problems and introduce an effective solution based on multitask neural networks. To train our multitask neural network, we introduce a novel scheme that dynamically balances the learning across tasks. Through numerical experiments on real-world derivatives pricing problems, we show that our method outperforms state-of-the-art approaches.


Paper Structure (12 sections, 31 equations, 6 figures, 3 tables)


Introduction
Stochastic optimal control and dynamic programming
Stochastic optimal control problem
Backward dynamic programming principle
Multitask learning
Towards multitask learning
Training multitask neural network
Experiments
Implementation details
Take-or-Pay contract
Swing contract with penalty
Conclusion

Figures (6)

Figure 1: Illustration of the space of cumulative controls. We used $\underline{q} = 0$. Blue and green points will be called trivial tasks in the subsequent analysis.
Figure 2: Multitask network architecture. The shared module is made of the input layer and 2 hidden layers. There are $I_k=4$ task-specific layers, one for each task. Each output $\chi_k^{(i)}(\cdot;\theta)$ ($i=1,\ldots,I_k$) of the task-specific layers is an $\mathbb{R}^2$-valued vector used to define our approximation of the decision function as described in \ref{def_decision_function_f}.
Figure 3: Illustration of the deep backward multitask network. A single $\mathbb{R}$-valued feedforward neural network is used for date $t_0$; the networks for the remaining dates are multitask neural networks.
Figure 4: Take-or-Pay price estimation in the one-factor model for the first 100 iterations. The shaded areas represent confidence intervals.
Figure 5: Take-or-Pay price estimation in the three-factor model for the first 100 iterations. The shaded areas represent confidence intervals.
...and 1 more figure
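
The architecture in the Figure 2 caption (a shared module with 2 hidden layers feeding $I_k=4$ task-specific layers, each returning an $\mathbb{R}^2$-valued vector) can be sketched as a forward pass in plain Python. This is only an illustrative sketch, not the paper's implementation: the class name `MultitaskNet`, the helper functions, the input dimension, the hidden width, the ReLU activation, and the weight initialization are all assumptions, and the mapping from each $\mathbb{R}^2$ output to a decision function is defined in the paper and not reproduced here.

```python
import math
import random


def init_layer(n_in, n_out, rng):
    """Return (weights, biases) for a dense layer with small random weights."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def relu(v):
    """ReLU activation (an assumption; the caption does not name one)."""
    return max(0.0, v)


def dense(layer, x, activation=None):
    """Apply one dense layer to the vector x, with an optional activation."""
    weights, biases = layer
    out = [sum(w * v for w, v in zip(row, x)) + b
           for row, b in zip(weights, biases)]
    return [activation(v) for v in out] if activation else out


class MultitaskNet:
    """Hypothetical sketch: shared trunk with 2 hidden layers, one head per task."""

    def __init__(self, n_inputs, n_hidden, n_tasks=4, head_dim=2, seed=0):
        rng = random.Random(seed)
        # Shared module: its parameters are used by every task.
        self.shared = [init_layer(n_inputs, n_hidden, rng),
                       init_layer(n_hidden, n_hidden, rng)]
        # Task-specific layers: one linear head per task, each R^2-valued.
        self.heads = [init_layer(n_hidden, head_dim, rng)
                      for _ in range(n_tasks)]

    def forward(self, x):
        """Return one head_dim-sized output vector per task."""
        h = x
        for layer in self.shared:
            h = dense(layer, h, relu)
        return [dense(head, h) for head in self.heads]


# Usage: a 3-dimensional input (dimension chosen arbitrarily) and 8 hidden units.
net = MultitaskNet(n_inputs=3, n_hidden=8)
outputs = net.forward([0.5, -1.2, 2.0])
```

The point of the design is that the shared features are computed once per input and reused by all heads, so adding a task adds only one small head rather than a full network.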
