Deep Reinforcement Learning for Online Optimal Execution Strategies

Alessandro Micheli; Mélodie Monod

Deep Reinforcement Learning for Online Optimal Execution Strategies

Alessandro Micheli, Mélodie Monod

TL;DR

A novel actor-critic algorithm based on Deep Deterministic Policy Gradient (DDPG) is introduced to address the challenge of learning non-Markovian optimal execution strategies in dynamic financial markets, with a focus on transient price impact modeled by a general decay kernel.

Abstract

This paper tackles the challenge of learning non-Markovian optimal execution strategies in dynamic financial markets. We introduce a novel actor-critic algorithm based on Deep Deterministic Policy Gradient (DDPG) to address this issue, with a focus on transient price impact modeled by a general decay kernel. Through numerical experiments with various decay kernels, we show that our algorithm successfully approximates the optimal execution strategy. Additionally, the proposed algorithm demonstrates adaptability to evolving market conditions, where parameters fluctuate over time. Our findings also show that modern reinforcement learning algorithms can provide a solution that reduces the need for frequent and inefficient human intervention in optimal execution tasks.

Deep Reinforcement Learning for Online Optimal Execution Strategies

TL;DR

Abstract

Deep Reinforcement Learning for Online Optimal Execution Strategies

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (5)