Learning Agents With Prioritization and Parameter Noise in Continuous State and Action Space

Rajesh Mangannavar; Gopalakrishnan Srinivasaraghavan

TL;DR

This paper introduces a prioritized combination of state-of-the-art approaches such as Deep Q-learning (DQN) and Deep Deterministic Policy Gradient (DDPG) that outperforms earlier results on continuous state and action space problems.

Abstract

Among the many variants of reinforcement learning (RL), an important class of problems is the one in which the state and action spaces are continuous. Autonomous robots, autonomous vehicles, and optimal control are all examples of problems that have continuous state and action spaces and lend themselves naturally to reinforcement-learning-based algorithms. In this paper, we introduce a prioritized form of a combination of state-of-the-art approaches such as Deep Q-learning (DQN) and Deep Deterministic Policy Gradient (DDPG) to outperform earlier results for continuous state and action space problems. Our experiments also use parameter noise during training, which yields more robust deep RL models that outperform the earlier results significantly. We believe these results are a valuable addition to the work on continuous state and action space problems.
