Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network

Bingdong Li; Mei Jiang; Hong Qian; Ke Tang; Aimin Zhou; Peng Yang

Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network

Bingdong Li, Mei Jiang, Hong Qian, Ke Tang, Aimin Zhou, Peng Yang

TL;DR

This paper tackles the high computational cost of evolutionary reinforcement learning in high-dimensional policy spaces by introducing a learnable surrogate-assisted framework (AE-HNN-NCS). It combines an Autoencoder for adaptive policy embedding with a Hyperbolic Neural Network surrogate to perform ranking-based pre-selection, reducing real environment evaluations while preserving search efficacy. Empirical results on 10 Atari games and 4 MuJoCo tasks show that AE-HNN-NCS outperforms baselines and state-of-the-art ERL methods, with faster wall-clock training due to reduced evaluations and more structured exploration trajectories. The approach offers a scalable, end-to-end learnable solution to the curse of dimensionality in ERL and points to future enhancements in autoencoder variants, regression surrogates, and diversification strategies.

Abstract

Evolutionary Reinforcement Learning (ERL), training the Reinforcement Learning (RL) policies with Evolutionary Algorithms (EAs), have demonstrated enhanced exploration capabilities and greater robustness than using traditional policy gradient. However, ERL suffers from the high computational costs and low search efficiency, as EAs require evaluating numerous candidate policies with expensive simulations, many of which are ineffective and do not contribute meaningfully to the training. One intuitive way to reduce the ineffective evaluations is to adopt the surrogates. Unfortunately, existing ERL policies are often modeled as deep neural networks (DNNs) and thus naturally represented as high-dimensional vectors containing millions of weights, which makes the building of effective surrogates for ERL policies extremely challenging. This paper proposes a novel surrogate-assisted ERL that integrates Autoencoders (AE) and Hyperbolic Neural Networks (HNN). Specifically, AE compresses high-dimensional policies into low-dimensional representations while extracting key features as the inputs for the surrogate. HNN, functioning as a classification-based surrogate model, can learn complex nonlinear relationships from sampled data and enable more accurate pre-selection of the sampled policies without real evaluations. The experiments on 10 Atari and 4 Mujoco games have verified that the proposed method outperforms previous approaches significantly. The search trajectories guided by AE and HNN are also visually demonstrated to be more effective, in terms of both exploration and convergence. This paper not only presents the first learnable policy embedding and surrogate-modeling modules for high-dimensional ERL policies, but also empirically reveals when and why they can be successful.

Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network

TL;DR

Abstract

Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)