Hadamard Representations: Augmenting Hyperbolic Tangents in RL
Jacob E. Kooi, Mark Hoogendoorn, Vincent François-Lavet
TL;DR
The paper investigates dying neurons in RL and shows that continuously differentiable activations like tanh suffer from saturation similar to ReLU, limiting performance. It introduces Hadamard representations, where a hidden layer is the Hadamard product of two parallel activations, to preserve gradient flow and reduce dead neurons for tanh. Empirical results across DQN, PPO, and PQN on Atari demonstrate faster learning, higher effective rank, and substantial performance gains with tanh HR, while HR does not help ReLU. The work highlights a path to leverage smooth activations in RL and suggests broader exploration of Hadamard-style architectures and activation combos, albeit with increased parameter counts and activation-dependent effects.
Abstract
Activation functions are one of the key components of a deep neural network. The most commonly used activation functions can be classed into the category of continuously differentiable (e.g. tanh) and piece-wise linear functions (e.g. ReLU), both having their own strengths and drawbacks with respect to downstream performance and representation capacity through learning. In reinforcement learning, the performance of continuously differentiable activations often falls short as compared to piece-wise linear functions. We show that the dying neuron problem in RL is not exclusive to ReLUs and actually leads to additional problems in the case of continuously differentiable activations such as tanh. To alleviate the dying neuron problem with these activations, we propose a Hadamard representation that unlocks the advantages of continuously differentiable activations. Using DQN, PPO and PQN in the Atari domain, we show faster learning, a reduction in dead neurons and increased effective rank.
