Hadamax Encoding: Elevating Performance in Model-Free Atari
Jacob E. Kooi, Zhao Yang, Vincent François-Lavet
TL;DR
This work tackles limitations in pixel-based model-free reinforcement learning by introducing the Hadamax encoder, which combines max-pooling down-sampling, Hadamard representations across parallel hidden layers, and GELU activations within the PQN framework. The approach yields state-of-the-art Atari-57 results without any hyperparameter changes and speeds training relative to baselines, while also transferring improvements to other algorithms like C51 and DQN/Rainbow. The authors demonstrate that Hadamax increases deeper-layer effective rank and maintains a low proportion of dead neurons, supporting stable, high-capacity representations. Overall, Hadamax provides a strong architectural default for model-free Atari agents and motivates future exploration of scalable, Hadamard-based encoders and MoE extensions.
Abstract
Neural network architectures have a large impact in machine learning. In reinforcement learning, network architectures have remained notably simple, as changes often lead to small gains in performance. This work introduces a novel encoder architecture for pixel-based model-free reinforcement learning. The Hadamax (\textbf{Hada}mard \textbf{max}-pooling) encoder achieves state-of-the-art performance by max-pooling Hadamard products between GELU-activated parallel hidden layers. Based on the recent PQN algorithm, the Hadamax encoder achieves state-of-the-art model-free performance in the Atari-57 benchmark. Specifically, without applying any algorithmic hyperparameter modifications, Hadamax-PQN achieves an 80\% performance gain over vanilla PQN and significantly surpasses Rainbow-DQN. For reproducibility, the full code is available on \href{https://github.com/Jacobkooi/Hadamax}{GitHub}.
