Scalable Reinforcement Learning-based Neural Architecture Search
Amber Cassimon, Siegfried Mercelis, Kevin Mets
TL;DR
This paper tackles the scalability bottleneck in Neural Architecture Search (NAS) by learning to search architectures through reinforcement learning, rather than producing a single optimal model. It casts NAS as an incremental decision process over graphs of architectures with an explicit MDP formulation, neighbor-generation rules, reward shaping, and a transformer-based Ape-X Q-learning agent. Evaluations on NAS-Bench-101 (tabular) and NAS-Bench-301 (predictive) show that the approach scales to very large search spaces and can outperform baselines at low query budgets, though it becomes less robust to hyperparameter choices and slower to train on larger benchmarks. The work highlights the potential of re-usability of the learned search policy and the need for faster performance estimators to make RL-based NAS practical in real-world scenarios.
Abstract
In this publication, we assess the ability of a novel Reinforcement Learning-based solution to the problem of Neural Architecture Search, where a Reinforcement Learning (RL) agent learns to search for good architectures, rather than to return a single optimal architecture. We consider both the NAS-Bench-101 and NAS- Bench-301 settings, and compare against various known strong baselines, such as local search and random search. We conclude that our Reinforcement Learning agent displays strong scalability with regards to the size of the search space, but limited robustness to hyperparameter changes.
