Multi-Objective Neural Architecture Search by Learning Search Space Partitions
Yiyang Zhao, Linnan Wang, Tian Guo
TL;DR
This work introduces LaMOO, a learning-based meta-optimizer for multi-objective neural architecture search (NAS) that partitions the search space into promising and non-promising regions and focuses sampling within these regions using Monte Carlo Tree Search. By combining space-partition learning with sampling strategies (random, qEHVI, CMA-ES) and integrating with one-shot, few-shot, and predictor-based NAS, LaMOO achieves substantial gains in sample efficiency and Pareto-front quality across NasBench201, NasBench301, HW-NAS-Bench, and open-domain tasks (CIFAR-10, ImageNet, Penn Treebank, MS COCO). The approach leverages dominance-based region labeling, SVM-based partitions, and hypervolume as a guiding metric, while exploring leaf-based MCTS to reduce hypervolume computation costs in many-objective settings. Empirically, LaMOO yields state-of-the-art or competitive performance with dramatically fewer samples and lower resource requirements, demonstrating strong practical impact for hardware-aware and open-domain NAS. The work also analyzes design choices (sampling methods, kernel types, and MCTS strategies) and suggests avenues for extending space-partitioning techniques to other AutoML and ML-design problems.
Abstract
Deploying deep learning models requires taking into consideration neural network metrics such as model size, inference latency, and #FLOPs, aside from inference accuracy. This results in deep learning model designers leveraging multi-objective optimization to design effective deep neural networks in multiple criteria. However, applying multi-objective optimizations to neural architecture search (NAS) is nontrivial because NAS tasks usually have a huge search space, along with a non-negligible searching cost. This requires effective multi-objective search algorithms to alleviate the GPU costs. In this work, we implement a novel multi-objectives optimizer based on a recently proposed meta-algorithm called LaMOO on NAS tasks. In a nutshell, LaMOO speedups the search process by learning a model from observed samples to partition the search space and then focusing on promising regions likely to contain a subset of the Pareto frontier. Using LaMOO, we observe an improvement of more than 200% sample efficiency compared to Bayesian optimization and evolutionary-based multi-objective optimizers on different NAS datasets. For example, when combined with LaMOO, qEHVI achieves a 225% improvement in sample efficiency compared to using qEHVI alone in NasBench201. For real-world tasks, LaMOO achieves 97.36% accuracy with only 1.62M #Params on CIFAR10 in only 600 search samples. On ImageNet, our large model reaches 80.4% top-1 accuracy with only 522M #FLOPs.
