Building Optimal Neural Architectures using Interpretable Knowledge
Keith G. Mills, Fred X. Han, Mohammad Salameh, Shengyao Lu, Chunhua Zhou, Jiao He, Fengyu Sun, Di Niu
TL;DR
The paper tackles the high cost of Neural Architecture Search by presenting AutoBuild, a method that learns to rank architectural building blocks through a magnitude-ranked, hop-aware embedding space learned by graph neural networks. By aligning subgraph and node embeddings with end-task performance via a differentiable ranking loss and a feature embedding module, AutoBuild assigns interpretable importance scores to architectural modules and can directly assemble high-performing architectures or prune search spaces. The approach is demonstrated across ImageNet macro-search spaces, panoptic segmentation, and generative AI with limited evaluations, showing improved Pareto fronts and FID metrics with far fewer evaluated architectures. This work reduces NAS cost while providing interpretable guidance on which modules and features drive performance, enabling more efficient and scalable architecture design in CV and generative modeling tasks.
Abstract
Neural Architecture Search is a costly practice. The fact that a search space can span a vast number of design choices with each architecture evaluation taking nontrivial overhead makes it hard for an algorithm to sufficiently explore candidate networks. In this paper, we propose AutoBuild, a scheme which learns to align the latent embeddings of operations and architecture modules with the ground-truth performance of the architectures they appear in. By doing so, AutoBuild is capable of assigning interpretable importance scores to architecture modules, such as individual operation features and larger macro operation sequences such that high-performance neural networks can be constructed without any need for search. Through experiments performed on state-of-the-art image classification, segmentation, and Stable Diffusion models, we show that by mining a relatively small set of evaluated architectures, AutoBuild can learn to build high-quality architectures directly or help to reduce search space to focus on relevant areas, finding better architectures that outperform both the original labeled ones and ones found by search baselines. Code available at https://github.com/Ascend-Research/AutoBuild
