CAP: A Context-Aware Neural Predictor for NAS
Han Ji, Yuqi Feng, Yanan Sun
TL;DR
This work tackles the high annotation cost of neural predictors in neural architecture search (NAS). It introduces CAP, a context-aware neural predictor that pre-trains on unlabeled architectures using a context-aware self-supervised task over graph representations, enabling expressive, generalizable architecture embeddings with few labeled examples. CAP achieves state-of-the-art ranking and efficient search across NAS-Bench-101, NAS-Bench-201, and DARTS spaces, often using substantially fewer annotated architectures than prior predictors. Ablation studies corroborate the effectiveness of the context-aware pre-training and the proposed fine-tuning and loss strategies, highlighting CAP's practical potential for accelerating NAS.
Abstract
Neural predictors are effective in boosting the time-consuming performance evaluation stage in neural architecture search (NAS), owing to their direct estimation of unseen architectures. Despite the effectiveness, training a powerful neural predictor with fewer annotated architectures remains a huge challenge. In this paper, we propose a context-aware neural predictor (CAP) which only needs a few annotated architectures for training based on the contextual information from the architectures. Specifically, the input architectures are encoded into graphs and the predictor infers the contextual structure around the nodes inside each graph. Then, enhanced by the proposed context-aware self-supervised task, the pre-trained predictor can obtain expressive and generalizable representations of architectures. Therefore, only a few annotated architectures are sufficient for training. Experimental results in different search spaces demonstrate the superior performance of CAP compared with state-of-the-art neural predictors. In particular, CAP can rank architectures precisely at the budget of only 172 annotated architectures in NAS-Bench-101. Moreover, CAP can help find promising architectures in both NAS-Bench-101 and DARTS search spaces on the CIFAR-10 dataset, serving as a useful navigator for NAS to explore the search space efficiently.
