Surprisingly Strong Performance Prediction with Neural Graph Features
Gabriela Kadlecová, Jovita Lukasik, Martin Pilát, Petra Vidnerová, Mahmoud Safari, Roman Neruda, Frank Hutter
TL;DR
The paper addresses NAS performance prediction by identifying biases and limitations in zero-cost proxies and proposing neural graph features (GRAF) as a fast, interpretable graph-based descriptor of architectures. GRAF encodes operation counts, path lengths, and node degrees from architecture graphs and, when used with a simple random forest, often outperforms zero-cost proxies and common encodings across accuracy, hardware, and robustness tasks; combining GRAF with zero-cost proxies yields the best results in many settings. The work provides extensive ablations, feature-importance analyses, and redundancy studies, showing that different tasks favor different architectural properties and that GRAF offers a practical, scalable baseline with broad applicability in NAS. Overall, GRAF acts as a strong, lightweight predictor that complements existing methods and can guide more efficient NAS, while highlighting avenues for designing task-tailored, graph-based proxies for future research.
Abstract
Performance prediction has been a key part of the neural architecture search (NAS) process, allowing to speed up NAS algorithms by avoiding resource-consuming network training. Although many performance predictors correlate well with ground truth performance, they require training data in the form of trained networks. Recently, zero-cost proxies have been proposed as an efficient method to estimate network performance without any training. However, they are still poorly understood, exhibit biases with network properties, and their performance is limited. Inspired by the drawbacks of zero-cost proxies, we propose neural graph features (GRAF), simple to compute properties of architectural graphs. GRAF offers fast and interpretable performance prediction while outperforming zero-cost proxies and other common encodings. In combination with other zero-cost proxies, GRAF outperforms most existing performance predictors at a fraction of the cost.
