Universal Neural Architecture Space: Covering ConvNets, Transformers and Everything in Between

Ondřej Týbl; Lukáš Neumann

Universal Neural Architecture Space: Covering ConvNets, Transformers and Everything in Between

Ondřej Týbl, Lukáš Neumann

TL;DR

The paper tackles the limitation that NAS often fails to outperform hand-crafted architectures and is constrained by narrow benchmarks. It introduces UniNAS, a universal graph-based search space that unifies CNNs, transformers, and hybrids, along with a training-free proxy and a standardized toolkit for reproducible NAS. An architecture search algorithm navigates the UniNAS space to discover novel topologies, including UniNAS-A, which surpasses hand-crafted baselines under identical training conditions. The work demonstrates strong performance on ImageNet and downstream tasks, and provides a publicly available toolkit to foster fair comparisons and broader adoption in NAS research.

Abstract

We introduce Universal Neural Architecture Space (UniNAS), a generic search space for neural architecture search (NAS) which unifies convolutional networks, transformers, and their hybrid architectures under a single, flexible framework. Our approach enables discovery of novel architectures as well as analyzing existing architectures in a common framework. We also propose a new search algorithm that allows traversing the proposed search space, and demonstrate that the space contains interesting architectures, which, when using identical training setup, outperform state-of-the-art hand-crafted architectures. Finally, a unified toolkit including a standardized training and evaluation protocol is introduced to foster reproducibility and enable fair comparison in NAS research. Overall, this work opens a pathway towards systematically exploring the full spectrum of neural architectures with a unified graph-based NAS perspective.

Universal Neural Architecture Space: Covering ConvNets, Transformers and Everything in Between

TL;DR

Abstract

Universal Neural Architecture Space: Covering ConvNets, Transformers and Everything in Between

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)