Younger: The First Dataset for Artificial Intelligence-Generated Neural Network Architecture
Zhengxin Yang, Wanling Gao, Luzhou Peng, Yunyou Huang, Fei Tang, Jianfeng Zhan
TL;DR
Younger introduces a real-world DAG-based dataset of 7,629 neural network architectures derived from ~174K models across 30+ tasks to propel Artificial Intelligence-Generated Neural Network Architecture (AIGNNA). It formalizes two research paradigms: Local, for refinement of existing components, and Global, for end-to-end design from scratch, and demonstrates their feasibility with graph neural network baselines and a diffusion-based generator. The dataset, built from ONNX-converted models and deduplicated via Weisfeiler–Lehman hashing, also serves as a rich benchmark for GNN research due to its diverse, operator-level graph structures. Public release and an extension platform aim to democratize research and accelerate innovation in automated neural architecture design and GNN benchmarking.
Abstract
Designing and optimizing neural network architectures typically requires extensive expertise, starting with handcrafted designs and then manual or automated refinement. This dependency presents a significant barrier to rapid innovation. Recognizing the complexity of automatically generating neural network architecture from scratch, we introduce Younger, a pioneering dataset to advance this ambitious goal. Derived from over 174K real-world models across more than 30 tasks from various public model hubs, Younger includes 7,629 unique architectures, and each is represented as a directed acyclic graph with detailed operator-level information. The dataset facilitates two primary design paradigms: global, for creating complete architectures from scratch, and local, for detailed architecture component refinement. By establishing these capabilities, Younger contributes to a new frontier, Artificial Intelligence-Generated Neural Network Architecture (AIGNNA). Our experiments explore the potential and effectiveness of Younger for automated architecture generation and, as a secondary benefit, demonstrate that Younger can serve as a benchmark dataset, advancing the development of graph neural networks. We release the dataset and code publicly to lower the entry barriers and encourage further research in this challenging area.
