CoLLM-NAS: Collaborative Large Language Models for Efficient Knowledge-Guided Neural Architecture Search

Zhe Li; Zhiwei Lin; Yongtao Wang

TL;DR

CoLLM-NAS tackles the inefficiency and architectural-invalidity problems of LLM-based neural architecture search with a collaborative two-stage framework that pairs a stateful Navigator LLM with a stateless Generator LLM, coordinated by a Coordinator module and guided by inherited supernet weights. This knowledge-guided search combines LLM priors with progressive trajectory information to steer the search quickly toward high-performing architectures across multiple search spaces, including ImageNet-scale tasks. Empirical results show state-of-the-art or competitive performance on ImageNet and NAS-Bench-201, with a 3–10x reduction in search cost and robustness across ablations, prompt variations, and LLM choices. The framework also generalizes across different NAS baselines (OFA, SPOS, AutoFormer) and search spaces (MobileNet, ShuffleNet, AutoFormer), suggesting broad applicability of trajectory-to-strategy-to-solution search in neural architecture design.

Abstract

The integration of Large Language Models (LLMs) with Neural Architecture Search (NAS) has introduced new possibilities for automating the design of neural architectures. However, most existing methods face critical limitations, including architectural invalidity, computational inefficiency, and inferior performance compared to traditional NAS. In this work, we present Collaborative LLM-based NAS (CoLLM-NAS), a two-stage NAS framework with knowledge-guided search driven by two complementary LLMs. Specifically, we propose a Navigator LLM to guide search direction and a Generator LLM to synthesize high-quality candidates, with a dedicated Coordinator module to manage their interaction. CoLLM-NAS efficiently guides the search process by combining LLMs' inherent knowledge of structured neural architectures with progressive knowledge from iterative feedback and historical trajectory. Experimental results on ImageNet and NAS-Bench-201 show that CoLLM-NAS surpasses existing NAS methods and conventional search algorithms, achieving new state-of-the-art results. Furthermore, CoLLM-NAS consistently enhances the performance and efficiency of various two-stage NAS methods (e.g., OFA, SPOS, and AutoFormer) across diverse search spaces (e.g., MobileNet, ShuffleNet, and AutoFormer), demonstrating its excellent generalization.
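The division of labor described above (a stateful Navigator, a stateless Generator, and a Coordinator that manages their interaction) can be illustrated with a minimal sketch. This is not the authors' implementation: both LLMs are replaced by deterministic stubs, the four-layer search space is a toy, and every name here (`Navigator`, `Generator`, `coordinate`, `proxy_score`, `is_valid`) is hypothetical. `proxy_score` merely stands in for evaluating a candidate subnet with inherited supernet weights.

```python
import random
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

Arch = Tuple[int, ...]  # one operation index per layer (toy encoding, assumed)

# Toy search space (assumption): 4 layers, 3 candidate operations per layer.
NUM_LAYERS, NUM_CHOICES = 4, 3


def is_valid(arch: Arch) -> bool:
    """Validity check applied by the coordinator before any evaluation."""
    return len(arch) == NUM_LAYERS and all(0 <= c < NUM_CHOICES for c in arch)


def proxy_score(arch: Arch) -> float:
    """Placeholder for scoring a subnet with inherited supernet weights."""
    return float(sum(arch))


@dataclass
class Navigator:
    """Stub for the stateful Navigator: keeps the trajectory, emits a strategy."""
    history: List[Tuple[Arch, float]] = field(default_factory=list)

    def update(self, results: List[Tuple[Arch, float]]) -> None:
        self.history.extend(results)

    def strategy(self) -> Dict[str, Optional[Arch]]:
        if not self.history:
            return {"anchor": None}  # no feedback yet: explore freely
        best_arch, _ = max(self.history, key=lambda r: r[1])
        return {"anchor": best_arch}  # search near the best architecture so far


class Generator:
    """Stub for the stateless Generator: sees only the current strategy."""

    def __init__(self, rng: random.Random) -> None:
        self.rng = rng  # randomness source only; no search history is kept

    def propose(self, strategy: Dict[str, Optional[Arch]], n: int) -> List[Arch]:
        anchor = strategy["anchor"]
        proposals = []
        for _ in range(n):
            if anchor is None:
                arch = tuple(self.rng.randrange(NUM_CHOICES) for _ in range(NUM_LAYERS))
            else:
                mutated = list(anchor)
                i = self.rng.randrange(NUM_LAYERS)
                # Deliberately allows an out-of-range index so that the
                # coordinator's validity filter has something to reject.
                mutated[i] = self.rng.randrange(NUM_CHOICES + 1)
                arch = tuple(mutated)
            proposals.append(arch)
        return proposals


def coordinate(rounds: int = 5, per_round: int = 4, seed: int = 0) -> Tuple[Arch, float]:
    """Coordinator loop: strategy -> candidates -> validate -> evaluate -> feedback."""
    rng = random.Random(seed)
    navigator, generator = Navigator(), Generator(rng)
    for _ in range(rounds):
        strategy = navigator.strategy()
        candidates = generator.propose(strategy, per_round)
        valid = [a for a in candidates if is_valid(a)]
        navigator.update([(a, proxy_score(a)) for a in valid])
    return max(navigator.history, key=lambda r: r[1])
```

Calling `coordinate(seed=0)` returns the best `(architecture, score)` pair found. In a real system, the two stubs would be replaced by LLM calls and `proxy_score` by an actual evaluation of the subnet on validation data.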
