Combining Neural Architecture Search and Automatic Code Optimization: A Survey
Inas Bachiri, Hadjer Benmeziane, Smail Niar, Riyadh Baghdadi, Hamza Ouarnoughi, Abdelkrime Aries
TL;DR
This survey tackles the problem of sub-optimal hardware efficiency when HW-NAS and Automatic Code Optimization are applied in isolation. It proposes NACOS (Hardware Aware-Neural Architecture and Compiler Optimizations co-Search) as a joint co-search framework that simultaneously optimizes neural architectures and their compiler schedules to better reflect hardware performance. The paper provides a taxonomy of two-stage and one-stage NACOS methods, surveys existing approaches, and discusses search strategies, evaluation methodologies, and key challenges such as hardware heterogeneity, generalization, and the lack of benchmarks. By highlighting cross-level synergies and outlining future directions, the work aims to enable more accurate hardware-aware optimization and broader applicability across devices and domains.
Abstract
Deep Learning models have experienced exponential growth in complexity and resource demands in recent years. Accelerating these models for efficient execution on resource-constrained devices has become more crucial than ever. Two notable techniques employed to achieve this goal are Hardware-aware Neural Architecture Search (HW-NAS) and Automatic Code Optimization (ACO). HW-NAS automatically designs accurate yet hardware-friendly neural networks, while ACO involves searching for the best compiler optimizations to apply on neural networks for efficient mapping and inference on the target hardware. This survey explores recent works that combine these two techniques within a single framework. We present the fundamental principles of both domains and demonstrate their sub-optimality when performed independently. We then investigate their integration into a joint optimization process that we call Hardware Aware-Neural Architecture and Compiler Optimizations co-Search (NACOS).
