ActNAS : Generating Efficient YOLO Models using Activation NAS

Sudhakar Sah; Ravish Kumar; Darshan C. Ganji; Ehsan Saboori

ActNAS : Generating Efficient YOLO Models using Activation NAS

Sudhakar Sah, Ravish Kumar, Darshan C. Ganji, Ehsan Saboori

TL;DR

This work proposes Activation NAS (Act-NAS)-a Hardware-Aware Neural Architecture Search (HANAS) method that optimizes activation functions per layer for specific hardware, and demonstrates that hardware-aware models learn to leverage architectural and compiler-level optimizations, resulting in highly efficient performance tailored to each hardware platform.

Abstract

Activation functions introduce non-linearity into Neural Networks, enabling them to learn complex patterns. Different activation functions vary in speed and accuracy, ranging from faster but less accurate options like ReLU to slower but more accurate functions like SiLU or SELU. Typically, same activation function is used throughout an entire model architecture. In this paper, we conduct a comprehensive study on the effects of using mixed activation functions in YOLO-based models, evaluating their impact on latency, memory usage, and accuracy across CPU, NPU, and GPU edge devices. We also propose a novel approach that leverages Neural Architecture Search (NAS) to design YOLO models with optimized mixed activation functions.The best model generated through this method demonstrates a slight improvement in mean Average Precision (mAP) compared to baseline model (SiLU), while it is 22.28% faster and consumes 64.15% less memory on the reference NPU device.

ActNAS : Generating Efficient YOLO Models using Activation NAS

TL;DR

Abstract

ActNAS : Generating Efficient YOLO Models using Activation NAS

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)