Table of Contents
Fetching ...

ApproxPilot: A GNN-based Accelerator Approximation Framework

Qing Zhang, Cheng Liu, Siting Liu, Yajuan Hui, Huawei Li, Xiaowei Li

TL;DR

This work tackles the challenge of designing energy-efficient hardware accelerators for error-tolerant applications by navigating a vast space of approximate arithmetic units. It introduces ApproxPilot, an end-to-end framework that uses a topology-aware Graph Neural Network to predict PPA and accuracy, complemented by a two-stage critical-path aware model to improve latency forecasts. The framework includes design-space pruning (invalid and redundant candidates), a principled dataset construction with unit- and accelerator-level features, and NSGA-III-based design-space exploration to generate Pareto-optimal configurations. Empirical results on Sobel, Gaussian, and KMeans benchmarks show that ApproxPilot achieves superior Pareto fronts and reduced hardware overhead compared to AutoAX, demonstrating practical benefits for automated accelerator approximation.

Abstract

A typical optimization of customized accelerators for error-tolerant applications such as multimedia, recognition, and classification is to replace traditional arithmetic units like multipliers and adders with the approximate ones to enhance energy efficiency while adhering to accuracy requirements. However, the plethora of arithmetic units and diverse approximate unit options result in an exceedingly large design space. Therefore, there is a pressing need for an end-to-end design framework capable of navigating this intricate design space for approximation optimization. Traditional methods relying on simulation-based or blackbox model evaluations suffer from either high computational costs or limitations in accuracy and scalability, posing significant challenges to the optimization process. In this paper, we propose a Graph Neural Network (GNN) model that leverages the physical connections of arithmetic units to capture their influence on the performance, power, area (PPA), and accuracy of the accelerator. Particularly, we notice that critical path plays a key role in node feature of the GNN model and having it embedded in the feature vector greatly enhances the prediction quality of the models. On top of the models that allow rapid and efficient PPA and accuracy prediction of various approximate accelerator configurations, we can further explore the large design space effectively and build an end-to-end accelerator approximation framework named ApproxPilot to optimize the accelerator approximation. Our experimental results demonstrate that ApproxPilot outperforms state-of-the-art approximation optimization frameworks in both performance and hardware overhead with the same accuracy constraints.

ApproxPilot: A GNN-based Accelerator Approximation Framework

TL;DR

This work tackles the challenge of designing energy-efficient hardware accelerators for error-tolerant applications by navigating a vast space of approximate arithmetic units. It introduces ApproxPilot, an end-to-end framework that uses a topology-aware Graph Neural Network to predict PPA and accuracy, complemented by a two-stage critical-path aware model to improve latency forecasts. The framework includes design-space pruning (invalid and redundant candidates), a principled dataset construction with unit- and accelerator-level features, and NSGA-III-based design-space exploration to generate Pareto-optimal configurations. Empirical results on Sobel, Gaussian, and KMeans benchmarks show that ApproxPilot achieves superior Pareto fronts and reduced hardware overhead compared to AutoAX, demonstrating practical benefits for automated accelerator approximation.

Abstract

A typical optimization of customized accelerators for error-tolerant applications such as multimedia, recognition, and classification is to replace traditional arithmetic units like multipliers and adders with the approximate ones to enhance energy efficiency while adhering to accuracy requirements. However, the plethora of arithmetic units and diverse approximate unit options result in an exceedingly large design space. Therefore, there is a pressing need for an end-to-end design framework capable of navigating this intricate design space for approximation optimization. Traditional methods relying on simulation-based or blackbox model evaluations suffer from either high computational costs or limitations in accuracy and scalability, posing significant challenges to the optimization process. In this paper, we propose a Graph Neural Network (GNN) model that leverages the physical connections of arithmetic units to capture their influence on the performance, power, area (PPA), and accuracy of the accelerator. Particularly, we notice that critical path plays a key role in node feature of the GNN model and having it embedded in the feature vector greatly enhances the prediction quality of the models. On top of the models that allow rapid and efficient PPA and accuracy prediction of various approximate accelerator configurations, we can further explore the large design space effectively and build an end-to-end accelerator approximation framework named ApproxPilot to optimize the accelerator approximation. Our experimental results demonstrate that ApproxPilot outperforms state-of-the-art approximation optimization frameworks in both performance and hardware overhead with the same accuracy constraints.
Paper Structure (18 sections, 4 equations, 6 figures, 8 tables)

This paper contains 18 sections, 4 equations, 6 figures, 8 tables.

Figures (6)

  • Figure 1: ApproxPilot Overview
  • Figure 2: Graph Abstraction
  • Figure 3: Critical Path Aware Model Architecture
  • Figure 4: Pareto frontier curves generated by ApproxPilot and AutoAX.
  • Figure 5: Latency prediction model comparison among random forest, baseline single-stage GNN, and critical path aware two-stage GNN.
  • ...and 1 more figures