EvGNN: An Event-driven Graph Neural Network Accelerator for Edge Vision
Yufeng Yang, Adrian Kneip, Charlotte Frenkel
TL;DR
EvGNN addresses the need for ultra-low-latency, energy-efficient edge vision by enabling end-to-end hardware acceleration of event-driven GNNs. It introduces directed dynamic graphs, a spatiotemporally decoupled prism neighbor search, and layer-parallel execution to perform per-event GNN updates with minimal memory and latency. The approach achieves 87.8% accuracy on N-CARS with an average per-event latency of 16 μs on a Xilinx KV260, and a scalable ASIC-like model projects sub-10 μs latency and low energy for future edge implementations. By contrasting with synchronous GNN accelerators, EvGNN demonstrates how event-driven, local updates can deliver real-time vision intelligence at the edge with compact hardware footprints.
Abstract
Edge vision systems combining sensing and embedded processing promise low-latency, decentralized, and energy-efficient solutions that forgo reliance on the cloud. As opposed to conventional frame-based vision sensors, event-based cameras deliver a microsecond-scale temporal resolution with sparse information encoding, thereby outlining new opportunities for edge vision systems. However, mainstream algorithms for frame-based vision, which mostly rely on convolutional neural networks (CNNs), can hardly exploit the advantages of event-based vision as they are typically optimized for dense matrix-vector multiplications. While event-driven graph neural networks (GNNs) have recently emerged as a promising solution for sparse event-based vision, their irregular structure is a challenge that currently hinders the design of efficient hardware accelerators. In this paper, we propose EvGNN, the first event-driven GNN accelerator for low-footprint, ultra-low-latency, and high-accuracy edge vision with event-based cameras. It relies on three central ideas: (i) directed dynamic graphs exploiting single-hop nodes with edge-free storage, (ii) event queues for the efficient identification of local neighbors within a spatiotemporally decoupled search range, and (iii) a novel layer-parallel processing scheme allowing for a low-latency execution of multi-layer GNNs. We deployed EvGNN on a Xilinx KV260 Ultrascale+ MPSoC platform and benchmarked it on the N-CARS dataset for car recognition, demonstrating a classification accuracy of 87.8% and an average latency per event of 16$μ$s, thereby enabling real-time, microsecond-resolution event-based vision at the edge.
