ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge
Daghash K. Alqahtani, Maria A. Rodriguez, Muhammad Aamir Cheema, Hamid Rezatofighi, Adel N. Toosi
TL;DR
ECORE addresses energy-constrained edge vision by introducing context-aware routing that dynamically directs image frames to the most suitable heterogeneous edge device–model pair. It uses lightweight object-count estimators at a central gateway and a greedy routing rule constrained by a tolerance δmAP, formalized as the feasible set ${\mathcal{F}}=\{ i\in \mathcal{M}_G \mid \text{mAP}_i \ge \text{mAP}_{\max}-\delta_{\text{mAP}}\}$ with $i^*=\arg\min_{i\in\mathcal{F}} e_i$, and provides proofs of optimality under these conditions. The approach introduces three object-count estimation methods—Edge Detection via Canny, an SSD-based gateway front-end, and an Output-Based reuse mechanism—to keep routing lightweight. Empirical evaluation on COCO, a balanced dataset, and a pedestrian video demonstrates notable energy and latency reductions (up to 35% and 49%, respectively) with only about a 2% drop in mAP compared with accuracy-focused baselines, validating the practical impact of content-aware routing for energy-efficient edge AI.
Abstract
Edge computing enables data processing closer to the source, significantly reducing latency, an essential requirement for real-time vision-based analytics such as object detection in surveillance and smart city environments. However, these tasks place substantial demands on resource-constrained edge devices, making the joint optimization of energy consumption and detection accuracy critical. To address this challenge, we propose ECORE, a framework that integrates multiple dynamic routing strategies, including a novel estimation-based techniques and an innovative greedy selection algorithm, to direct image processing requests to the most suitable edge device-model pair. ECORE dynamically balances energy efficiency and detection performance based on object characteristics. We evaluate our framework through extensive experiments on real-world datasets, comparing against widely used baseline techniques. The evaluation leverages established object detection models (YOLO, SSD, EfficientDet) and diverse edge platforms, including Jetson Orin Nano, Raspberry Pi 4 and 5, and TPU accelerators. Results demonstrate that our proposed context-aware routing strategies can reduce energy consumption and latency by 35% and 49%, respectively, while incurring only a 2% loss in detection accuracy compared to accuracy-centric methods.
