AyE-Edge: Automated Deployment Space Search Empowering Accuracy yet Efficient Real-Time Object Detection on the Edge
Chao Wu, Yifan Gong, Liangkai Liu, Mengquan Li, Yushu Wu, Xuan Shen, Zhimin Li, Geng Yuan, Weisong Shi, Yanzhi Wang
TL;DR
AyE-Edge tackles the fundamental trade-off in edge object detection among accuracy, latency, and power by introducing an automated deployment-space search across data, model, and hardware layers. It combines a tripartite framework—an optimized deployment space with T-Locality keyframes, a performance collector with a latency predictor, and a MARL-driven coordinator—to discover Pareto-optimal configurations under real-time constraints. The approach yields substantial power savings (up to 96.7% reductions against SOTA baselines) while maintaining competitive accuracy and throughput on mobile hardware. This work introduces a first-of-its-kind, coordinated pipeline for real-time Edge-OD that jointly optimizes input framing, model pruning, and DVFS across CPU and GPU, validated on real devices and public datasets, with practical implications for energy-aware edge AI systems.
Abstract
Object detection on the edge (Edge-OD) is in growing demand thanks to its ever-broad application prospects. However, the development of this field is rigorously restricted by the deployment dilemma of simultaneously achieving high accuracy, excellent power efficiency, and meeting strict real-time requirements. To tackle this dilemma, we propose AyE-Edge, the first-of-this-kind development tool that explores automated algorithm-device deployment space search to realize Accurate yet power-Efficient real-time object detection on the Edge. Through a collaborative exploration of keyframe selection, CPU-GPU configuration, and DNN pruning strategy, AyE-Edge excels in extensive real-world experiments conducted on a mobile device. The results consistently demonstrate AyE-Edge's effectiveness, realizing outstanding real-time performance, detection accuracy, and notably, a remarkable 96.7% reduction in power consumption, compared to state-of-the-art (SOTA) competitors.
