CarbonCP: Carbon-Aware DNN Partitioning with Conformal Prediction for Sustainable Edge Intelligence
Hongyu Ke, Wanxin Jin, Haoxin Wang
TL;DR
The paper tackles the problem of high operational carbon emissions in end-to-end edge intelligence by proposing CarbonCP, a context-adaptive, carbon-aware, and uncertainty-aware DNN partitioning framework built on conformal prediction. It introduces an implicit predictor $Q_{\theta}$ to estimate partition performance, a CP-based score function, and a weighted CP scheme to handle covariate shift between calibration and test contexts. The approach yields prediction intervals for the optimal partition with statistically guaranteed coverage and enables three decision modes (R, M, A), with CarbonCP-A delivering the best carbon-footprint reductions in dynamic settings. Experimental results on a ResNet-18 deployment demonstrate substantial carbon reductions (up to $58.8\%$) while maintaining competitive latency and energy metrics, and show that carbon intensity and edge resource usage are critical levers for improvement.
Abstract
This paper presents a solution to address carbon emission mitigation for end-to-end edge computing systems, including the computing at battery-powered edge devices and servers, as well as the communications between them. We design and implement, CarbonCP, a context-adaptive, carbon-aware, and uncertainty-aware AI inference framework built upon conformal prediction theory, which balances operational carbon emissions, end-to-end latency, and battery consumption of edge devices through DNN partitioning under varying system processing contexts and carbon intensity. Our experimental results demonstrate that CarbonCP is effective in substantially reducing operational carbon emissions, up to 58.8%, while maintaining key user-centric performance metrics with only 9.9% error rate.
