Send Less, Save More: Energy-Efficiency Benchmark of Embedded CNN Inference vs. Data Transmission in IoT
Benjamin Karic, Nina Herrmann, Jan Stenkamp, Paula Scharf, Fabian Gieseke, Angela Schwering
TL;DR
This work benchmarks energy efficiency for IoT environmental monitoring by comparing cloud-based image processing against on-device CNN inference on ESP32-S3, paired with LPWAN transmissions. It evaluates two lightweight CNNs (MobileNetV2, SqueezeNet) trained and quantized for edge deployment on two domain datasets (PlantVillage and Cub), across multiple network protocols (LTE-M, NB-IoT, LoRaWAN) and sending strategies (image vs. inference results). Key findings show that edge inference with result-based transmission can reduce node energy by up to a factor of $5$, and that strategic scheduling (sending results only every few cycles) can yield up to about $14\times$ additional energy savings, significantly extending device lifetimes in remote settings. The results offer practical guidance on model choice, quantization, and network configurations to minimize power use in autonomous environmental monitoring deployments.
Abstract
The integration of the Internet of Things (IoT) and Artificial Intelligence offers significant opportunities to enhance our ability to monitor and address ecological changes. As environmental challenges become increasingly pressing, the need for effective remote monitoring solutions is more critical than ever. A major challenge in designing IoT applications for environmental monitoring - particularly those involving image data - is to create energy-efficient IoT devices capable of long-term operation in remote areas with limited power availability. Advancements in the field of Tiny Machine Learning allow the use of Convolutional Neural Networks (CNNs) on resource-constrained, battery-operated microcontrollers. Since data transfer is energy-intensive, performing inference directly on microcontrollers to reduce the message size can extend the operational lifespan of IoT nodes. This work evaluates the use of common Low Power Wide Area Networks and compressed CNNs trained on domain specific datasets on an ESP32-S3. Our experiments demonstrate, among other things, that executing CNN inference on-device and transmitting only the results reduces the overall energy consumption by a factor of up to five compared to sending raw image data. These findings advocate the development of IoT applications with reduced carbon footprint and capable of operating autonomously in environmental monitoring scenarios by incorporating EmbeddedML.
