Recommending Pre-Trained Models for IoT Devices
Parth V. Patil, Wenxin Jiang, Huiyun Peng, Daniel Lugo, Kelechi G. Kalu, Josh LeBlanc, Lawrence Smith, Hyeonwoo Heo, Nathanael Aou, James C. Davis
TL;DR
This work addresses the gap that IoT hardware constraints are underrepresented in pre-trained model (PTM) recommendations. It proposes hardware-aware extensions to the Model Spider framework (Model Spider Fusion and Model Spider Shadow) to jointly consider task relevance and hardware feasibility, complemented by a data collection and Copeland-based ranking strategy. By outlining a concrete dataset and evaluation methodology, the paper lays groundwork for ground-truth IoT device rankings and scalable, energy-aware PTM deployment. The approach promises more reliable, resource-conscious PTM selection for IoT applications, and it sketches a broader research agenda to extend to distributed DL systems and more complex model reuse scenarios.
Abstract
The availability of pre-trained models (PTMs) has enabled faster deployment of machine learning across applications by reducing the need for extensive training. Techniques like quantization and distillation have further expanded PTM applicability to resource-constrained IoT hardware. Given the many PTM options for any given task, engineers often find it too costly to evaluate each model's suitability. Approaches such as LogME, LEEP, and ModelSpider help streamline model selection by estimating task relevance without exhaustive tuning. However, these methods largely leave hardware constraints as future work-a significant limitation in IoT settings. In this paper, we identify the limitations of current model recommendation approaches regarding hardware constraints and introduce a novel, hardware-aware method for PTM selection. We also propose a research agenda to guide the development of effective, hardware-conscious model recommendation systems for IoT applications.
