IoT-MCP: Bridging LLMs and IoT Systems Through Model Context Protocol
Ningyuan Yang, Guanliang Lyu, Mingchen Ma, Yiyi Lu, Yiming Li, Zhihui Gao, Hancheng Ye, Jianyi Zhang, Tingjun Chen, Yiran Chen
TL;DR
The paper tackles the challenge of integrating LLMs with heterogeneous IoT devices by proposing IoT-MCP, a decoupled three-domain framework that implements the Model Context Protocol (MCP) to standardize interactions between LLMs and physical devices. It introduces IoT-MCP Bench, a large-scale evaluation suite with 1,254 tasks across 22 sensors and 6 MCUs, assessing tool invocation, latency, and memory under realistic conditions. The framework demonstrates high performance, achieving 100% task success and about 205 ms average latency with modest memory footprints, validated through extensive experiments and a 12-hour real-world deployment. Key contributions include the decoupled Local Host, Datapool/Connection Server, and IoT Device architecture, an open-source implementation, and a rigorous benchmark methodology that enables reproducible evaluation of LLM-IoT systems. Overall, the work provides a practical, scalable pathway for real-world LLM-to-IoT integration with standardized assessment.
Abstract
The integration of Large Language Models (LLMs) with Internet-of-Things (IoT) systems faces significant challenges in hardware heterogeneity and control complexity. The Model Context Protocol (MCP) emerges as a critical enabler, providing standardized communication between LLMs and physical devices. We propose IoT-MCP, a novel framework that implements MCP through edge-deployed servers to bridge LLMs and IoT ecosystems. To support rigorous evaluation, we introduce IoT-MCP Bench, the first benchmark containing 114 Basic Tasks (e.g., ``What is the current temperature?'') and 1,140 Complex Tasks (e.g., ``I feel so hot, do you have any ideas?'') for IoT-enabled LLMs. Experimental validation across 22 sensor types and 6 microcontroller units demonstrates IoT-MCP's 100% task success rate to generate tool calls that fully meet expectations and obtain completely accurate results, 205ms average response time, and 74KB peak memory footprint. This work delivers both an open-source integration framework (https://github.com/Duke-CEI-Center/IoT-MCP-Servers) and a standardized evaluation methodology for LLM-IoT systems.
