PolyLink: A Blockchain Based Decentralized Edge AI Platform for LLM Inference
Hongbo Liu, Jiannong Cao, Bo Yang, Dongbin Bai, Yinfeng Cao, Xiaoming Shen, Yinan Zhang, Jinwen Liang, Shan Jiang, Mingjin Zhang
TL;DR
PolyLink tackles the centralization of LLM inference by introducing a blockchain-driven decentralized edge AI platform that supports single- and cross-device execution, a TIQE protocol for trustless quality evaluation, and a dynamic token-based incentive model. It combines EdgeShard-based cross-device inference with a hybrid quality assessment (Cross-Encoder + LLM-as-a-Judge) to balance latency and accuracy, and uses VRF-based validator elections with median-score consensus and slashing to maintain integrity. Real-world deployment across geo-distributed edge devices demonstrates practical latency, scalable TIQE performance, and reward distributions that align with contribution quality and stake. The work advances decentralized AI deployment in DePIN contexts and offers a scalable, incentive-aligned framework for edge-based LLM services, while highlighting constraints such as the <1/3 security assumption and network-induced latency in cross-device setups.
Abstract
The rapid advancement of large language models (LLMs) in recent years has revolutionized the AI landscape. However, the deployment and usage of LLM services remain highly centralized, creating significant trust issues and costs for end users and developers. To address these issues, we propose PolyLink, a blockchain-based decentralized AI platform that decentralizes LLM development and inference. Specifically, PolyLink introduces a decentralized crowdsourcing architecture that supports single-device and cross-device model deployment and inference across heterogeneous edge devices. Moreover, to ensure inference integrity, we design the TIQE protocol, which combines a lightweight cross-encoder model with an LLM-as-a-Judge for high-accuracy inference evaluation. Lastly, we integrate a comprehensive token-based incentive model with dynamic pricing and reward mechanisms for all participants. We have deployed PolyLink and conducted an extensive real-world evaluation through geo-distributed deployment across heterogeneous devices. Results indicate that inference and verification latencies are practical. Our security analysis demonstrates that the system is resistant to model degradation attacks and validator corruption. PolyLink is now available at https://github.com/IMCL-PolyLink/PolyLink.
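To make the median-score consensus with slashing mentioned in the TL;DR more concrete, the following Python sketch shows one way such a rule could work. It is an illustration under stated assumptions, not PolyLink's implementation: the function name `settle_scores`, the `tolerance` and `slash_fraction` parameters, and the scoring range are all hypothetical and do not come from the paper.

```python
from statistics import median


def settle_scores(scores, stakes, tolerance=0.2, slash_fraction=0.1):
    """Aggregate validator quality scores by median and slash outliers.

    scores: dict mapping validator id -> quality score (assumed in [0, 1])
    stakes: dict mapping validator id -> staked tokens
    tolerance, slash_fraction: hypothetical parameters for this sketch
    """
    if not scores:
        raise ValueError("at least one validator score is required")

    # The median is unaffected by a minority of extreme scores,
    # which is why it is a natural choice for consensus.
    final_score = median(scores.values())

    new_stakes = dict(stakes)
    slashed = []
    for validator, score in scores.items():
        # A validator deviating too far from the consensus loses
        # part of its stake (assumed penalty rule).
        if abs(score - final_score) > tolerance:
            new_stakes[validator] = stakes[validator] * (1 - slash_fraction)
            slashed.append(validator)
    return final_score, new_stakes, slashed


# Five validators, one of which (v5) reports a dishonest score.
scores = {"v1": 0.82, "v2": 0.80, "v3": 0.78, "v4": 0.81, "v5": 0.10}
stakes = {v: 100.0 for v in scores}
final_score, new_stakes, slashed = settle_scores(scores, stakes)
print(final_score, slashed)
```

In this example one of five validators misreports, which stays below a one-third bound, so the median remains at an honest validator's score and only the outlier is penalized. If dishonest validators formed a majority, the median itself could be shifted, which is the kind of limit a bounded-corruption assumption is meant to exclude.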
