Data Certification Strategies for Blockchain-based Traceability Systems
Giacomo Zonneveld, Giulia Rafaiani, Massimo Battaglioni, Marco Baldi
TL;DR
The paper tackles the challenge of certifying large, continuously generated data on a public blockchain. It compares a naive per-data hashing approach with a Merkle-tree based aggregation, develops practical Merkle-tree generation/proof extraction methods, and introduces a multi-level indexed Merkle tree to handle blockchain transaction failures efficiently. Through cost models and numerical results, the authors demonstrate that Merkle-tree based certification substantially reduces on-chain transactions for high-volume streams, while still enabling decentralized verification and data integrity. The proposed framework has practical implications for supply-chain traceability and attendance certification in public blockchains, and sets the stage for further security analyses and optimized decentralized authentication using robust Merkle-based structures.
Abstract
The use of blockchains for data certification and traceability is now well established in both the literature and practical applications. However, while blockchain-based certification of individual data is clear and straightforward, the use of blockchain to certify large amounts of data produced on a nearly continuous basis still poses some challenges. In such a case, in fact, it is first necessary to collect the data in an off-chain buffer, and then to organize it, e.g., via Merkle trees, in order to keep the size and quantity of certification data to be written to the blockchain small. In this paper, we consider a typical system for blockchain-based traceability of a production process, and propose and comparatively analyze some strategies for certifying the data of such a process on blockchain, while maintaining the possibility of verifying their certification in a decentralized way.
