Table of Contents
Fetching ...

DISL: Fueling Research with A Large Dataset of Solidity Smart Contracts

Gabriele Morello, Mojtaba Eshghie, Sofia Bobadilla, Martin Monperrus

TL;DR

The DISL dataset features a collection of $514,506$ unique Solidity files that have been deployed to Ethereum mainnet and surpasses existing datasets in size and recency.

Abstract

The DISL dataset features a collection of $514,506$ unique Solidity files that have been deployed to Ethereum mainnet. It caters to the need for a large and diverse dataset of real-world smart contracts. DISL serves as a resource for developing machine learning systems and for benchmarking software engineering tools designed for smart contracts. By aggregating every verified smart contract from Etherscan up to January 15, 2024, DISL surpasses existing datasets in size and recency.

DISL: Fueling Research with A Large Dataset of Solidity Smart Contracts

TL;DR

The DISL dataset features a collection of unique Solidity files that have been deployed to Ethereum mainnet and surpasses existing datasets in size and recency.

Abstract

The DISL dataset features a collection of unique Solidity files that have been deployed to Ethereum mainnet. It caters to the need for a large and diverse dataset of real-world smart contracts. DISL serves as a resource for developing machine learning systems and for benchmarking software engineering tools designed for smart contracts. By aggregating every verified smart contract from Etherscan up to January 15, 2024, DISL surpasses existing datasets in size and recency.
Paper Structure (11 sections, 3 tables)