Table of Contents
Fetching ...

D-VRE: From a Jupyter-enabled Private Research Environment to Decentralized Collaborative Research Ecosystem

Yuandou Wang, Sheejan Tripathi, Siamak Farshidi, Zhiming Zhao

TL;DR

D-VRE proposes a decentralized virtual research environment that fuses a Jupyter-enabled private research workspace with Ethereum-based blockchain governance to enable secure data sharing and collaborative workflows. The framework defines a conceptual architecture with components for decentralized identity, policy authoring, asset management, and provenance, supported by five logical layers and on-chain metadata. A PoC implements four smart contracts (UserMetadataFactory, UserMetadata, PolicyManager, GroupContract), Lit Protocol-enabled content encryption for IPFS-stored data, and JupyterLab integration with MetaMask, demonstrating practical feasibility on a testnet and validating usability through a user study. While promising in enabling decentralized collaboration, the work also identifies open challenges in trust models, energy efficiency, and community adoption, outlining future work to complete remaining modules and enhance FAIR data governance.

Abstract

Today, scientific research is increasingly data-centric and compute-intensive, relying on data and models across distributed sources. However, it still faces challenges in the traditional cooperation mode, due to the high storage and computing cost, geo-location barriers, and local confidentiality regulations. The Jupyter environment has recently emerged and evolved as a vital virtual research environment for scientific computing, which researchers can use to scale computational analyses up to larger datasets and high-performance computing resources. Nevertheless, existing approaches lack robust support of a decentralized cooperation mode to unlock the full potential of decentralized collaborative scientific research, e.g., seamlessly secure data sharing. In this work, we change the basic structure and legacy norms of current research environments via the seamless integration of Jupyter with Ethereum blockchain capabilities. As such, it creates a Decentralized Virtual Research Environment (D-VRE) from private computational notebooks to decentralized collaborative research ecosystem. We propose a novel architecture for the D-VRE and prototype some essential D-VRE elements for enabling secure data sharing with decentralized identity, user-centric agreement-making, membership, and research asset management. To validate our method, we conducted an experimental study to test all functionalities of D-VRE smart contracts and their gas consumption. In addition, we deployed the D-VRE prototype on a test net of the Ethereum blockchain for demonstration. The feedback from the studies showcases the current prototype's usability, ease of use, and potential and suggests further improvements.

D-VRE: From a Jupyter-enabled Private Research Environment to Decentralized Collaborative Research Ecosystem

TL;DR

D-VRE proposes a decentralized virtual research environment that fuses a Jupyter-enabled private research workspace with Ethereum-based blockchain governance to enable secure data sharing and collaborative workflows. The framework defines a conceptual architecture with components for decentralized identity, policy authoring, asset management, and provenance, supported by five logical layers and on-chain metadata. A PoC implements four smart contracts (UserMetadataFactory, UserMetadata, PolicyManager, GroupContract), Lit Protocol-enabled content encryption for IPFS-stored data, and JupyterLab integration with MetaMask, demonstrating practical feasibility on a testnet and validating usability through a user study. While promising in enabling decentralized collaboration, the work also identifies open challenges in trust models, energy efficiency, and community adoption, outlining future work to complete remaining modules and enhance FAIR data governance.

Abstract

Today, scientific research is increasingly data-centric and compute-intensive, relying on data and models across distributed sources. However, it still faces challenges in the traditional cooperation mode, due to the high storage and computing cost, geo-location barriers, and local confidentiality regulations. The Jupyter environment has recently emerged and evolved as a vital virtual research environment for scientific computing, which researchers can use to scale computational analyses up to larger datasets and high-performance computing resources. Nevertheless, existing approaches lack robust support of a decentralized cooperation mode to unlock the full potential of decentralized collaborative scientific research, e.g., seamlessly secure data sharing. In this work, we change the basic structure and legacy norms of current research environments via the seamless integration of Jupyter with Ethereum blockchain capabilities. As such, it creates a Decentralized Virtual Research Environment (D-VRE) from private computational notebooks to decentralized collaborative research ecosystem. We propose a novel architecture for the D-VRE and prototype some essential D-VRE elements for enabling secure data sharing with decentralized identity, user-centric agreement-making, membership, and research asset management. To validate our method, we conducted an experimental study to test all functionalities of D-VRE smart contracts and their gas consumption. In addition, we deployed the D-VRE prototype on a test net of the Ethereum blockchain for demonstration. The feedback from the studies showcases the current prototype's usability, ease of use, and potential and suggests further improvements.
Paper Structure (28 sections, 8 figures, 2 algorithms)

This paper contains 28 sections, 8 figures, 2 algorithms.

Figures (8)

  • Figure 1: An overview of comparisons of existing research environment exemplars by comparing the ways of identity management (), sharing policy (), assets transfer (), and workflow orchestration ().
  • Figure 2: An overview of the D-VRE conceptual framework with the key components and logical layers.
  • Figure 3: An overview of the D-VRE architecture, in conjunction with the individual's private research environment (left), D-VRE middleware layer and the management of asset, workflow, and infrastructure above (middle), and different potential customers in the blockchain network (right).
  • Figure 4: Example of the GroupContract smart contract.
  • Figure 5: The screenshot of the embedded D-VRE components in the JupyterLab environment: ❶ Auth via MetaMask, ❷ Make Agreements, ❸ Membership, ❹ Asset Manager.
  • ...and 3 more figures