Digital Agriculture Sandbox for Collaborative Research
Osama Zafar, Rosemarie Santa González, Alfonso Morales, Erman Ayday
TL;DR
The paper addresses privacy barriers to agricultural data sharing that impede data-driven research and innovation. It introduces the Digital Agriculture Sandbox, a privacy-preserving, web-based platform that enables secure collaboration through federated learning, local differential privacy, and Principal Component Analysis. Key contributions include a modular, containerized architecture with identity management via TACC, privacy-preserving data operations (upload, similarity discovery, and distributed model training), and risk-aware model governance via a repository and analytics tools. The framework demonstrates practical use cases for farmers and researchers and outlines future work to expand models, explanations, risk analyses, safeguards, and a data-sharing component that preserves privacy.
Abstract
Digital agriculture is transforming the way we grow food by utilizing technology to make farming more efficient, sustainable, and productive. This modern approach to agriculture generates a wealth of valuable data that could help address global food challenges, but farmers are hesitant to share it due to privacy concerns. This limits the extent to which researchers can learn from this data to inform improvements in farming. This paper presents the Digital Agriculture Sandbox, a secure online platform that solves this problem. The platform enables farmers (with limited technical resources) and researchers to collaborate on analyzing farm data without exposing private information. We employ specialized techniques such as federated learning, differential privacy, and data analysis methods to safeguard the data while maintaining its utility for research purposes. The system enables farmers to identify similar farmers in a simplified manner without needing extensive technical knowledge or access to computational resources. Similarly, it enables researchers to learn from the data and build helpful tools without the sensitive information ever leaving the farmer's system. This creates a safe space where farmers feel comfortable sharing data, allowing researchers to make important discoveries. Our platform helps bridge the gap between maintaining farm data privacy and utilizing that data to address critical food and farming challenges worldwide.
