Sustainable Open-Data Management for Field Research: A Cloud-Based Approach in the Underlandscape Project
Augusto Ciuffoletti, Letizia Chiti
TL;DR
The paper tackles the problem of long-term accessibility for field-based research data by proposing a cloud-centric, low-maintenance ICT architecture that relies on public cloud services and a lightweight, GitHub-hosted data workflow. It defines a GeoJSON-based dataset format with six feature types and metadata, stored as standalone files in a Master repository, and supports on-site and desk-based data creation, offline editing with a dedicated editor, and multiple delivery methods (uMap visualizations, dataset repositories, and a visitor-focused Turista app). Key contributions include the Off editor, the gaiaSplit.js conversion tool, the repo_sync.py and umap_sync.py automation scripts, and a dual delivery pathway that combines interactive maps with structured repositories to ensure both usability and longevity. The approach emphasizes sustainability by minimizing bespoke infrastructure, enabling offline operation, and maintaining data in a stable, openly accessible platform (GitHub) while planning for future scalability and integration with additional tools. This work provides a practical reference model for similar projects aiming to preserve open research data beyond project lifetimes, with broad applicability to multidisciplinary field studies and open-data ecosystems.
Abstract
Field-based research projects require a robust suite of ICT services to support data acquisition, documentation, storage, and dissemination. A key challenge lies in ensuring the sustainability of data management - not only during the project's funded period but also beyond its conclusion, when maintenance and support often depend on voluntary efforts. In the Underlandscape project, we tackled this challenge by extensively leveraging public cloud services while minimizing reliance on complex custom infrastructure. This paper provides a comprehensive overview of the project's final infrastructure, detailing the adopted data formats, the cloud-based solutions enabling data management, and the custom applications developed for system integration.
