The KnowWhereGraph: A Large-Scale Geo-Knowledge Graph for Interdisciplinary Knowledge Discovery and Geo-Enrichment
Rui Zhu, Cogan Shimizu, Shirly Stephen, Colby K. Fisher, Thomas Thelen, Kitty Currier, Krzysztof Janowicz, Pascal Hitzler, Mark Schildhauer, Wenwen Li, Dean Rehberger, Adrita Barua, Antrea Christou, Ling Cai, Abhilekha Dalal, Anthony D'Onofrio, Andrew Eells, Mitchell Faulk, Zilong Liu, Gengchen Mai, Mohammad Saeid Mahdavinejad, Bryce Mecum, Sanaz Saki Norouzi, Meilin Shi, Yuanyuan Tian, Sizhe Wang, Zhangyu Wang, Joseph Zalewski
TL;DR
KnowWhereGraph presents a large-scale, RDF-based geospatial knowledge graph that unifies diverse human–environment datasets through a modular schema centered on space, place, and time. It integrates over 30 open data sources into a cross-domain knowledge warehouse, uses a Discrete Global Grid (DGG) with S2 cells for multi-scale geography, and employs SOSA/SSN and GeoSPARQL ontologies alongside DEO/HIP hazard vocabularies. The authors deliver a suite of tools (Knowledge Explorer, Geo-Enrichment, and domain-specific apps) that enable rapid, geo-enriched queries, visualization, and decision support for disaster response, supply-chain risk, and land valuation. This framework aims to empower decision-makers with AI-ready, semantically rich data and to facilitate global, cross-domain knowledge discovery through scalable, standards-based interoperability.
Abstract
Global challenges such as food supply chain disruptions, public health crises, and natural hazard responses require access to and integration of diverse datasets, many of which are geospatial. Over the past few years, a growing number of (geo)portals have been developed to address this need. However, most existing (geo)portals are stacked by separated or sparsely connected data "silos" impeding effective data consolidation. A new way of sharing and reusing geospatial data is therefore urgently needed. In this work, we introduce KnowWhereGraph, a knowledge graph-based data integration, enrichment, and synthesis framework that not only includes schemas and data related to human and environmental systems but also provides a suite of supporting tools for accessing this information. The KnowWhereGraph aims to address the challenge of data integration by building a large-scale, cross-domain, pre-integrated, FAIR-principles-based, and AI-ready data warehouse rooted in knowledge graphs. We highlight the design principles of KnowWhereGraph, emphasizing the roles of space, place, and time in bridging various data "silos". Additionally, we demonstrate multiple use cases where the proposed geospatial knowledge graph and its associated tools empower decision-makers to uncover insights that are often hidden within complex and poorly interoperable datasets.
