Self-consistent Deep Geometric Learning for Heterogeneous Multi-source Spatial Point Data Prediction
Dazhou Yu, Xiaoyun Gong, Yun Li, Meikang Qiu, Liang Zhao
TL;DR
The paper tackles multi-source spatial point data prediction under limited ground truth by introducing Deep Multi-source Spatial Prediction (DMSP). It combines a self-supervised mutual-information objective with learnable fidelity scores to weigh heterogeneous data sources and a geo-location-aware multi-source graph neural network to model spatial relationships. Key contributions include a novel fidelity score mechanism, a shared spatial relation encoder, and source-specific GNN convolutions, validated across real and synthetic datasets with ablations. Results show consistent improvements over state-of-the-art methods and provide practical insights into data source quality and spatial integration, with scalable training and robust hyperparameter behavior.
Abstract
Multi-source spatial point data prediction is crucial in fields like environmental monitoring and natural resource management, where integrating data from various sensors is the key to achieving a holistic environmental understanding. Existing models in this area often fall short due to their domain-specific nature and lack a strategy for integrating information from various sources in the absence of ground truth labels. Key challenges include evaluating the quality of different data sources and modeling spatial relationships among them effectively. Addressing these issues, we introduce an innovative multi-source spatial point data prediction framework that adeptly aligns information from varied sources without relying on ground truth labels. A unique aspect of our method is the 'fidelity score,' a quantitative measure for evaluating the reliability of each data source. Furthermore, we develop a geo-location-aware graph neural network tailored to accurately depict spatial relationships between data points. Our framework has been rigorously tested on two real-world datasets and one synthetic dataset. The results consistently demonstrate its superior performance over existing state-of-the-art methods.
