Sims: An Interactive Tool for Geospatial Matching and Clustering
Akram Zaytar, Girmaw Abebe Tadesse, Caleb Robinson, Eduardo G. Bendito, Medha Devare, Meklit Chernet, Gilles Q. Hacheme, Rahul Dodhia, Juan M. Lavista Ferres
TL;DR
The paper tackles the bottleneck of slow geospatial feature discovery in large spatio-temporal domains by introducing Sims, a no-code tool that leverages Google Earth Engine to perform clustering and similarity search over user-defined regions. The approach focuses on feature exploration rather than model building, enabling rapid identification of predictive geospatial variables and regions of interest. A Rwanda case study using simulated maize yield data demonstrates how different combinations of soil, weather, and agronomy features yield distinct yield-response zones, with strong statistical separation at $K=5$ (e.g., $p<2\times10^{-16}$). Sims is open-source, reduces computational demands by offloading work to GEE, and supports downstream analysis through downloadable raster outputs, with future work aimed at improving workflow persistence, scalability, and automated feature selection, broadening its practical impact for geospatial modeling and decision support.
Abstract
Acquiring, processing, and visualizing geospatial data requires significant computing resources, especially for large spatio-temporal domains. This challenge hinders the rapid discovery of predictive features, which is essential for advancing geospatial modeling. To address this, we developed Similarity Search (Sims), a no-code web tool that allows users to perform clustering and similarity search over defined regions of interest using Google Earth Engine as a backend. Sims is designed to complement existing modeling tools by focusing on feature exploration rather than model creation. We demonstrate the utility of Sims through a case study analyzing simulated maize yield data in Rwanda, where we evaluate how different combinations of soil, weather, and agronomic features affect the clustering of yield response zones. Sims is open source and available at https://github.com/microsoft/Sims
