Local vs. Global: Local Land-Use and Land-Cover Models Deliver Higher Quality Maps
Girmaw Abebe Tadesse, Caleb Robinson, Charles Mwangi, Esther Maina, Joshua Nyakundi, Luana Marotti, Gilles Quentin Hacheme, Hamed Alemohammad, Rahul Dodhia, Juan M. Lavista Ferres
TL;DR
This paper addresses the poor accuracy and inconsistencies of global LULC maps in Africa by introducing a data-centric, teacher-student framework (DATS) that leverages high-resolution Maxar imagery for a local, high-detail teacher and freely available Sentinel-2 data for a scalable student, with knowledge transfer between the models. Applied to Murang'a county, Kenya, the approach yields higher-quality LULC maps than leading global maps, achieving an external F1 of $0.96$ and IoU of $0.86$ for the teacher, and improvements of $0.14$ in F1 and $0.21$ in IoU for the student relative to the best global baseline. The study also highlights substantial inconsistencies among existing global maps when evaluated locally, emphasizing the need for region-specific modeling to support food-security decisions and policy compliance. The proposed framework demonstrates practical impact through downstream use in crop type mapping and yield estimation, with clear plans to scale to larger regions and incorporate temporal dynamics for longitudinal monitoring.
Abstract
In 2023, 58.0% of the African population experienced moderate to severe food insecurity, with 21.6% facing severe food insecurity. Land-use and land-cover maps provide crucial insights for addressing food insecurity by improving agricultural efforts, including mapping and monitoring crop types and estimating yield. The development of global land-cover maps has been facilitated by the increasing availability of earth observation data and advancements in geospatial machine learning. However, these global maps exhibit lower accuracy and inconsistencies in Africa, partly due to the lack of representative training data. To address this issue, we propose a data-centric framework with a teacher-student model setup, which uses diverse data sources of satellite images and label examples to produce local land-cover maps. Our method trains a high-resolution teacher model on images with a resolution of 0.331 m/pixel and a low-resolution student model on publicly available images with a resolution of 10 m/pixel. The student model also utilizes the teacher model's output as its weak label examples through knowledge transfer. We evaluated our framework using Murang'a county in Kenya, renowned for its agricultural productivity, as a use case. Our local models achieved higher quality maps, with improvements of 0.14 in the F1 score and 0.21 in Intersection-over-Union, compared to the best global model. Our evaluation also revealed inconsistencies in existing global maps, with a maximum agreement rate of 0.30 among themselves. Our work provides valuable guidance to decision-makers for driving informed decisions to enhance food security.
