Comparing Deep Learning Models for Rice Mapping in Bhutan Using High Resolution Satellite Imagery
Biplov Bhandari, Timothy Mayer
TL;DR
This study tackles high-resolution rice mapping in Paro, Bhutan, by comparing two DL paradigms—DNN (pixel-based) and U-Net (patch-based)—across multiple input configurations built from NICFI Planet imagery and supplementary data. The results show that U-Net consistently outperforms DNN, with the best validation performance achieved by the RGBNE configuration (and related RGBN/RGBNES variants), while independent validation highlights model- and metric-dependent variability. Weak labeling via SERVIR's RLCMS helps address class imbalance and enhances sampling design for deep learning, enabling 10 m rice maps that can complement DoA survey approaches. Additional features such as NDVI, EVI, and NDWI provided limited improvement, suggesting that core spectral information and spatial context suffice for effective rice mapping at this resolution, though future work could explore time-series and transformer-based approaches for phenology-driven mapping.
Abstract
The Bhutanese government is increasing its utilization of technological approaches such as including Remote Sensing-based knowledge in their decision-making process. This study focuses on crop type and crop extent in Paro, one of the top rice-yielding districts in Bhutan, and employs publicly available NICFI high-resolution satellite imagery from Planet. Two Deep Learning (DL) approaches, point-based (DNN) and patch-based (U-Net), models were used in conjunction with cloud-computing platforms. Three different models per DL approaches (DNN and U-Net) were trained: 1) RGBN channels from Planet; 2) RGBN and elevation data (RGBNE); 3) RGBN and Sentinel-1 (S1) data (RGBNS), and RGBN with E and S1 data (RGBNES). From this comprehensive analysis, the U-Net displayed higher performance metrics across both model training and model validation efforts. Among the U-Net model sets, the RGBN, RGBNE, RGBNS, and RGBNES models had an F1-score of 0.8546, 0.8563, 0.8467, and 0.8500 respectively. An independent model evaluation was performed and found a high level of performance variation across all the metrics. For this independent model evaluation, the U-Net RGBN, RGBNE, RGBNES, and RGBN models displayed the F1-scores of 0.5935, 0.6154, 0.5882, and 0.6582, suggesting U-Net RGBNES as the best model. The study shows that the DL approaches can predict rice. Also, DL methods can be used with the survey-based approaches currently utilized by the Bhutan Department of Agriculture. Further, this study demonstrated the usage of regional land cover products such as SERVIR's RLCMS as a weak label approach to capture different strata addressing the class imbalance problem and improving the sampling design for DL application. Finally, through preliminary model testing and comparisons outlined it was shown that using additional features such as NDVI, EVI, and NDWI did not drastically improve model performance.
