MoCoLSK: Modality Conditioned High-Resolution Downscaling for Land Surface Temperature
Qun Dai, Chunyang Yuan, Yimian Dai, Yuxuan Li, Xiang Li, Kang Ni, Jianhui Xu, Xiangbo Shu, Jian Yang
TL;DR
This work tackles high-resolution LST downscaling by addressing spatial non-stationarity and the lack of open-source tooling. It introduces MoCoLSK-Net, a modality-conditioned dynamic fusion network that combines a Large Selective Kernel pathway with a modality-conditioned weight generator to adapt receptive fields for multi-modal guidance data. The authors also launch GrokLST, an open-source ecosystem comprising a 30 m HR LST benchmark (GrokLST dataset) and a PyTorch toolkit with 40+ baselines to standardize evaluation. On the GrokLST dataset, MoCoLSK-Net achieves state-of-the-art performance across multiple downscaling factors, confirming its ability to capture complex, texture-rich relationships in multispectral data. Overall, the work provides a practical, reproducible pathway for high-resolution LST retrieval with significant implications for environmental monitoring and climate studies.
Abstract
Land Surface Temperature (LST) is a critical parameter for environmental studies, but directly obtaining high spatial resolution LST data remains challenging due to the spatio-temporal trade-off in satellite remote sensing. Guided LST downscaling has emerged as an alternative solution to overcome these limitations, but current methods often neglect spatial non-stationarity, and there is a lack of an open-source ecosystem for deep learning methods. In this paper, we propose the Modality-Conditional Large Selective Kernel (MoCoLSK) Network, a novel architecture that dynamically fuses multi-modal data through modality-conditioned projections. MoCoLSK achieves a confluence of dynamic receptive field adjustment and multi-modal feature fusion, leading to enhanced LST prediction accuracy. Furthermore, we establish the GrokLST project, a comprehensive open-source ecosystem featuring the GrokLST dataset, a high-resolution benchmark, and the GrokLST toolkit, an open-source PyTorch-based toolkit encapsulating MoCoLSK alongside 40+ state-of-the-art approaches. Extensive experimental results validate MoCoLSK's effectiveness in capturing complex dependencies and subtle variations within multispectral data, outperforming existing methods in LST downscaling. Our code, dataset, and toolkit are available at https://github.com/GrokCV/GrokLST.
