Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery

Zhenyu Yu; Jinnian Wang; Mohd Yamani Idna Idris

Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery

Zhenyu Yu, Jinnian Wang, Mohd Yamani Idna Idris

TL;DR

The paper tackles the challenge of high-resolution spatial carbon stock estimation from optical remote sensing data. It introduces the Improved Implicit Diffusion Model (IIDM), which fuses knowledge distillation (via KD-VGG and KD-UNet), cross-attention-based feature fusion, and implicit neural representations within a diffusion framework to estimate carbon-stock distribution density from GF-1 WFV imagery. IIDM achieves a leading RMSE of $12.17\%$, representing a $41.69\%$–$42.33\%$ improvement over traditional regression models, and demonstrates substantial reductions in model size and inference time through PCA-based KD and structured pruning. The approach provides high-resolution (16 m) carbon-stock maps suitable for forest-sink regulation and showcases the feasibility of AI-generated content methods for large-scale, data-driven remote sensing applications. The work also validates generalization across regions (e.g., Yunnan Province) and seasons, underscoring IIDM's potential for broad deployment in global carbon monitoring.

Abstract

The forest serves as the most significant terrestrial carbon stock mechanism, effectively reducing atmospheric CO2 concentrations and mitigating climate change. Remote sensing provides high data accuracy and enables large-scale observations. Optical images facilitate long-term monitoring, which is crucial for future carbon stock estimation studies. This study focuses on Huize County, Qujing City, Yunnan Province, China, utilizing GF-1 WFV satellite imagery. The KD-VGG and KD-UNet modules were introduced for initial feature extraction, and the improved implicit diffusion model (IIDM) was proposed. The results showed: (1) The VGG module improved initial feature extraction, improving accuracy, and reducing inference time with optimized model parameters. (2) The Cross-attention + MLPs module enabled effective feature fusion, establishing critical relationships between global and local features, achieving high-accuracy estimation. (3) The IIDM model, a novel contribution, demonstrated the highest estimation accuracy with an RMSE of 12.17%, significantly improving by 41.69% to 42.33% compared to the regression model. In carbon stock estimation, the generative model excelled in extracting deeper features, significantly outperforming other models, demonstrating the feasibility of AI-generated content in quantitative remote sensing. The 16-meter resolution estimates provide a robust basis for tailoring forest carbon sink regulations, enhancing regional carbon stock management.

Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery

TL;DR

Abstract

Improved implicit diffusion model with knowledge distillation to estimate the spatial distribution density of carbon stock in remote sensing imagery

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)