Table of Contents
Fetching ...

Physics informed Transformer-VAE for biophysical parameter estimation: PROSAIL model inversion in Sentinel-2 imagery

Prince Mensah, Pelumi Victor Aderinto, Ibrahim Salihu Yusuf, Arnu Pretorius

TL;DR

This work addresses the challenge of retrieving canopy biophysical variables from satellite data without needing real‑image labels for training. It introduces a physics‑informed Transformer‑VAE that embeds the PROSAIL radiative transfer model as a differentiable decoder, trained exclusively on synthetic PROSAIL simulations to infer posterior distributions over canopy parameters such as $LAI$ and $CCC$. The model demonstrates competitive accuracy on real field datasets (FRM4Veg and BelSAR) compared with methods trained on real imagery, while providing uncertainty quantification via the latent distribution. By coupling physical model constraints with a Transformer encoder, the approach achieves physically plausible inversions and suggests a viable path to global, calibration‑free vegetation trait products from Sentinel‑2 imagery, with potential extensions to hyperspectral data and additional RTMs.

Abstract

Accurate retrieval of vegetation biophysical variables from satellite imagery is crucial for ecosystem monitoring and agricultural management. In this work, we propose a physics-informed Transformer-VAE architecture to invert the PROSAIL radiative transfer model for simultaneous estimation of key canopy parameters from Sentinel-2 data. Unlike previous hybrid approaches that require real satellite images for self-supevised training. Our model is trained exclusively on simulated data, yet achieves performance on par with state-of-the-art methods that utilize real imagery. The Transformer-VAE incorporates the PROSAIL model as a differentiable physical decoder, ensuring that inferred latent variables correspond to physically plausible leaf and canopy properties. We demonstrate retrieval of leaf area index (LAI) and canopy chlorophyll content (CCC) on real-world field datasets (FRM4Veg and BelSAR) with accuracy comparable to models trained with real Sentinel-2 data. Our method requires no in-situ labels or calibration on real images, offering a cost-effective and self-supervised solution for global vegetation monitoring. The proposed approach illustrates how integrating physical models with advanced deep networks can improve the inversion of RTMs, opening new prospects for large-scale, physically-constrained remote sensing of vegetation traits.

Physics informed Transformer-VAE for biophysical parameter estimation: PROSAIL model inversion in Sentinel-2 imagery

TL;DR

This work addresses the challenge of retrieving canopy biophysical variables from satellite data without needing real‑image labels for training. It introduces a physics‑informed Transformer‑VAE that embeds the PROSAIL radiative transfer model as a differentiable decoder, trained exclusively on synthetic PROSAIL simulations to infer posterior distributions over canopy parameters such as and . The model demonstrates competitive accuracy on real field datasets (FRM4Veg and BelSAR) compared with methods trained on real imagery, while providing uncertainty quantification via the latent distribution. By coupling physical model constraints with a Transformer encoder, the approach achieves physically plausible inversions and suggests a viable path to global, calibration‑free vegetation trait products from Sentinel‑2 imagery, with potential extensions to hyperspectral data and additional RTMs.

Abstract

Accurate retrieval of vegetation biophysical variables from satellite imagery is crucial for ecosystem monitoring and agricultural management. In this work, we propose a physics-informed Transformer-VAE architecture to invert the PROSAIL radiative transfer model for simultaneous estimation of key canopy parameters from Sentinel-2 data. Unlike previous hybrid approaches that require real satellite images for self-supevised training. Our model is trained exclusively on simulated data, yet achieves performance on par with state-of-the-art methods that utilize real imagery. The Transformer-VAE incorporates the PROSAIL model as a differentiable physical decoder, ensuring that inferred latent variables correspond to physically plausible leaf and canopy properties. We demonstrate retrieval of leaf area index (LAI) and canopy chlorophyll content (CCC) on real-world field datasets (FRM4Veg and BelSAR) with accuracy comparable to models trained with real Sentinel-2 data. Our method requires no in-situ labels or calibration on real images, offering a cost-effective and self-supervised solution for global vegetation monitoring. The proposed approach illustrates how integrating physical models with advanced deep networks can improve the inversion of RTMs, opening new prospects for large-scale, physically-constrained remote sensing of vegetation traits.

Paper Structure

This paper contains 15 sections, 6 equations, 5 figures, 2 tables.

Figures (5)

  • Figure 1: End‐to‐end architecture: (a) training with transformer-VAE & PROSAIL decoder, (b) inference and validation.
  • Figure 2: Spatial distribution of field validation points at the Las Tiesas-Barrax agricultural test site in central Spain, with measurements collected during two separate campaigns in 2018 (green points) and 2021 (red points).
  • Figure 3: Field validation points at Wytham Woods, showing the distribution of in-situ LAI and CCC measurements (red points) throughout the ancient deciduous forest site.
  • Figure 4: Agricultural test site used for the BelSAR campaign in central Belgium, showing a mosaic of crop fields with validation sites highlighted in red.
  • Figure 5: Top: Predicted versus in-situ measured Leaf Area Index (LAI) across all validation sites and land cover types. Bottom: Predicted versus in-situ measured Canopy Chlorophyll Content (CCC) for the same sites and land covers. Each point represents a field measurement, colored by land cover class and shaped by site. Error bars indicate prediction uncertainty. The black line is the 1:1 line (perfect agreement), and the red line is the best-fit regression.