Spatioformer: A Geo-encoded Transformer for Large-Scale Plant Species Richness Prediction

Yiqing Guo; Karel Mokany; Shaun R. Levick; Jinyan Yang; Peyman Moghadam

Spatioformer: A Geo-encoded Transformer for Large-Scale Plant Species Richness Prediction

Yiqing Guo, Karel Mokany, Shaun R. Levick, Jinyan Yang, Peyman Moghadam

TL;DR

The paper tackles the problem of mapping plant species richness at continental scales where $eta$-diversity causes location-dependent relationships with spectral data. It introduces Spatioformer, a transformer augmented with a geolocation encoder that projects geospatial coordinates into high-dimensional token space using multi-scale sinusoidal functions, enabling location-aware richness predictions. On a large Australian HAVPlot dataset (68,170 samples) paired with Landsat Geomedian imagery (2015–2023), Spatioformer outperforms CNN, ViT, and FactoFormer baselines, achieving $r=0.77$, $r^2=0.59$, MAE $=7.83$, MSE $=105.85$, RMSE $=10.29$, and a low $RSE=0.11$. The authors produce annual richness maps and uncertainty maps via Monte Carlo Dropout to guide future field surveys, and discuss limitations and future directions including combining environmental predictors and exploring hyperspectral data for broader applicability and improved interpretability.

Abstract

Earth observation data have shown promise in predicting species richness of vascular plants ($α$-diversity), but extending this approach to large spatial scales is challenging because geographically distant regions may exhibit different compositions of plant species ($β$-diversity), resulting in a location-dependent relationship between richness and spectral measurements. In order to handle such geolocation dependency, we propose \textit{Spatioformer}, where a novel geolocation encoder is coupled with the transformer model to encode geolocation context into remote sensing imagery. The Spatioformer model compares favourably to state-of-the-art models in richness predictions on a large-scale ground-truth richness dataset (HAVPlot) that consists of 68,170 in-situ richness samples covering diverse landscapes across Australia. The results demonstrate that geolocational information is advantageous in predicting species richness from satellite observations over large spatial scales. With Spatioformer, plant species richness maps over Australia are compiled from Landsat archive for the years from 2015 to 2023. The richness maps produced in this study reveal the spatiotemporal dynamics of plant species richness in Australia, providing supporting evidence to inform effective planning and policy development for plant diversity conservation. Regions of high richness prediction uncertainties are identified, highlighting the need for future in-situ surveys to be conducted in these areas to enhance the prediction accuracy.

Spatioformer: A Geo-encoded Transformer for Large-Scale Plant Species Richness Prediction

TL;DR

The paper tackles the problem of mapping plant species richness at continental scales where

-diversity causes location-dependent relationships with spectral data. It introduces Spatioformer, a transformer augmented with a geolocation encoder that projects geospatial coordinates into high-dimensional token space using multi-scale sinusoidal functions, enabling location-aware richness predictions. On a large Australian HAVPlot dataset (68,170 samples) paired with Landsat Geomedian imagery (2015–2023), Spatioformer outperforms CNN, ViT, and FactoFormer baselines, achieving

, MAE

, MSE

, RMSE

, and a low

. The authors produce annual richness maps and uncertainty maps via Monte Carlo Dropout to guide future field surveys, and discuss limitations and future directions including combining environmental predictors and exploring hyperspectral data for broader applicability and improved interpretability.

Abstract

Earth observation data have shown promise in predicting species richness of vascular plants (

-diversity), but extending this approach to large spatial scales is challenging because geographically distant regions may exhibit different compositions of plant species (

-diversity), resulting in a location-dependent relationship between richness and spectral measurements. In order to handle such geolocation dependency, we propose \textit{Spatioformer}, where a novel geolocation encoder is coupled with the transformer model to encode geolocation context into remote sensing imagery. The Spatioformer model compares favourably to state-of-the-art models in richness predictions on a large-scale ground-truth richness dataset (HAVPlot) that consists of 68,170 in-situ richness samples covering diverse landscapes across Australia. The results demonstrate that geolocational information is advantageous in predicting species richness from satellite observations over large spatial scales. With Spatioformer, plant species richness maps over Australia are compiled from Landsat archive for the years from 2015 to 2023. The richness maps produced in this study reveal the spatiotemporal dynamics of plant species richness in Australia, providing supporting evidence to inform effective planning and policy development for plant diversity conservation. Regions of high richness prediction uncertainties are identified, highlighting the need for future in-situ surveys to be conducted in these areas to enhance the prediction accuracy.

Spatioformer: A Geo-encoded Transformer for Large-Scale Plant Species Richness Prediction

TL;DR

Abstract

Spatioformer: A Geo-encoded Transformer for Large-Scale Plant Species Richness Prediction

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)