Climplicit: Climatic Implicit Embeddings for Global Ecological Tasks
Johannes Dollinger, Damien Robert, Elena Plekhanova, Lukas Drees, Jan Dirk Wegner
TL;DR
Climplicit presents a high-resolution, lightweight spatio-temporal climate encoder pretrained on CHELSA climatologies to generate implicit climate representations for any Earth location and month. By introducing ReSIREN and a direct temporal embedding, the model achieves strong downstream performance across biome classification, species distribution modeling, and plant trait regression while drastically reducing storage needs. Ablation studies confirm the importance of residual connections, temporal encoding, and CHELSA-based pretraining, and results indicate competitive advantages over existing geolocation encoders. The approach enables easier, more scalable ecological learning with reduced carbon footprint, albeit with some limitations related to implicit representations and resolution relative to full climate rasters.
Abstract
Deep learning on climatic data holds potential for macroecological applications. However, its adoption remains limited among scientists outside the deep learning community due to storage, compute, and technical expertise barriers. To address this, we introduce Climplicit, a spatio-temporal geolocation encoder pretrained to generate implicit climatic representations anywhere on Earth. By bypassing the need to download raw climatic rasters and train feature extractors, our model uses x3500 less disk space and significantly reduces computational needs for downstream tasks. We evaluate our Climplicit embeddings on biomes classification, species distribution modeling, and plant trait regression. We find that single-layer probing our Climplicit embeddings consistently performs better or on par with training a model from scratch on downstream tasks and overall better than alternative geolocation encoding models.
