Interpretability of Graph Neural Networks to Assess Effects of Global Change Drivers on Ecological Networks
Emre Anakok, Pierre Barbillon, Colin Fontaine, Elisa Thebault
TL;DR
This paper tackles how global change drivers influence plant–pollinator networks and addresses the interpretability of graph neural networks (GNNs) in this ecological context. It develops and applies a bipartite variational graph auto-encoder (BVGAE) and its fair variant to Spipoll data, enabling connectivity prediction and covariate attribution while accounting for sampling bias. Through extensive simulations and a real-data application, the study shows that attribution methods can detect single-effect covariates and their sign, but struggles when covariate effects interact with plant genera or are inflated by observer bias. The findings emphasize the potential of BVGAE-based interpretability to uncover how land use and climate covariates shape pollination network connectivity, while highlighting the need for cautious interpretation with citizen-science datasets and complex interaction effects.
Abstract
Pollinators play a crucial role for plant reproduction, either in natural ecosystem or in human-modified landscape. Global change drivers,including climate change or land use modifications, can alter the plant-pollinator interactions. To assess the potential influence of global change drivers on pollination, large-scale interactions, climate and land use data are required. While recent machine learning methods, such as graph neural networks (GNNs), allow the analysis of such datasets, interpreting their results can be challenging. We explore existing methods for interpreting GNNs in order to highlight the effects of various environmental covariates on pollination network connectivity. An extensive simulation study is performed to confirm whether these methods can detect the interactive effect between a covariate and a genus of plant on connectivity, and whether the application of debiasing techniques influences the estimation of these effects. An application on the Spipoll dataset, with and without accounting for sampling effects, highlights the potential impact of land use on network connectivity and shows that accounting for sampling effects partially alters the estimation of these effects.
