The quest for the GRAph Level autoEncoder (GRALE)
Paul Krzakala, Gabriel Melo, Charlotte Laclau, Florence d'Alché-Buc, Rémi Flamary
TL;DR
GRALE tackles graph_level representation learning by encoding entire graphs into a shared Euclidean space and decoding them back to graphs of varying sizes. It replaces costly graph matching with a differentiable, learnable matching component and optimizes a graph_edit_distance-like objective via an Optimal Transport_loss, facilitated by an Evoformer-based encoder and a novel Evoformer Decoder. The framework demonstrates strong reconstruction quality and broad downstream applicability, including graph classification, regression, graph prediction, and graph matching, with impressive results on synthetic COLORING data and large molecular datasets like PUBCHEM. By enabling end_to_end pretraining on large graph corpora, GRALE offers a versatile foundation for graph_enabled tasks, while acknowledging computational complexity as a primary limitation and pointing to future work in scalable attention and accelerated differentiation of the Sinkhorn projection.
Abstract
Although graph-based learning has attracted a lot of attention, graph representation learning is still a challenging task whose resolution may impact key application fields such as chemistry or biology. To this end, we introduce GRALE, a novel graph autoencoder that encodes and decodes graphs of varying sizes into a shared embedding space. GRALE is trained using an Optimal Transport-inspired loss that compares the original and reconstructed graphs and leverages a differentiable node matching module, which is trained jointly with the encoder and decoder. The proposed attention-based architecture relies on Evoformer, the core component of AlphaFold, which we extend to support both graph encoding and decoding. We show, in numerical experiments on simulated and molecular data, that GRALE enables a highly general form of pre-training, applicable to a wide range of downstream tasks, from classification and regression to more complex tasks such as graph interpolation, editing, matching, and prediction.
