Sketch-Augmented Features Improve Learning Long-Range Dependencies in Graph Neural Networks
Ryien Hosseini, Filippo Simini, Venkatram Vishwanath, Rebecca Willett, Henry Hoffmann
TL;DR
This work tackles three key limitations of standard graph neural networks: oversquashing, oversmoothing, and limited expressiveness. It introduces Sketched Random Features (SRF), which embeds node features into a random kernel space and applies a Johnson–Lindenstrauss-style sketch to generate global, distance-preserving node representations that are concatenated to local activations in every GNN layer. The authors provide theoretical properties showing unbiased cross-terms, distance preservation, cross-node information flow, and permutation equivariance in expectation, alongside practical complexity benefits. Empirically, SRF improves performance on synthetic benchmarks and real-world tasks, including social networks, molecular OOD generalization, and long-range peptide interactions, while remaining complementary to existing positional encodings. Overall, SRF offers a scalable, architecture-agnostic enhancement that strengthens non-local reasoning in MPGNNs with minimal overhead.
Abstract
Graph Neural Networks learn on graph-structured data by iteratively aggregating local neighborhood information. While this local message passing paradigm imparts a powerful inductive bias and exploits graph sparsity, it also yields three key challenges: (i) oversquashing of long-range information, (ii) oversmoothing of node representations, and (iii) limited expressive power. In this work we inject randomized global embeddings of node features, which we term \textit{Sketched Random Features}, into standard GNNs, enabling them to efficiently capture long-range dependencies. The embeddings are unique, distance-sensitive, and topology-agnostic -- properties which we analytically and empirically show alleviate the aforementioned limitations when injected into GNNs. Experimental results on real-world graph learning tasks confirm that this strategy consistently improves performance over baseline GNNs, offering both a standalone solution and a complementary enhancement to existing techniques such as graph positional encodings. Our source code is available at \href{https://github.com/ryienh/sketched-random-features}{https://github.com/ryienh/sketched-random-features}.
