Table of Contents
Fetching ...

Towards characterizing the value of edge embeddings in Graph Neural Networks

Dhruv Rohatgi, Tanya Marwah, Zachary Chase Lipton, Jianfeng Lu, Ankur Moitra, Andrej Risteski

TL;DR

The benefits of architectures that maintain and update edge embeddings are considered and it is shown that there are natural tasks on graphical models for which architectures leveraging edge embeddings can be much shallower.

Abstract

Graph neural networks (GNNs) are the dominant approach to solving machine learning problems defined over graphs. Despite much theoretical and empirical work in recent years, our understanding of finer-grained aspects of architectural design for GNNs remains impoverished. In this paper, we consider the benefits of architectures that maintain and update edge embeddings. On the theoretical front, under a suitable computational abstraction for a layer in the model, as well as memory constraints on the embeddings, we show that there are natural tasks on graphical models for which architectures leveraging edge embeddings can be much shallower. Our techniques are inspired by results on time-space tradeoffs in theoretical computer science. Empirically, we show architectures that maintain edge embeddings almost always improve on their node-based counterparts -- frequently significantly so in topologies that have ``hub'' nodes.

Towards characterizing the value of edge embeddings in Graph Neural Networks

TL;DR

The benefits of architectures that maintain and update edge embeddings are considered and it is shown that there are natural tasks on graphical models for which architectures leveraging edge embeddings can be much shallower.

Abstract

Graph neural networks (GNNs) are the dominant approach to solving machine learning problems defined over graphs. Despite much theoretical and empirical work in recent years, our understanding of finer-grained aspects of architectural design for GNNs remains impoverished. In this paper, we consider the benefits of architectures that maintain and update edge embeddings. On the theoretical front, under a suitable computational abstraction for a layer in the model, as well as memory constraints on the embeddings, we show that there are natural tasks on graphical models for which architectures leveraging edge embeddings can be much shallower. Our techniques are inspired by results on time-space tradeoffs in theoretical computer science. Empirically, we show architectures that maintain edge embeddings almost always improve on their node-based counterparts -- frequently significantly so in topologies that have ``hub'' nodes.

Paper Structure

This paper contains 1 section, 7 theorems, 6 equations.

Key Result

Theorem 3

For any symmetric edge message-passing protocol on $G$, the function $f$ that it computes must satisfy assumption:fe. Similarly, for any symmetric node message-passing protocol on $G$, the function $f$ that it computes must satisfy assumption:fe.

Theorems & Definitions (13)

  • Definition 1
  • Theorem 3
  • Theorem 4
  • proof
  • Lemma 5
  • Theorem 6
  • proof
  • Theorem 7
  • proof
  • Lemma 8
  • ...and 3 more