Table of Contents
Fetching ...

MAGNOLIA: Matching Algorithms via GNNs for Online Value-to-go Approximation

Alexandre Hayderi, Amin Saberi, Ellen Vitercik, Anders Wikum

TL;DR

The paper tackles online Bayesian bipartite matching (OBBM), where irrevocable online decisions must maximize a final weighted matching under uncertain future arrivals. It introduces Magnolia, a Graph Neural Network that learns to approximate the value-to-go (VTG) of each action, effectively emulating the online optimal policy OPT_on through VTG predictions. Theoretical results show VTG can be locally approximated on bipartite random geometric graphs via small, locally aggregating subgraphs, providing justification for GNN-based VTG estimation. Empirically, Magnolia outperforms strong baselines across diverse graph families and sizes, demonstrates strong generalization and robustness to noise, and benefits from meta-models to adapt to different regimes, highlighting practical potential for online matching in digital marketplaces.

Abstract

Online Bayesian bipartite matching is a central problem in digital marketplaces and exchanges, including advertising, crowdsourcing, ridesharing, and kidney exchange. We introduce a graph neural network (GNN) approach that emulates the problem's combinatorially-complex optimal online algorithm, which selects actions (e.g., which nodes to match) by computing each action's value-to-go (VTG) -- the expected weight of the final matching if the algorithm takes that action, then acts optimally in the future. We train a GNN to estimate VTG and show empirically that this GNN returns high-weight matchings across a variety of tasks. Moreover, we identify a common family of graph distributions in spatial crowdsourcing applications, such as rideshare, under which VTG can be efficiently approximated by aggregating information within local neighborhoods in the graphs. This structure matches the local behavior of GNNs, providing theoretical justification for our approach.

MAGNOLIA: Matching Algorithms via GNNs for Online Value-to-go Approximation

TL;DR

The paper tackles online Bayesian bipartite matching (OBBM), where irrevocable online decisions must maximize a final weighted matching under uncertain future arrivals. It introduces Magnolia, a Graph Neural Network that learns to approximate the value-to-go (VTG) of each action, effectively emulating the online optimal policy OPT_on through VTG predictions. Theoretical results show VTG can be locally approximated on bipartite random geometric graphs via small, locally aggregating subgraphs, providing justification for GNN-based VTG estimation. Empirically, Magnolia outperforms strong baselines across diverse graph families and sizes, demonstrates strong generalization and robustness to noise, and benefits from meta-models to adapt to different regimes, highlighting practical potential for online matching in digital marketplaces.

Abstract

Online Bayesian bipartite matching is a central problem in digital marketplaces and exchanges, including advertising, crowdsourcing, ridesharing, and kidney exchange. We introduce a graph neural network (GNN) approach that emulates the problem's combinatorially-complex optimal online algorithm, which selects actions (e.g., which nodes to match) by computing each action's value-to-go (VTG) -- the expected weight of the final matching if the algorithm takes that action, then acts optimally in the future. We train a GNN to estimate VTG and show empirically that this GNN returns high-weight matchings across a variety of tasks. Moreover, we identify a common family of graph distributions in spatial crowdsourcing applications, such as rideshare, under which VTG can be efficiently approximated by aggregating information within local neighborhoods in the graphs. This structure matches the local behavior of GNNs, providing theoretical justification for our approach.
Paper Structure (58 sections, 16 theorems, 31 equations, 11 figures, 2 tables, 1 algorithm)

This paper contains 58 sections, 16 theorems, 31 equations, 11 figures, 2 tables, 1 algorithm.

Key Result

Lemma 3.2

Let $\bm{x_1}, \dots, \bm{x}_N \in [0,1]^d$, $\varepsilon > 0$, $\Delta \leq \frac{\varepsilon}{2d}$, and $k = \lceil\frac{\varepsilon}{2d\Delta}\rceil$. If $\lVert\bm{x_i} - \bm{x_j}\rVert_\infty \leq \Delta$, then with probability at most $\varepsilon$ over $\pi \sim \Pi_k$, $\bm{x_i}$ and $\bm{x

Figures (11)

  • Figure 1: Magnolia's GNN-based matching subroutine.
  • Figure 2: Boxplot showing the distribution of competitive ratios for Magnolia and baselines across graph configurations. All graphs are of size (10×20), and results for additional configurations are available in \ref{['appendix: base_more_params']}.
  • Figure 3: Evolution of competitive ratio over graphs of increasing size for a GNN trained on graphs of size (6×10). All test graphs have a 2:1 ratio of online to offline nodes. Because the threshold $t$ for greedy-t is selected from a validation set over diverse graph configurations, it does not perform equally well on all configurations. Results for additional graph configurations are in \ref{['appendix: size_more_params']}
  • Figure 4: Evolution of competitive ratio over regimes for Magnolia enabled with a meta-GNN. For evaluation, $|L|$ is kept fixed at 16 offline nodes, and $|R|$ varies from 8 to 45 online nodes. Results for additional graph configurations are available in \ref{['appendix: meta-models']}.
  • Figure 5: Evolution of competitive ratio as a function of noise level $\rho$ for graphs of size (10×30). Results for additional graph configurations are available in \ref{['appendix: noise-generalization']}.
  • ...and 6 more figures

Theorems & Definitions (35)

  • Definition 3.1
  • Definition 3.2
  • Lemma 3.2
  • proof : Proof sketch
  • Definition 3.3
  • Lemma 3.3
  • Corollary 3.3
  • Theorem 3.4
  • proof : Proof sketch
  • Definition 3.5: Tahmasebi23:Power
  • ...and 25 more