Table of Contents
Fetching ...

Surrogate Graph Partitioning for Spatial Prediction

Yuta Shikuri, Hironori Fujisawa

TL;DR

The paper addresses interpretable spatial prediction by formulating a surrogate graph-partitioning problem that minimizes within-segment variance of predicted values, cast as a mixed-integer quadratic program $\min\|\bm{W}\bm{v}-\bm{\eta}\|_2^2$ with $\bm{W}\in\{0,1\}^{n\times m}$. To tackle computational intractability for large $n$, it introduces an approximation based on prior aggregation, establishing an additive guarantee $c_2=2\|\tilde{\bm{\eta}}-\bm{\eta}\|_2$ and proving that, under certain conditions, data points within a sublabel can share a common label in an optimal solution. The methodology combines Gaussian process regression with variational inference for the predictor $\eta$, a prior-aggregation step to reduce problem size, and a flow-based, MIQP-driven connected graph partitioning approach to yield spatial segments that preserve interpretability. Experimental results on California Housing and National Risk Index demonstrate that the MIQP-based segmentation achieves lower intra-group variance than baselines, while the approximation substantially improves scalability; constraints enforcing connectivity help prevent distant regions from being spuriously linked. Overall, the paper offers a practical, interpretable surrogate modeling framework for spatial prediction with provable approximation guarantees and scalable computation.

Abstract

Spatial prediction refers to the estimation of unobserved values from spatially distributed observations. Although recent advances have improved the capacity to model diverse observation types, adoption in practice remains limited in industries that demand interpretability. To mitigate this gap, surrogate models that explain black-box predictors provide a promising path toward interpretable decision making. In this study, we propose a graph partitioning problem to construct spatial segments that minimize the sum of within-segment variances of individual predictions. The assignment of data points to segments can be formulated as a mixed-integer quadratic programming problem. While this formulation potentially enables the identification of exact segments, its computational complexity becomes prohibitive as the number of data points increases. Motivated by this challenge, we develop an approximation scheme that leverages the structural properties of graph partitioning. Experimental results demonstrate the computational efficiency of this approximation in identifying spatial segments.

Surrogate Graph Partitioning for Spatial Prediction

TL;DR

The paper addresses interpretable spatial prediction by formulating a surrogate graph-partitioning problem that minimizes within-segment variance of predicted values, cast as a mixed-integer quadratic program with . To tackle computational intractability for large , it introduces an approximation based on prior aggregation, establishing an additive guarantee and proving that, under certain conditions, data points within a sublabel can share a common label in an optimal solution. The methodology combines Gaussian process regression with variational inference for the predictor , a prior-aggregation step to reduce problem size, and a flow-based, MIQP-driven connected graph partitioning approach to yield spatial segments that preserve interpretability. Experimental results on California Housing and National Risk Index demonstrate that the MIQP-based segmentation achieves lower intra-group variance than baselines, while the approximation substantially improves scalability; constraints enforcing connectivity help prevent distant regions from being spuriously linked. Overall, the paper offers a practical, interpretable surrogate modeling framework for spatial prediction with provable approximation guarantees and scalable computation.

Abstract

Spatial prediction refers to the estimation of unobserved values from spatially distributed observations. Although recent advances have improved the capacity to model diverse observation types, adoption in practice remains limited in industries that demand interpretability. To mitigate this gap, surrogate models that explain black-box predictors provide a promising path toward interpretable decision making. In this study, we propose a graph partitioning problem to construct spatial segments that minimize the sum of within-segment variances of individual predictions. The assignment of data points to segments can be formulated as a mixed-integer quadratic programming problem. While this formulation potentially enables the identification of exact segments, its computational complexity becomes prohibitive as the number of data points increases. Motivated by this challenge, we develop an approximation scheme that leverages the structural properties of graph partitioning. Experimental results demonstrate the computational efficiency of this approximation in identifying spatial segments.

Paper Structure

This paper contains 28 sections, 9 theorems, 43 equations, 5 figures, 2 tables, 1 algorithm.

Key Result

Theorem 5.3

Suppose that assump_identical and assump0 are satisfied. Then, in an optimal solution, data points with the same sublabel are assigned a common label.

Figures (5)

  • Figure 1: Overview of our approach. Data points are grouped through graph partitioning, which is formulated as an MIQP problem. This formulation is approximated, with a performance guarantee, by aggregating nearby data points in advance. The color at each vertex represents the magnitude of the individual predictions. The dashed boxes in the middle figure indicate the aggregation units.
  • Figure 2: Example of \ref{['additional_constraint']}. Let $\mathcal{V}$ be the collection of vertex sets that induce connected subgraphs whose vertices share the same color. Without imposing the constraint characterized by $\mathcal{V}$, the beige vertex sets remain connected via one blue and three light blue vertices. However, once a blue or light blue vertex set in $\mathcal{V}$ is removed, this connection is broken.
  • Figure 3: Example of prior aggregation. Let $n = 8, m = 2$, and $l = 3$. The upper table, which corresponds to \ref{['assump0']}, illustrates how labels are reassigned for the first and third groups of data points sharing the same sublabel. For the second group, no replacement is performed because its assignment in the optimal solution has only a single label. The lower table shows the relationship between the entries of $\bm{\eta}$ and $\tilde{\bm{\eta}}$.
  • Figure 4: Examples of violations of \ref{['assump_connect']} and \ref{['assump_connect2']}. Each square represents a group of data points that share the same sublabel. Gray regions highlight the violation of the assumptions. Unconnected vertices in the original undirected graph violate \ref{['assump_connect']}. Additionally, \ref{['assump_connect2']} is violated when the removal of vertices in the gray square with multiple labels causes a connected component in the optimal solution to become disconnected.
  • Figure 5: Spatial segments obtained through connected graph partitioning using the MIQP formulation. For each target variable, the result of the trial with the largest total sum of squares is displayed. The leftmost panel presents individual predictions $\bm{\eta}$ from Gaussian process regression. The second panel displays approximate predictions $\tilde{\bm{\eta}}$ from the prior aggregation. The remaining panels show segmentations with $m \in \{2, 3, 4\}$ clusters. Colors indicate the levels of the predicted values.

Theorems & Definitions (15)

  • Theorem 5.3
  • Theorem 5.4
  • Proposition 5.5
  • Proposition 5.6
  • Proposition 5.9
  • Lemma 5.11
  • Theorem 5.13
  • proof
  • proof
  • proof
  • ...and 5 more