Exact Subgraph Isomorphism Network with Mixed $L_{0,2}$ Norm Constraint for Predictive Graph Mining

Taiga Kojima; Haruto Kajita; Ayato Kohara; Masayuki Karasuyama

Exact Subgraph Isomorphism Network with Mixed $L_{0,2}$ Norm Constraint for Predictive Graph Mining

Taiga Kojima, Haruto Kajita, Ayato Kohara, Masayuki Karasuyama

TL;DR

EIN tackles graph-level prediction by integrating exact subgraph enumeration with a neural network, guided by a mixed $L_{0,2}$ sparsity constraint to select a small, interpretable set of predictive subgraphs. The Graph Mining Layer jointly learns subgraph representations through a linear aggregation over candidate subgraphs, while an iterative hard-thresholding optimization and a pruning scheme based on gradient upper bounds make the approach scalable. The paper provides convergence guarantees under standard smoothness and KL assumptions and demonstrates competitive accuracy on synthetic and real-world datasets, with strong post-hoc interpretability using SHAP, trees, and RF analyses. Combining exact subgraph information with neural models also enables flexible integration with Graph Neural Networks, improving discriminative power without sacrificing interpretability or tractability.

Abstract

In the graph-level prediction task (predict a label for a given graph), the information contained in subgraphs of the input graph plays a key role. In this paper, we propose Exact subgraph Isomorphism Network (EIN), which combines the exact subgraph enumeration, a neural network, and a sparse regularization by the mixed $L_{0,2}$ norm constraint. In general, building a graph-level prediction model achieving high discriminative ability along with interpretability is still a challenging problem. Our combination of the subgraph enumeration and neural network contributes to high discriminative ability about the subgraph structure of the input graph. Further, the sparse regularization in EIN enables us 1) to derive an effective pruning strategy that mitigates computational difficulty of the enumeration while maintaining the prediction performance, and 2) to identify important subgraphs that contributes to high interpretability. We empirically show that EIN has sufficiently high prediction performance compared with standard graph neural network models, and also, we show examples of post-hoc analysis based on the selected subgraphs.

Exact Subgraph Isomorphism Network with Mixed $L_{0,2}$ Norm Constraint for Predictive Graph Mining

TL;DR

EIN tackles graph-level prediction by integrating exact subgraph enumeration with a neural network, guided by a mixed

sparsity constraint to select a small, interpretable set of predictive subgraphs. The Graph Mining Layer jointly learns subgraph representations through a linear aggregation over candidate subgraphs, while an iterative hard-thresholding optimization and a pruning scheme based on gradient upper bounds make the approach scalable. The paper provides convergence guarantees under standard smoothness and KL assumptions and demonstrates competitive accuracy on synthetic and real-world datasets, with strong post-hoc interpretability using SHAP, trees, and RF analyses. Combining exact subgraph information with neural models also enables flexible integration with Graph Neural Networks, improving discriminative power without sacrificing interpretability or tractability.

Abstract

norm constraint. In general, building a graph-level prediction model achieving high discriminative ability along with interpretability is still a challenging problem. Our combination of the subgraph enumeration and neural network contributes to high discriminative ability about the subgraph structure of the input graph. Further, the sparse regularization in EIN enables us 1) to derive an effective pruning strategy that mitigates computational difficulty of the enumeration while maintaining the prediction performance, and 2) to identify important subgraphs that contributes to high interpretability. We empirically show that EIN has sufficiently high prediction performance compared with standard graph neural network models, and also, we show examples of post-hoc analysis based on the selected subgraphs.

Exact Subgraph Isomorphism Network with Mixed $L_{0,2}$ Norm Constraint for Predictive Graph Mining

TL;DR

Abstract

Exact Subgraph Isomorphism Network with Mixed $L_{0,2}$ Norm Constraint for Predictive Graph Mining

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (19)