PISA: Prioritized Invariant Subgraph Aggregation
Ali Ghasemi, Farooq Ahmad Wani, Maria Sofia Bucarelli, Fabrizio Silvestri
TL;DR
The paper addresses the challenge of out-of-distribution generalization for graph data, where distribution shifts affect both node features and topology. It extends invariant subgraph learning by proposing PISA, which trains multiple invariant GNN branches with diversity injections and then uses a dynamic MLP to weight subgraph evidence per instance. Key contributions include (i) learning multiple invariant subgraphs via sampling and a diversity penalty, (ii) a cross-branch contrastive objective to align same-label subgraphs, and (iii) a second-stage adaptive aggregator that captures nonlinear interactions among subgraphs. Across 15 datasets, including DrugOOD, PISA achieves state-of-the-art robustness, with up to 5% improvements over prior methods and strong performance in multi-subgraph scenarios, demonstrating practical gains in real-world graph tasks with distribution shifts.
Abstract
Recent work has extended the invariance principle for out-of-distribution (OOD) generalization from Euclidean to graph data, where challenges arise due to complex structures and diverse distribution shifts in node attributes and topology. To handle these, Chen et al. proposed CIGA (Chen et al., 2022b), which uses causal modeling and an information-theoretic objective to extract a single invariant subgraph capturing causal features. However, this single-subgraph focus can miss multiple causal patterns. Liu et al. (2025) addressed this with SuGAr, which learns and aggregates diverse invariant subgraphs via a sampler and diversity regularizer, improving robustness but still relying on simple uniform or greedy aggregation. To overcome this, the proposed PISA framework introduces a dynamic MLP-based aggregation that prioritizes and combines subgraph representations more effectively. Experiments on 15 datasets, including DrugOOD (Ji et al., 2023), show that PISA achieves up to 5% higher classification accuracy than prior methods.
