A Theory of Random Graph Shift in Truncated-Spectrum vRKHS

Zhang Wan; Tingting Mu; Samuel Kaski

A Theory of Random Graph Shift in Truncated-Spectrum vRKHS

Zhang Wan, Tingting Mu, Samuel Kaski

Abstract

This paper develops a theory of graph classification under domain shift through a random-graph generative lens, where we consider intra-class graphs sharing the same random graph model (RGM) and the domain shift induced by changes in RGM components. While classic domain adaptation (DA) theories have well-underpinned existing techniques to handle graph distribution shift, the information of graph samples, which are itself structured objects, is less explored. The non-Euclidean nature of graphs and specialized architectures for graph learning further complicate a fine-grained analysis of graph distribution shifts. In this paper, we propose a theory that assumes RGM as the data generative process, exploiting its connection to hypothesis complexity in function space perspective for such fine-grained analysis. Building on a vector-valued reproducing kernel Hilbert space (vRKHS) formulation, we derive a generalization bound whose shift penalty admits a factorization into (i) a domain discrepancy term, (ii) a spectral-geometry term summarized by the accessible truncated spectrum, and (iii) an amplitude term that aggregates convergence and construction-stability effects. We empirically verify the insights on these terms in both real data and simulations.

A Theory of Random Graph Shift in Truncated-Spectrum vRKHS

Abstract

Paper Structure (74 sections, 28 theorems, 256 equations, 21 figures, 10 tables)

This paper contains 74 sections, 28 theorems, 256 equations, 21 figures, 10 tables.

Introduction
Preliminaries
Main Results
Pullback Domain Divergence
Spectrum-Amplitude Factorization
Hypothesis Disagreement Analysis
Experiments
Exp 1: Latent Wasserstein Distance as Domain Divergence ($\Delta_D$).
Exp 2: Truncated-Spectrum and Geometry ($\lambda_r$).
Exp 3: Amplitude ($K'\Xi+K"$).
Conclusion
Notations
Definitions
On RGM and Its Deformation
On Neural Network Families and Perturbation
...and 59 more sections

Key Result

Proposition 2.2

Given a source domain $D_S=(\mu_S, g_{D})$, a target domain $D_T=(\mu_T, g_{D})$, a hypothesis $h$, and the source and target error risks $\epsilon_S(h,g_D)$ and $\epsilon_T(h,g_D)$ assessed by Eq. (error_function). Then, the following holds: where $\mathcal{H}_{K_\ell}$ is the vRKHS of the hypothesis function $h$, the labeling function $g_D$, and the loss mapping function $\ell_{h,g_D}$.

Figures (21)

Figure 1: Estimated latent WD correlates with test losses.
Figure 2: Layerwise perturbation sensitivity. Early-layer sensitivity (conv0) is consistently the largest. Both non-uniform schemes reduce early-layer fragility relative to L2.
Figure 3: Estimated latent WD correlates with test losses.
Figure 4: Eigenvalues on IMDB-MULTI (1k), NCI1 (2k), PROTEINS (1k).
Figure 5: $d_{\mathrm{eff}}$ on IMDB-MULTI (1k), NCI1 (2k), PROTEINS (1k).
...and 16 more figures

Theorems & Definitions (64)

Definition 2.1: Random Graph Model
Proposition 2.2: Domain Adaptation Generalization Error
Theorem 3.1: Main Theorem
Corollary 3.2
Proposition 3.3
Proposition 3.5
Remark 3.6
Theorem 3.7: Class-conditional convergence maskey2022generalization
Theorem 3.9: Optimization term bound via construction stability
Definition B.1: Function Sampling Operator
...and 54 more

A Theory of Random Graph Shift in Truncated-Spectrum vRKHS

Abstract

A Theory of Random Graph Shift in Truncated-Spectrum vRKHS

Authors

Abstract

Table of Contents

Key Result

Figures (21)

Theorems & Definitions (64)