Derivation of Back-propagation for Graph Convolutional Networks using Matrix Calculus and its Application to Explainable Artificial Intelligence
Yen-Che Hsiao, Rongting Yue, Abhishek Dutta
TL;DR
This paper derives an analytic, closed-form back-propagation gradient for Graph Convolutional Networks using matrix calculus, extending to arbitrary depths and arbitrary element-wise activation functions. It targets two canonical graph tasks: node classification and link prediction, and validates the gradient by comparing against reverse-mode automatic differentiation, showing median SSE in the range $10^{-18}$ to $10^{-14}$. The authors provide explicit matrix-based expressions leveraging Kronecker, Hadamard, and permutation matrices, and they extend the framework to sensitivity analysis for explainable AI. While the method incurs higher computational cost than AD, it provides exact gradient expressions and a foundation for interpretable gradient-based optimization in GCNs.
Abstract
This paper provides a comprehensive and detailed derivation of the backpropagation algorithm for graph convolutional neural networks using matrix calculus. The derivation is extended to include arbitrary element-wise activation functions and an arbitrary number of layers. The study addresses two fundamental problems, namely node classification and link prediction. To validate our method, we compare it with reverse-mode automatic differentiation. The experimental results demonstrate that the median sum of squared errors of the updated weight matrices, when comparing our method to the approach using reverse-mode automatic differentiation, falls within the range of $10^{-18}$ to $10^{-14}$. These outcomes are obtained from conducting experiments on a five-layer graph convolutional network, applied to a node classification problem on Zachary's karate club social network and a link prediction problem on a drug-drug interaction network. Finally, we show how the derived closed-form solution can facilitate the development of explainable AI and sensitivity analysis.
