Table of Contents
Fetching ...

An information-geometric approach for network decomposition using the q-state Potts model

Alexandre L. M. Levada

TL;DR

Experimental results with several kinds of networks show that the high information subgraph is often related to edges and boundaries, while the low information subgraph is a smoother version of the network, in the sense that the modular structure is improved.

Abstract

Complex networks are critical in many scientific, technological, and societal contexts due to their ability to represent and analyze intricate systems with interdependent components. Often, after labeling the nodes of a network with a community detection algorithm, its modular organization emerges, allowing a better understanding of the underlying structure by uncovering hidden relationships. In this paper, we introduce a novel information-geometric framework for the filtering and decomposition of networks whose nodes have been labeled. Our approach considers the labeled network as the outcome of a Markov random field modeled by a q-state Potts model. According to information geometry, the first and second order Fisher information matrices are related to the metric and curvature tensor of the parametric space of a statistical model. By computing an approximation to the local shape operator, the proposed methodology is able to identify low and high information nodes, allowing the decomposition of the labeled network in two complementary subgraphs. Hence, we call this method as the LO-HI decomposition. Experimental results with several kinds of networks show that the high information subgraph is often related to edges and boundaries, while the low information subgraph is a smoother version of the network, in the sense that the modular structure is improved.

An information-geometric approach for network decomposition using the q-state Potts model

TL;DR

Experimental results with several kinds of networks show that the high information subgraph is often related to edges and boundaries, while the low information subgraph is a smoother version of the network, in the sense that the modular structure is improved.

Abstract

Complex networks are critical in many scientific, technological, and societal contexts due to their ability to represent and analyze intricate systems with interdependent components. Often, after labeling the nodes of a network with a community detection algorithm, its modular organization emerges, allowing a better understanding of the underlying structure by uncovering hidden relationships. In this paper, we introduce a novel information-geometric framework for the filtering and decomposition of networks whose nodes have been labeled. Our approach considers the labeled network as the outcome of a Markov random field modeled by a q-state Potts model. According to information geometry, the first and second order Fisher information matrices are related to the metric and curvature tensor of the parametric space of a statistical model. By computing an approximation to the local shape operator, the proposed methodology is able to identify low and high information nodes, allowing the decomposition of the labeled network in two complementary subgraphs. Hence, we call this method as the LO-HI decomposition. Experimental results with several kinds of networks show that the high information subgraph is often related to edges and boundaries, while the low information subgraph is a smoother version of the network, in the sense that the modular structure is improved.

Paper Structure

This paper contains 10 sections, 25 equations, 6 figures, 5 tables.

Figures (6)

  • Figure 1: The high information nodes have value of $S_i(\beta)$ belonging to the interval $(Q3, max]$.
  • Figure 2: Visualization of the qualitative results for the pendigits dataset. From left to right and top to bottom: a) Original network; b) Original network with high information nodes highlighted in black; c) L-subgraph (smooth); d) H-subgraph (boundaries).
  • Figure 3: Visualization of the qualitative results for the optdigits dataset. From left to right and top to bottom: a) Original network; b) Original network with high information nodes highlighted in black; c) L-subgraph (smooth); d) H-subgraph (boundaries).
  • Figure 4: Visualization of the qualitative results for the football network. From left to right and top to bottom: a) Original network; b) Original network with high information nodes highlighted in red; c) L-subgraph (smooth); d) H-subgraph (boundaries).
  • Figure 5: Visualization of the qualitative results for the bio-diseasome network. From left to right and top to bottom: a) Original network; b) Original network with high information nodes highlighted in red; c) L-subgraph (smooth); d) H-subgraph (boundaries).
  • ...and 1 more figures

Theorems & Definitions (3)

  • Definition 1: First-order Fisher information
  • Definition 2: Second-order Fisher information
  • Definition 3: Shape operator