Pearson Correlations on Networks: Corrigendum

Michele Coscia; Karel Devriendt

Pearson Correlations on Networks: Corrigendum

Michele Coscia, Karel Devriendt

TL;DR

This note identifies a fundamental constraint for extending Pearson correlations to networks: the weighting matrix $W$ used to couple node values must be positive definite on $\operatorname{span}(1)^{\perp}$ to avoid imaginary or unbounded results. It shows that using the standard shortest-path distance to define $W=e^{-kP}$ fails this condition in general. The authors formalize the requirement and propose two natural negative type metric approaches—effective resistance and Euclidean node embeddings—which guarantee well-defined network correlations through $W=e^{-kD}$. They provide practical guidance, tests, and public code to enable robust calculation of network correlations in graphs. The results have immediate impact on any method that relies on distance-aware correlations over networks and clarify when standard choices are valid.

Abstract

Recently, the first author proposed a measure to calculate Pearson correlations for node values expressed in a network, by taking into account distances or metrics defined on the network. In this technical note, we show that using an arbitrary choice of distances might result in imaginary or unbounded correlation values, which is undesired. We prove that this problem is solved by restricting to a special class of distances: negative type metrics. We also discuss two natural classes of negative type metrics on graphs, for which the network correlations are properly defined.

Pearson Correlations on Networks: Corrigendum

TL;DR

This note identifies a fundamental constraint for extending Pearson correlations to networks: the weighting matrix

used to couple node values must be positive definite on

to avoid imaginary or unbounded results. It shows that using the standard shortest-path distance to define

fails this condition in general. The authors formalize the requirement and propose two natural negative type metric approaches—effective resistance and Euclidean node embeddings—which guarantee well-defined network correlations through

. They provide practical guidance, tests, and public code to enable robust calculation of network correlations in graphs. The results have immediate impact on any method that relies on distance-aware correlations over networks and clarify when standard choices are valid.

Abstract

Paper Structure (10 sections, 2 theorems, 5 equations, 1 figure)

This paper contains 10 sections, 2 theorems, 5 equations, 1 figure.

Introduction
Network Correlation
Problem of Negative Variances
Conditions on the Weight Matrix $W$
Examples
Solution
Negative Type Metric Spaces
Effective Resistance
Node Embeddings
Conclusion

Key Result

Proposition 1

The network Pearson correlation eq:rho-net is well-defined (real and finite) for all non-constant input vectors $x,y$ if and only if the weight matrix $W$ is positive definite on $\operatorname{span}(1)^\perp$. Furthermore, in this case the network Pearson correlation is a real number in $[-1,1]$.

Figures (1)

Figure 1: A network with negative eigenvalues for $W = e^{-kP}$.

Theorems & Definitions (2)

Proposition 1
Proposition 2: meckes2013positive

Pearson Correlations on Networks: Corrigendum

TL;DR

Abstract

Pearson Correlations on Networks: Corrigendum

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (2)