Grounding force-directed network layouts with latent space models
Felix Gaisbauer, Armin Pournaki, Sven Banisch, Eckehard Olbrich
TL;DR
This paper presents a principled way to ground force-directed network layouts in latent space models, making node positions interpretable as maximum-likelihood estimates of latent positions and parameters. By deriving force equations for unweighted, cumulative, and weighted networks from a latent-space probability model, the Leipzig Layout returns layouts whose geometry reflects probabilistic tie formation. Validation includes comparisons to modularity-based interpretations and application to real networks (Facebook friendship, German parliament Twitter, Harper’s letter, and survey data), showing meaningful separations and axis structures while highlighting local minima as a limitation. The approach provides a framework to spatialise data beyond traditional networks and offers a blueprint for extending force-directed layouts via alternative interaction models.
Abstract
Force-directed layout algorithms are ubiquitously-used tools for network visualisation across a multitude of scientific disciplines. However, they lack theoretical grounding which allows to interpret their outcomes rigorously and can guide the choice of specific algorithms for certain data sets. We propose an approach building on latent space models, which assume that the probability of nodes forming a tie depends on their distance in an unobserved latent space. From such latent space models, we derive force equations for a force-directed layout algorithm. Since the forces infer positions which maximise the likelihood of the given network under the latent space model, the force-directed layout becomes interpretable. We implement these forces for unweighted and weighted networks and spatialise different real-world networks. Comparison to existing layout algorithms (not grounded in an interpretable model) reveals that node groups are placed in similar configurations, while said algorithms show a stronger intra-cluster separation of nodes, as well as a tendency to separate clusters more strongly in retweet networks. We also explore the possibility of visualising data traditionally not seen as network data, such as survey data.
