Simbanex: Similarity-based Exploration of IEEE VIS Publications
Daniel Witschard, Ilir Jusufi, Andreas Kerren
TL;DR
This paper introduces Simbanex, a visual analytics tool for interactive similarity exploration in bibliometric networks by decomposing a multivariate publication network into separately embeddable aspects such as topology, text, authors, and numerical counts. It proposes an aspect driven all embedding strategy that uses multiple embeddings to produce a homogeneous similarity framework and enables a novel similarity based clustering. The authors demonstrate two use cases citation link analysis and topic similarity to show how similarity patterns can reveal missing citations and sub topic structures, respectively. They discuss limitations including scalability and the need for human in the loop to interpret similarity in complex data, and argue for broad applicability of the approach beyond MVNs.
Abstract
Embeddings are powerful tools for transforming complex and unstructured data into numeric formats suitable for computational analysis tasks. In this work, we use multiple embeddings for similarity calculations to be applied in bibliometrics and scientometrics. We build a multivariate network (MVN) from a large set of scientific publications and explore an aspect-driven analysis approach to reveal similarity patterns in the given publication data. By dividing our MVN into separately embeddable aspects, we are able to obtain a flexible vector representation which we use as input to a novel method of similarity-based clustering. Based on these preprocessing steps, we developed a visual analytics application, called Simbanex, that has been designed for the interactive visual exploration of similarity patterns within the underlying publications.
