SimpleSets: Capturing Categorical Point Patterns with Simple Shapes
Steven van den Broek, Wouter Meulemans, Bettina Speckmann
TL;DR
SimpleSets tackles the challenge of visualizing categorical point data by using simple enclosing shapes to capture spatial patterns. It introduces islands (convex clusters) and banks (bounded-bend polylines) as the building blocks, and a two-phase pipeline that partitions data and then renders enclosing shapes with a careful stacking and overlap-resolution strategy. The core contributions are formal definitions of simple patterns, a greedy partitioning algorithm with regularity and overlap delays, and a drawing method based on Minkowski-dilated patterns, local stacking, and curve-modification to produce aesthetically pleasing, low-distortion set visualizations. Evaluations on standard benchmarks show that SimpleSets often outperforms existing hull- and Voronoi-based methods in cognitive-load and distortion measures, with open-source code enabling reproducibility and further exploration.
Abstract
Points of interest on a map such as restaurants, hotels, or subway stations, give rise to categorical point data: data that have a fixed location and one or more categorical attributes. Consequently, recent years have seen various set visualization approaches that visually connect points of the same category to support users in understanding the spatial distribution of categories. Existing methods use complex and often highly irregular shapes to connect points of the same category, leading to high cognitive load for the user. In this paper we introduce SimpleSets, which uses simple shapes to enclose categorical point patterns, thereby providing a clean overview of the data distribution. SimpleSets is designed to visualize sets of points with a single categorical attribute; as a result, the point patterns enclosed by SimpleSets form a partition of the data. We give formal definitions of point patterns that correspond to simple shapes and describe an algorithm that partitions categorical points into few such patterns. Our second contribution is a rendering algorithm that transforms a given partition into a clean set of shapes resulting in an aesthetically pleasing set visualization. Our algorithm pays particular attention to resolving intersections between nearby shapes in a consistent manner. We compare SimpleSets to the state-of-the-art set visualizations using standard datasets from the literature.
