Table of Contents
Fetching ...

Cyclic Polygon Plots

Maksim Schreck, Peter Albers, Filip Sadlo

TL;DR

The paper introduces cyclic polygon plots (CPP) to visualize $n$-dimensional data in 2D by decomposing into 2D subspaces and forming polygons that preserve dimension correspondence. It presents two cyclic-pair schemes, ab-bc and ab-cd, along with four placement strategies (intrinsic, geometric, angular, statistical) and benchmarks CPP against PCP and RC across multiple datasets, including a user study. Key contributions include the CPP formulation, intrinsic glyph placement, and a comparative evaluation showing CPP can outperform traditional methods in information density and readability, especially in higher dimensions, though challenges remain for value comparisons and identical vertices. The work demonstrates CPP's potential for scalable, quantitative multivariate analysis and offers practical guidance on scheme selection and placement, with log-scaling shown to enhance readability in dense data.

Abstract

In this paper, we introduce the cyclic polygon plot, a representation based on a novel projection concept for multi-dimensional values. Cyclic polygon plots combine the typically competing requirements of quantitativeness, image-space efficiency, and readability. Our approach is complemented with a placement strategy based on its intrinsic features, resulting in a dimensionality reduction strategy that is consistent with our overall concept. As a result, our approach combines advantages from dimensionality reduction techniques and quantitative plots, supporting a wide range of tasks in multi-dimensional data analysis. We examine and discuss the overall properties of our approach, and demonstrate its utility with a user study and selected examples.

Cyclic Polygon Plots

TL;DR

The paper introduces cyclic polygon plots (CPP) to visualize -dimensional data in 2D by decomposing into 2D subspaces and forming polygons that preserve dimension correspondence. It presents two cyclic-pair schemes, ab-bc and ab-cd, along with four placement strategies (intrinsic, geometric, angular, statistical) and benchmarks CPP against PCP and RC across multiple datasets, including a user study. Key contributions include the CPP formulation, intrinsic glyph placement, and a comparative evaluation showing CPP can outperform traditional methods in information density and readability, especially in higher dimensions, though challenges remain for value comparisons and identical vertices. The work demonstrates CPP's potential for scalable, quantitative multivariate analysis and offers practical guidance on scheme selection and placement, with log-scaling shown to enhance readability in dense data.

Abstract

In this paper, we introduce the cyclic polygon plot, a representation based on a novel projection concept for multi-dimensional values. Cyclic polygon plots combine the typically competing requirements of quantitativeness, image-space efficiency, and readability. Our approach is complemented with a placement strategy based on its intrinsic features, resulting in a dimensionality reduction strategy that is consistent with our overall concept. As a result, our approach combines advantages from dimensionality reduction techniques and quantitative plots, supporting a wide range of tasks in multi-dimensional data analysis. We examine and discuss the overall properties of our approach, and demonstrate its utility with a user study and selected examples.
Paper Structure (39 sections, 10 equations, 7 figures, 2 tables)

This paper contains 39 sections, 10 equations, 7 figures, 2 tables.

Figures (7)

  • Figure 1: \ref{['fig:polygon-creation-illustration-combined']} Cyclic polygon plot (CPP) with ab-bc (orange) and ab-cd (cyan) scheme, including data components $\delta_i$. \ref{['fig:3d-pcp']} CPP (orange) as projection of modified 3D parallel coordinates plot johansson2014usability (red).
  • Figure 2: 5D QCM10 dataset. PCP \ref{['fig:qcm-pcp']}, RC \ref{['fig:qcm-rc']}, CPP with ab-bc \ref{['fig:qcm-abbc']}, and ab-cd \ref{['fig:qcm-abcd']} scheme. Observe linear trend in \ref{['fig:qcm-abbc']}, \ref{['fig:qcm-abcd']}.
  • Figure 3: Iris dataset (4D, linear scale \ref{['fig:iris-pcp']}--\ref{['fig:iris-abcd']}) and 13-dimensional Wine dataset (linear scale \ref{['fig:wine-linear-pcp']}--\ref{['fig:wine-linear-abcd']} and logarithmic scale \ref{['fig:wine-log-pcp']}--\ref{['fig:wine-log-abcd']}). Column arrangement analogous to \ref{['fig:qcm']}.
  • Figure 4: Iris dataset (4D). Comparison of t-SNE \ref{['fig:iris-placement-tsne']} and UMAP \ref{['fig:iris-placement-umap']} with our intrinsic \ref{['fig:iris-placement-avg']}, geometric (ab-cd) \ref{['fig:iris-placement-geo']}, geometric (ab-bc) \ref{['fig:iris-placement-geo-abbc']}, and statistical \ref{['fig:iris-placement-stat']} placement. Subcaptions refer to the corresponding Jaccard index ($\theta$) and silhouette coefficient ($\tau$). Bold values denote best performance (see also \ref{['tab:clustering']}).
  • Figure 5: Wine dataset (13-dimensional). Comparison of t-SNE \ref{['fig:wine-placement-tsne']}, UMAP \ref{['fig:wine-placement-umap']}, and our intrinsic \ref{['fig:wine-placement-avg']}, geometric \ref{['fig:wine-placement-geo']}, angular \ref{['fig:wine-placement-ang']}, and statistical \ref{['fig:wine-placement-stat']} placement.
  • ...and 2 more figures