Unveiling High-dimensional Backstage: A Survey for Reliable Visual Analytics with Dimensionality Reduction
Hyeon Jeon, Hyunwook Lee, Yun-Hsin Kuo, Taehyun Yang, Daniel Archambault, Sungahn Ko, Takanori Fujiwara, Kwan-Liu Ma, Jinwook Seo
TL;DR
This paper tackles unreliability in visual analytics that rely on dimensionality reduction (DR) by delivering a holistic framework: a detailed workflow model mapping analyst and machine roles across six stages, a taxonomy linking problems, aims, and solutions, and a meta-analysis of 133 studies to reveal landscape patterns. It demonstrates that most work concentrates on creating new DR techniques rather than evaluating or interpreting them, and it documents practical reliability challenges—including overreliance on 2D scatterplots and lack of libraries. The authors validate their findings with eight DR experts and offer actionable guidance, including an interactive browser and a reader-friendly guide to navigate the literature. Collectively, the contributions provide a structured, human-centered roadmap for improving DR-based visual analytics, with implications for researchers and practitioners seeking more reliable, interpretable, and usable visualization tools.
Abstract
Dimensionality reduction (DR) techniques are essential for visually analyzing high-dimensional data. However, visual analytics using DR often face unreliability, stemming from factors such as inherent distortions in DR projections. This unreliability can lead to analytic insights that misrepresent the underlying data, potentially resulting in misguided decisions. To tackle these reliability challenges, we review 133 papers that address the unreliability of visual analytics using DR. Through this review, we contribute (1) a workflow model that describes the interaction between analysts and machines in visual analytics using DR, and (2) a taxonomy that identifies where and why reliability issues arise within the workflow, along with existing solutions for addressing them. Our review reveals ongoing challenges in the field, whose significance and urgency are validated by five expert researchers. This review also finds that the current research landscape is skewed toward developing new DR techniques rather than their interpretation or evaluation, where we discuss how the HCI community can contribute to broadening this focus.
