Interpretable Visualizations of Data Spaces for Classification Problems
Christian Jorgensen, Arthur Y. Lin, Rhushil Vasavada, Rose K. Cersonsky
TL;DR
This work introduces principal covariates classification (PCovC), a hybrid supervised-unsupervised mapping technique designed to visualize and interpret classification decision boundaries in data spaces. By integrating a classifier-derived evidence matrix into a PCovR-like framework, PCovC yields low-dimensional latent spaces that reflect both data structure and classification performance, enabling qualitative and quantitative analysis of boundaries across diverse domains. Through case studies in neurotoxicity, organosulfur spectroscopy, inorganic materials, and MNIST, the authors demonstrate improved boundary delineation, robust interpretability of feature influence, and practical benefits for downstream tasks like MNIST preprocessing before nonlinear embeddings. The approach offers a general, scalable pathway to unbox machine-learning decisions in chemistry and related fields, with an open-source implementation in scikit-matter.
Abstract
How do classification models "see" our data? Based on their success in delineating behaviors, there must be some lens through which it is easy to see the boundary between classes; however, our current set of visualization techniques makes this prospect difficult. In this work, we propose a hybrid supervised-unsupervised technique distinctly suited to visualizing the decision boundaries determined by classification problems. This method provides a human-interpretable map that can be analyzed qualitatively and quantitatively, which we demonstrate through visualizing and interpreting a decision boundary for chemical neurotoxicity. While we discuss this method in the context of chemistry-driven problems, its application can be generalized across subfields for "unboxing" the operations of machine-learning classification models.
