Set Visualizations for Comparing and Evaluating Machine Learning Models
Liudas Panavas, Tarik Crnovrsanin, Racquel Fygenson, Eamon Conway, Derek Millard, Norbou Buchler, Cody Dunne
TL;DR
The paper tackles the challenge of comparing multiple predictors by transforming their outputs into sets and visualizing their intersections, enabling direct model-to-model comparisons. It formalizes a four-criteria method for creating set-type data and introduces SetMLVis, an UpSet-style interactive tool tailored for object-detection evaluation. Through a mixed-methods study against a traditional visualization baseline, the authors show that set visualizations improve task accuracy and reduce cognitive workload, especially on complex analyses. The work contributes a general methodology for set-based model comparison, an open-source tool integrated with Jupyter notebooks, and actionable insights for practitioners seeking more interpretable, scalable evaluation workflows.
Abstract
Machine learning practitioners often need to compare multiple models to select the best one for their application. However, current methods of comparing models fall short because they rely on aggregate metrics that can be difficult to interpret or do not provide enough information to understand the differences between models. To better support the comparison of models, we propose set visualizations of model outputs to enable easier model-to-model comparison. We outline the requirements for using sets to compare machine learning models and demonstrate how this approach can be applied to various machine learning tasks. We also introduce SetMLVis, an interactive system that utilizes set visualizations to compare object detection models. Our evaluation shows that SetMLVis outperforms traditional visualization techniques in terms of task completion and reduces cognitive workload for users. Supplemental materials can be found at https://osf.io/afksu/?view_only=bb7f259426ad425f81d0518a38c597be.
