Redundant is Not Redundant: Automating Efficient Categorical Palette Design Unifying Color & Shape Encodings with CatPAW
Chin Tseng, Arran Zeyu Wang, Ghulam Jilani Quadri, Danielle Albers Szafir
TL;DR
The paper addresses how to design effective redundant color–shape palettes for multiclass scatterplots. It uses four crowdsourced studies to quantify when redundancy helps, how color and shape interact, and how to optimize color–shape pairings, culminating in CatPAW, a web tool that recommends empirical, category-aware palettes. The findings show redundancy yields meaningful accuracy gains most prominently for moderate category counts ($N \approx 5$–$8$), but effectiveness hinges on palette pairing and channel interactions; the authors provide a data-driven framework and a practical design tool to implement these insights. CatPAW’s data-backed approach offers a concrete path toward more reliable categorical visualizations and lays groundwork for future generalization to additional tasks and contexts.
Abstract
Colors and shapes are commonly used to encode categories in multi-class scatterplots. Designers often combine the two channels to create redundant encodings, aiming to enhance class distinctions. However, evidence for the effectiveness of redundancy remains conflicted, and guidelines for constructing effective combinations are limited. This paper presents four crowdsourced experiments evaluating redundant color-shape encodings and identifying high-performing configurations across different category numbers. Results show that redundancy significantly improves accuracy in assessing class-level correlations, with the strongest benefits for 5-8 categories. We also find pronounced interaction effects between colors and shapes, underscoring the need for careful pairing in designing redundant encodings. Drawing on these findings, we introduce a categorical palette design tool that enables designers to construct empirically grounded palettes for effective categorical visualization. Our work advances understanding of categorical perception in data visualization by systematically identifying effective redundant color-shape combinations and embedding these insights into a practical palette design tool.
