Working with Color: How Color Quantization Can Aid Researchers of Problematic Information
Nina Lutz, Jordyn W. Padzensky, Joseph S. Schafer
TL;DR
This paper addresses the challenge of analyzing large image collections of problematic information without over-relying on AI due to biases and computational costs. It introduces a lightweight, non-AI color quantization method embedded in a mixed-methods, human-in-the-loop pipeline to study visual rhetoric around the US–Mexico border. The approach leverages HSV-based color quantization, k-means clustering, Monk Skin Tone orb palettes, and symbol palettes, integrated with Label Studio for qualitative coding and sample selection, with a media-forensics use case. The study discusses limitations and advocates a historicist, critically reflective reclamation of such methods for scalable, ethical visual research in CSCW and misinformation studies.
Abstract
Analyzing large sets of visual media remains a challenging task, particularly in mixed-method studies dealing with problematic information and human subjects. Using AI tools in such analyses risks reifying and exacerbating biases, as well as untenable computational and cost limitations. As such, we turn to adopting geometric computer graphics and vision methods towards analyzing a large set of images from a problematic information campaign, in conjunction with human-in-the-loop qualitative analysis. We illustrate an effective case of this approach with the implementation of color quantization towards analyzing online hate image at the US-Mexico border, along with a historicist trace of the history of color quantization and skin tone scales, to inform our usage and reclamation of these methodologies from their racist origins. To that end, we scaffold motivations and the need for more researchers to consider the advantages and risks of reclaiming such methodologies in their own work, situated in our case study.
