Supporting Annotators with Affordances for Efficiently Labeling Conversational Data
Austin Z. Henley, David Piorkowski
TL;DR
This work addresses the bottleneck of creating ground-truth labels for machine learning by introducing CAL, a production-quality, affordance-rich annotation interface for conversational data. CAL integrates code-set documentation, prevents invalid labels, guides label selection via a wizard, and provides quick access to previous labels, all within a single tool. In a within-subjects study against a standard spreadsheet, CAL significantly reduced cognitive load ($p < 0.05$) and achieved higher usability ($p < 0.001$), with most participants preferring CAL and no one preferring the spreadsheet. The findings demonstrate that integrated affordances can markedly improve annotator experience and efficiency, with future work aimed at automated label suggestions and fatigue monitoring to further enhance reliability and throughput.
Abstract
Without well-labeled ground truth data, machine learning-based systems would not be as ubiquitous as they are today, but these systems rely on substantial amounts of correctly labeled data. Unfortunately, crowdsourced labeling is time consuming and expensive. To address the concerns of effort and tedium, we designed CAL, a novel interface to aid in data labeling. We made several key design decisions for CAL, which include preventing inapt labels from being selected, guiding users in selecting an appropriate label when they need assistance, incorporating labeling documentation into the interface, and providing an efficient means to view previous labels. We implemented a production-quality implementation of CAL and report a user-study evaluation that compares CAL to a standard spreadsheet. Key findings of our study include users using CAL reported lower cognitive load, did not increase task time, users rated CAL to be easier to use, and users preferred CAL over the spreadsheet.
