A Tale of Two Models: Understanding Data Workers' Internal and External Representations of Complex Data
Connor Scully-Allison, Katy Williams, Stephanie Brink, Olga Pearce, Katherine E. Isaacs
TL;DR
The paper investigates how data workers mental models of heterogeneous data diverge from the reified data model embedded in a domain specific library EnsembleAPI. Through a qualitative study of ten participants using interviews, sketches, and task prompts analyzed via reflexive thematic analysis, the authors identify substantial diversity in mental representations and two parallel hazards that hinder analysis: incomplete mental models and misalignment with the reified model. They discuss implications for user centered design of data tools, including visual exploration techniques, graph based scripting, and improved metadata structure, to bridge gaps between theory and practice and reduce engineering debt. The work highlights the value of probing stakeholder mental models early and embracing multiple representations to support complex data analysis workflows in HPC domains.
Abstract
Data workers may have a a different mental model of their data that the one reified in code. Understanding the organization of their data is necessary for analyzing data, be it through scripting, visualization or abstract thought. More complicated organizations, such as tables with attached hierarchies, may tax people's ability to think about and interact with data. To better understand and ultimately design for these situations, we conduct a study across a team of ten people working with the same reified data model. Through interviews and sketching, we probed their conception of the data model and developed themes through reflexive data analysis. Participants had diverse data models that differed from the reified data model, even among team members who had designed the model, resulting in parallel hazards limiting their ability to reason about the data. From these observations, we suggest potential design interventions for data analysis processes and tools.
