The DataSquad Experiment: Lessons for Preparing Data and Computer Scientists for Work
Paula Lackie, Elliot Pickens, Dashiell Coyier
TL;DR
DataSquad at Carleton College addresses the data-services gap in small liberal arts settings by deploying a work-study, peer-mentored, project-based model that teaches FAIR data practices and open science. The paper details the model’s roles, governance, and assessment framework, and reports on positive alumni outcomes and high client satisfaction. Analyses show strong development in communication and teamwork—key drivers of career influence—alongside meaningful technical gains, with a robust correlation between skill growth and post-college impact ($r=0.78$, $p=0.024$). The findings support broader adoption and adaptation of DataSquad-like programs to build early-career readiness for research software engineering and data-enabled work.
Abstract
The DataSquad at Carleton College addresses a common problem at small liberal arts colleges: limited capacity for data services and few opportunities for students to gain practical experience with data and software development. Academic Technologist Paula Lackie designed the program as a work-study position that trains undergraduates through structured peer mentorship and real client projects. Students tackle data problems of increasing complexity-from basic data analysis to software development-while learning FAIR data principles and open science practices. The model's core components (peer mentorship structure, project-based learning, and communication training) make it adaptable to other institutions. UCLA and other colleges have adopted the model using openly shared materials through "DataSquad International." This paper describes the program's implementation at Carleton College and examines how structured peer mentorship can simultaneously improve institutional data services and provide students with professional skills and confidence.
