Algorithms in the Stacks: Investigating automated, for-profit diversity audits in public libraries
Melanie Walsh, Connor Franklin Rey, Chang Ge, Tina Nowak, Sabina Tomkins
TL;DR
The paper examines how automated, for-profit collection diversity audits used by U.S. public libraries shape holdings, workflow, and governance. Through a mixed-methods approach (survey of 99 librarians, 14 interviews, GovSpend purchasing data, and vendor materials), it finds that these tools offer speed and benchmarking but tend to flatten diversity into broad categories, rely on proprietary metadata, and deepen vendor dependence. The study highlights concerns about transparency, metadata quality, and misalignment with local community needs, while noting the practical utility of audits for justification and decision support under resource constraints. It concludes with concrete recommendations for transparency, flexibility, open-source alternatives, human-in-the-loop processes, and collective action to ensure that automation serves the public good in an increasingly contested political landscape.
Abstract
Algorithmic systems are increasingly being adopted by cultural heritage institutions like libraries. In this study, we investigate U.S. public libraries' adoption of one specific automated tool -- automated collection diversity audits -- which we see as an illuminating case study for broader trends. Typically developed and sold by commercial book distributors, automated diversity audits aim to evaluate how well library collections reflect demographic and thematic diversity. We investigate how these audits function, whether library workers find them useful, and what is at stake when sensitive, normative decisions about representation are outsourced to automated commercial systems. Our analysis draws on an anonymous survey of U.S. public librarians (n=99), interviews with 14 librarians, a sample of purchasing records, and vendor documentation. We find that many library workers view these tools as convenient, time-saving solutions for assessing and diversifying collections under real and increasing constraints. Yet at the same time, the audits often flatten complex identities into standardized categories, fail to reflect local community needs, and further entrench libraries' infrastructural dependence on vendors. We conclude with recommendations for improving collection diversity audits and reflect on the broader implications for public libraries operating at the intersection of AI adoption, escalating anti-DEI backlash, and politically motivated defunding.
