Beyond the Benefits: A Systematic Review of the Harms and Consequences of Generative AI in Computing Education
Seth Bernstein, Ashfin Rahman, Nadia Sharifi, Ariunjargal Terbish, Stephen MacNeil
TL;DR
This study systematically reviews harms and unintended consequences of Generative AI (GenAI) in computing education, covering research published from 2022 to 2025. Using a PRISMA-guided literature review across ACM Digital Library, IEEE Xplore, and Scopus, the authors screened 1,677 references to 224 included papers and performed deductive coding on harms, tool types, tasks, populations, and evidence levels. They identify six broad harm categories—cognition, metacognition, assessment/academic integrity, equity, social, and instructional/logistical—with cognitive and metacognitive harms most prevalent, though many harms lack strong empirical backing and social harms are under-reported. The paper offers a taxonomy of GenAI harms tailored to computing education and discusses grand challenges and a roadmap for future work to maximize benefits while mitigating harms, informing educators, researchers, and tool developers.
Abstract
Generative artificial intelligence (GenAI) has already had a big impact on computing education with prior research identifying many benefits. However, recent studies have also identified potential risks and harms. To continue maximizing AI benefits while addressing the harms and unintended consequences, we conducted a systematic literature review of research focusing on the risks, harms, and unintended consequences of GenAI in computing education. Our search of ACM DL, IEEE Xplore, and Scopus (2022-2025) resulted in 1,677 papers, which were then filtered to 224 based on our inclusion and exclusion criteria. Guided by best practices for systematic reviews, four reviewers independently extracted publication year, learner population, research method, contribution type, GenAI technology, and educational task information from each paper. We then coded each paper for concrete harm categories such as academic integrity, cognitive effects, and trust issues. Our analysis shows patterns in how and where harms appear, highlights methodological gaps and opportunities for more rigorous evidence, and identifies under-explored harms and student populations. By synthesizing these insights, we intend to equip educators, computing students, researchers, and developers with a clear picture of the harms associated with GenAI in computing education.
