AI-assisted Programming May Decrease the Productivity of Experienced Developers by Increasing Maintenance Burden
Feiyang Xu, Poonacha K. Medappa, Murat M. Tunc, Martijn Vroegindeweij, Jan C. Fransoo
TL;DR
This study investigates how AI-assisted programming with GitHub Copilot affects OSS development, using a Difference-in-Differences design around Copilot’s technical preview to compare treatment (endorsed languages) and control groups. At the project level, Copilot is associated with measurable productivity gains (lines added, commits, PRs), yet PR rework increases, signaling lower initial code quality and higher maintenance burden. At the individual level, productivity gains are driven by peripheral contributors, while core contributors experience reduced direct coding activity and a rising maintenance load (more PR reviews and broader repository coverage), culminating in a 19% drop in core commits alongside a 6.5% rise in reviews. Overall, the results suggest AI-era productivity gains may mask growing maintenance burdens on a shrinking pool of experienced maintainers, with implications for OSS sustainability and software governance in broader ecosystems.
Abstract
Generative AI solutions like GitHub Copilot have been shown to increase the productivity of software developers. Yet prior work remains unclear on the quality of code produced and the challenges of maintaining it in software projects. If quality declines as volume grows, experienced developers face increased workloads reviewing and reworking code from less-experienced contributors. We analyze developer activity in Open Source Software (OSS) projects following the introduction of GitHub Copilot. We find that productivity indeed increases. However, the increase in productivity is primarily driven by less-experienced (peripheral) developers. We also find that code written after the adoption of AI requires more rework. Importantly, the added rework burden falls on the more experienced (core) developers, who review 6.5% more code after Copilot's introduction, but show a 19% drop in their original code productivity. More broadly, this finding raises caution that productivity gains of AI may mask the growing burden of maintenance on a shrinking pool of experts.
