Developer Productivity With and Without GitHub Copilot: A Longitudinal Mixed-Methods Case Study

Viktoria Stray; Elias Goldmann Brandtzæg; Viggo Tellefsen Wivestad; Astri Barbala; Nils Brede Moe

TL;DR

This study investigates the real-world impact of GitHub Copilot on developer activity and perceived productivity at NAV IT, using a longitudinal mixed-methods design covering two years. It combines 26,317 unique non-merge commits across 703 repositories with surveys and 13 interviews to compare 25 Copilot users with 14 non-users. Copilot adopters were already more active than non-users before the tool was introduced, and their commit-based activity showed only minor, statistically non-significant increases after adoption. Perceived productivity nevertheless tended to increase for Copilot users, highlighting a discrepancy between subjective experience and measurable output. The findings suggest that GenAI tools may primarily reduce cognitive load and enhance workflow rather than dramatically boosting raw code production, and they call for broader, multi-dimensional productivity metrics that capture developer well-being and flow in addition to output.

Abstract

This study investigates the real-world impact of the generative AI (GenAI) tool GitHub Copilot on developer activity and perceived productivity. We conducted a mixed-methods case study in NAV IT, a large public sector agile organization. We analyzed 26,317 unique non-merge commits from 703 of NAV IT's GitHub repositories over a two-year period, focusing on commit-based activity metrics from 25 Copilot users and 14 non-users. The analysis was complemented by survey responses on participants' roles and perceived productivity, as well as 13 interviews. Our analysis of activity metrics revealed that individuals who used Copilot were consistently more active than non-users, even prior to Copilot's introduction. We did not find any statistically significant changes in commit-based activity for Copilot users after they adopted the tool, although minor increases were observed. Set against the survey and interview data on perceived productivity, this suggests a discrepancy between changes in commit-based metrics and the subjective experience of productivity.
