Wolves in the Repository: A Software Engineering Analysis of the XZ Utils Supply Chain Attack
Piotr Przymus, Thomas Durieux
TL;DR
The paper investigates a sophisticated supply chain attack on XZ Utils (CVE-2024-3094) that leveraged software engineering practices to gain long-term control over a critical OSS component. It deploys a mixed-methods analysis of Git histories, GitHub events, mailing lists, and security data to reconstruct the attack timeline and attacker tactics. The findings show the attacker built credibility through non-code contributions (documentation, translations, CI/CD, and repository infrastructure), gradually displacing the primary maintainer and enabling malicious releases. The study highlights governance, tooling, and detection implications for the OSS ecosystem and provides concrete guidance for preventing similar attacks in high-impact projects.
Abstract
The digital economy runs on Open Source Software (OSS), with an estimated 90\% of modern applications containing open-source components. While this widespread adoption has revolutionized software development, it has also created critical security vulnerabilities, particularly in essential but under-resourced projects. This paper examines a sophisticated attack on the XZ Utils project (CVE-2024-3094), where attackers exploited not just code, but the entire open-source development process to inject a backdoor into a fundamental Linux compression library. Our analysis reveals a new breed of supply chain attack that manipulates software engineering practices themselves -- from community management to CI/CD configurations -- to establish legitimacy and maintain long-term control. Through a comprehensive examination of GitHub events and development artifacts, we reconstruct the attack timeline, analyze the evolution of attacker tactics. Our findings demonstrate how attackers leveraged seemingly beneficial contributions to project infrastructure and maintenance to bypass traditional security measures. This work extends beyond traditional security analysis by examining how software engineering practices themselves can be weaponized, offering insights for protecting the open-source ecosystem.
