Table of Contents
Fetching ...

Adapting Installation Instructions in Rapidly Evolving Software Ecosystems

Haoyu Gao, Christoph Treude, Mansooreh Zahedi

TL;DR

This work tackles the challenge of maintaining installation-related README content in rapidly evolving OSS ecosystems. It conducts a large-scale qualitative study of 1,168 README commits from 400 GitHub repositories, culminating in a 189-code taxonomy across six update categories and three trigger types. A new installation-focused README template is proposed and validated via online surveys and a pull request study, showing measurable improvements in perceived documentation quality, particularly in structure, understandability, and conciseness. The findings offer actionable guidance for practitioners and lay groundwork for automated documentation tools that integrate human-in-the-loop updates to keep README files accurate and usable as projects evolve.

Abstract

README files play an important role in providing installation-related instructions to software users and are widely used in open source software systems on platforms such as GitHub. However, these files often suffer from various documentation issues, leading to challenges in comprehension and potential errors in content. Despite their significance, there is a lack of systematic understanding regarding the documentation efforts invested in README files, especially in the context of installation-related instructions, which are crucial for users to start with a software project. To fill the research gap, we conducted a qualitative study, investigating 400 GitHub repositories with 1,163 README commits that focused on updates in installation-related sections. Our research revealed six major categories of changes in the README commits, namely pre-installation instructions, installation instructions, post-installation instructions, help information updates, document presentation, and external resource management. We further provide detailed insights into modification behaviours and offer examples of these updates. Based on our findings, we propose a README template tailored to cover the installation-related sections for documentation maintainers to reference when updating documents. We further validate this template by conducting an online survey, identifying that documentation readers find the augmented documents based on our template are generally of better quality. We further provide recommendations to practitioners for maintaining their README files, as well as motivations for future research directions... (too long for arxiv)

Adapting Installation Instructions in Rapidly Evolving Software Ecosystems

TL;DR

This work tackles the challenge of maintaining installation-related README content in rapidly evolving OSS ecosystems. It conducts a large-scale qualitative study of 1,168 README commits from 400 GitHub repositories, culminating in a 189-code taxonomy across six update categories and three trigger types. A new installation-focused README template is proposed and validated via online surveys and a pull request study, showing measurable improvements in perceived documentation quality, particularly in structure, understandability, and conciseness. The findings offer actionable guidance for practitioners and lay groundwork for automated documentation tools that integrate human-in-the-loop updates to keep README files accurate and usable as projects evolve.

Abstract

README files play an important role in providing installation-related instructions to software users and are widely used in open source software systems on platforms such as GitHub. However, these files often suffer from various documentation issues, leading to challenges in comprehension and potential errors in content. Despite their significance, there is a lack of systematic understanding regarding the documentation efforts invested in README files, especially in the context of installation-related instructions, which are crucial for users to start with a software project. To fill the research gap, we conducted a qualitative study, investigating 400 GitHub repositories with 1,163 README commits that focused on updates in installation-related sections. Our research revealed six major categories of changes in the README commits, namely pre-installation instructions, installation instructions, post-installation instructions, help information updates, document presentation, and external resource management. We further provide detailed insights into modification behaviours and offer examples of these updates. Based on our findings, we propose a README template tailored to cover the installation-related sections for documentation maintainers to reference when updating documents. We further validate this template by conducting an online survey, identifying that documentation readers find the augmented documents based on our template are generally of better quality. We further provide recommendations to practitioners for maintaining their README files, as well as motivations for future research directions... (too long for arxiv)
Paper Structure (32 sections, 13 figures, 5 tables)

This paper contains 32 sections, 13 figures, 5 tables.

Figures (13)

  • Figure 1: Data Preparation Process
  • Figure 2: Ratio of non-software repositories among top n repositories
  • Figure 3: Comparison between repositories in our dataset and ones above 2,000 stars
  • Figure 4: Average Updated Sections Per Commit
  • Figure 5: Update Keywords Heat-Map (A: Added, R: Removed, M: Modified)
  • ...and 8 more figures