How Do Communities of ML-Enabled Systems Smell? A Cross-Sectional Study on the Prevalence of Community Smells
Giusy Annunziata, Stefano Lambiase, Fabio Palomba, Gemma Catolino, Filomena Ferrucci
TL;DR
This study investigates the socio-technical health of ML-enabled software teams by examining community smells in 188 open-source projects from the NICHE dataset, using CADOCS to detect 10 smells. It combines cross-sectional prevalence analysis with a longitudinal, 3-month window study over up to 2 years and employs Prevalence Odds Ratios to explore inter-smell associations. Key findings show the Prima Donna Effect (PDE) as the most prevalent and persistent smell (≈92–96%), with other smells fluctuating over time; several smells correlate positively (notably PDE and Organizational Silo Effect) while many correlations are negative. The work provides actionable insights for ML project managers to mitigate social debt, guides researchers in studying socio-technical dynamics in ML contexts, and establishes a baseline for future longitudinal and causal investigations.
Abstract
Effective software development relies on managing both collaboration and technology, but sociotechnical challenges can harm team dynamics and increase technical debt. Although teams working on ML enabled systems are interdisciplinary, research has largely focused on technical issues, leaving their socio-technical dynamics underexplored. This study aims to address this gap by examining the prevalence, evolution, and interrelations of community smells, in open-source ML projects. We conducted an empirical study on 188 repositories from the NICHE dataset using the CADOCS tool to identify and analyze community smells. Our analysis focused on their prevalence, interrelations, and temporal variations. We found that certain smells, such as Prima Donna Effects and Sharing Villainy, are more prevalent and fluctuate over time compared to others like Radio Silence or Organizational Skirmish. These insights might provide valuable support for ML project managers in addressing socio-technical issues and improving team coordination.
