A Human Behavioral Baseline for Collective Governance in Software Projects
Mobina Noori, Mahasweta Chakraborti, Amy X Zhang, Seth Frey
TL;DR
This work builds a large-scale longitudinal baseline of open-source governance prior to AI-managed tooling by parsing version-controlled GOVERNANCE.md documents into institutional statements using Institutional Grammar. It quantifies how authority and participation evolve through changes in the number and distribution of Roles and Actions (breadth and evenness) while Deontic polarity remains largely stable, using metrics such as $H$, $K$, and $JSD$ across paired snapshots. The findings suggest governance maturation via accretion rather than sweeping reallocation, providing a reproducible framework to evaluate future AI-mediated workflows for concentration, drift, and breadth. The results offer practical guidance for designing participatory AI systems that surface governance deltas and maintain transparent authority dynamics in collaborative software projects.
Abstract
We study how open source communities describe participation and control through version controlled governance documents. Using a corpus of 710 projects with paired snapshots, we parse text into actors, rules, actions, and objects, then group them and measure change with entropy for evenness, richness for diversity, and Jensen Shannon divergence for drift. Projects define more roles and more actions over time, and these are distributed more evenly, while the composition of rules remains stable. These findings indicate that governance grows by expanding and balancing categories of participation without major shifts in prescriptive force. The analysis provides a reproducible baseline for evaluating whether future AI mediated workflows concentrate or redistribute authority.
