Understanding Documentation Use Through Log Analysis: An Exploratory Case Study of Four Cloud Services
Daye Nam, Andrew Macvean, Brad Myers, Bogdan Vasilescu
TL;DR
This study demonstrates that documentation page-view logs can be mined at scale to reveal meaningful patterns in how developers use API documentation. Using a two-phase approach on four Google services, the authors first cluster usage patterns from monthly dwell-time vectors across 11 page types, then test hypotheses linking user experience, product type, and intent to both documentation usage and subsequent API adoption via logistic regression. Key findings show that experience and product context shape which documentation genres are consulted, and that engagement with guide documentation strongly predicts future API use, with variations across products. The authors argue for the feasibility of log-based documentation reviews and outline practical design recommendations and a longer-term personalization vision to improve developer onboarding and information foraging.
Abstract
Almost no modern software system is written from scratch, and developers are required to effectively learn to use third-party libraries or software services. Thus, many practitioners and researchers have looked for ways to create effective documentation that supports developers' learning. However, few efforts have focused on how people actually use the documentation. In this paper, we report on an exploratory, multi-phase, mixed methods empirical study of documentation page-view logs from four cloud-based industrial services. By analyzing page-view logs for over 100,000 users, we find diverse patterns of documentation page visits. Moreover, we show statistically that which documentation pages people visit often correlates with user characteristics such as past experience with the specific product, on the one hand, and with future adoption of the API on the other hand. We discuss the implications of these results on documentation design and propose documentation page-view log analysis as a feasible technique for design audits of documentation, from ones written for software developers to ones designed to support end users (e.g., Adobe Photoshop).
