Mining for sustainability in cloud architecture among the discussions of software practitioners: building a dataset
Sahar Ahmadisakha, Vasilios Andrikopoulos
TL;DR
The paper investigates sustainability in cloud architecture from practitioners by mining discussions on the Stack Exchange Software Engineering forum. It employs software repository mining (MSR) guided by ACM empirical standards and a Goal-Question-Metric framework to build an annotated dataset of cloud architectural discussions and map quality requirements to sustainability dimensions. The study finds that practitioners emphasize technical and economic sustainability, with environmental concerns being less prominent than in academic literature, and that discussions focus heavily on design activities (analysis, synthesis, implementation) rather than evaluation or maintenance. The resulting dataset provides a practitioner-centered resource for understanding current sustainability considerations in cloud architecture and enables future replication and extension across additional data sources and dimensions.
Abstract
The adoption of cloud computing is steadily increasing in designing and implementing software systems, thus it becomes imperative to consider the sustainability implications of these processes. While there has already been some academic research on this topic, there is a lack of perspective from practitioners. To bridge this gap, we utilize software repository mining techniques to examine 192 discussions among practitioners on the Software Engineering forum of the StackExchange platform, aiming to build an annotated dataset containing cloud architectural discussions and to understand the current discussion on sustainability in cloud architecture. To identify these discussions, we first put together a list of terms indicating sustainability as the topic. Our initial findings indicate practitioners mainly focus on design aspects (analysis, synthesis, and implementation) while avoiding complex activities like evaluation and maintenance. Technical sustainability is emphasized, while the economic dimension has the most discussions exclusively focused on it. This contrasts with previous academic literature, which highlighted environmental sustainability.
