Machine Learning Models for the Early Detection of Burnout in Software Engineering: a Systematic Literature Review

Tien Rahayu Tulili; Ayushi Rastogi; Andrea Capiluppi

Machine Learning Models for the Early Detection of Burnout in Software Engineering: a Systematic Literature Review

Tien Rahayu Tulili, Ayushi Rastogi, Andrea Capiluppi

Abstract

Burnout is an occupational syndrome that, like many other professions, affects the majority of software engineers. Past research studies showed important trends, including an increasing use of machine learning techniques to allow for an early detection of burnout. This paper is a systematic literature review (SLR) of the research papers that proposed machine learning (ML) approaches, and focused on detecting burnout in software developers and IT professionals. Our objective is to review the accuracy and precision of the proposed ML techniques, and to formulate recommendations for future researchers interested to replicate or extend those studies. From our SLR we observed that a majority of primary studies focuses on detecting emotions or utilise emotional dimensions to detect or predict the presence of burnout. We also performed a cross-sectional study to detect which ML approach shows a better performance at detecting emotions; and which dataset has more potential and expressivity to capture emotions. We believe that, by identifying which ML tools and datasets show a better performance at detecting emotions, and indirectly at identifying burnout, our paper can be a valuable asset to progress in this important research direction.

Machine Learning Models for the Early Detection of Burnout in Software Engineering: a Systematic Literature Review

Abstract

Paper Structure (34 sections, 8 figures, 7 tables)

This paper contains 34 sections, 8 figures, 7 tables.

Introduction
Related Works
Methodology
Research Questions
Identification of Relevant Studies
Definition of the search string
Selection of Databases
Filtering and inclusion criteria
Snowballing
Study Quality Assessment
Data Extraction
Data Synthesis
Findings - (RQ1)
Findings - (RQ2)
Sensor-based studies
...and 19 more sections

Figures (8)

Figure 1: Pipeline used in this SLR to obtain the relevant primary studies.
Figure 2: Number of papers grouped by purpose of study and input type.
Figure 3: Model Performances of Emotion and Stress Detection with sensor and movement data as the inputs and machine learning techniques categorised based on their most generalised characteristics. Bayesian (e.g. Naïve Bayes, Bayes Net), Ensemble method (e.g., AdaBoast, Light Gradient Boasting Method), Instance-based algorithm (e.g. KNN, K-star, IBk), Linear Model (e.g., Logistic Regression), NN-based algorithms (e.g. CNN, RestNet), Ruled-based Algorithm (e.g. ZeroR), Tree-based Algorithms (e.g., DT, J48, RF, C.45)
Figure 4: Model Performances of attrition detection with machine learning techniques categorised based on their most generalised characteristics. Bayesian (e.g. Naïve Bayes), Instance-based algorithm (e.g. KNN), Linear Model (e.g., Logistic Regression), NN-based algorithms (e.g. MLP), Tree-based Algorithms (e.g., DT, RF), Ensemble (e.g., Adaptive Boost), and Kernel-based algorithms (e.g., SVM, SVC)
Figure 5: Model Performances of toxicity detection with machine learning techniques categorised based on their most generalised characteristics. Bayesian (e.g. Naïve Bayes), Instance-based algorithm (e.g. KNN), Linear Model (e.g., Logistic Regression), NN-based algorithms (e.g. Deep Pyramid CNN, Strudel, DPCNN, DCRNN, LSTM, BiLSTM, and GRU), Tree-based Algorithms (e.g., DT, GBT, RF, CART), and Transformed-based model (e.g., BERT, RoBERTa, DistilBERT, ALBERT, XLNet, ChatGPT, and GPT4.0)
...and 3 more figures

Machine Learning Models for the Early Detection of Burnout in Software Engineering: a Systematic Literature Review

Abstract

Machine Learning Models for the Early Detection of Burnout in Software Engineering: a Systematic Literature Review

Authors

Abstract

Table of Contents

Figures (8)