Table of Contents
Fetching ...

Machine Learning Models for the Early Detection of Burnout in Software Engineering: a Systematic Literature Review

Tien Rahayu Tulili, Ayushi Rastogi, Andrea Capiluppi

Abstract

Burnout is an occupational syndrome that, like many other professions, affects the majority of software engineers. Past research studies showed important trends, including an increasing use of machine learning techniques to allow for an early detection of burnout. This paper is a systematic literature review (SLR) of the research papers that proposed machine learning (ML) approaches, and focused on detecting burnout in software developers and IT professionals. Our objective is to review the accuracy and precision of the proposed ML techniques, and to formulate recommendations for future researchers interested to replicate or extend those studies. From our SLR we observed that a majority of primary studies focuses on detecting emotions or utilise emotional dimensions to detect or predict the presence of burnout. We also performed a cross-sectional study to detect which ML approach shows a better performance at detecting emotions; and which dataset has more potential and expressivity to capture emotions. We believe that, by identifying which ML tools and datasets show a better performance at detecting emotions, and indirectly at identifying burnout, our paper can be a valuable asset to progress in this important research direction.

Machine Learning Models for the Early Detection of Burnout in Software Engineering: a Systematic Literature Review

Abstract

Burnout is an occupational syndrome that, like many other professions, affects the majority of software engineers. Past research studies showed important trends, including an increasing use of machine learning techniques to allow for an early detection of burnout. This paper is a systematic literature review (SLR) of the research papers that proposed machine learning (ML) approaches, and focused on detecting burnout in software developers and IT professionals. Our objective is to review the accuracy and precision of the proposed ML techniques, and to formulate recommendations for future researchers interested to replicate or extend those studies. From our SLR we observed that a majority of primary studies focuses on detecting emotions or utilise emotional dimensions to detect or predict the presence of burnout. We also performed a cross-sectional study to detect which ML approach shows a better performance at detecting emotions; and which dataset has more potential and expressivity to capture emotions. We believe that, by identifying which ML tools and datasets show a better performance at detecting emotions, and indirectly at identifying burnout, our paper can be a valuable asset to progress in this important research direction.
Paper Structure (34 sections, 8 figures, 7 tables)

This paper contains 34 sections, 8 figures, 7 tables.

Figures (8)

  • Figure 1: Pipeline used in this SLR to obtain the relevant primary studies.
  • Figure 2: Number of papers grouped by purpose of study and input type.
  • Figure 3: Model Performances of Emotion and Stress Detection with sensor and movement data as the inputs and machine learning techniques categorised based on their most generalised characteristics. Bayesian (e.g. Naïve Bayes, Bayes Net), Ensemble method (e.g., AdaBoast, Light Gradient Boasting Method), Instance-based algorithm (e.g. KNN, K-star, IBk), Linear Model (e.g., Logistic Regression), NN-based algorithms (e.g. CNN, RestNet), Ruled-based Algorithm (e.g. ZeroR), Tree-based Algorithms (e.g., DT, J48, RF, C.45)
  • Figure 4: Model Performances of attrition detection with machine learning techniques categorised based on their most generalised characteristics. Bayesian (e.g. Naïve Bayes), Instance-based algorithm (e.g. KNN), Linear Model (e.g., Logistic Regression), NN-based algorithms (e.g. MLP), Tree-based Algorithms (e.g., DT, RF), Ensemble (e.g., Adaptive Boost), and Kernel-based algorithms (e.g., SVM, SVC)
  • Figure 5: Model Performances of toxicity detection with machine learning techniques categorised based on their most generalised characteristics. Bayesian (e.g. Naïve Bayes), Instance-based algorithm (e.g. KNN), Linear Model (e.g., Logistic Regression), NN-based algorithms (e.g. Deep Pyramid CNN, Strudel, DPCNN, DCRNN, LSTM, BiLSTM, and GRU), Tree-based Algorithms (e.g., DT, GBT, RF, CART), and Transformed-based model (e.g., BERT, RoBERTa, DistilBERT, ALBERT, XLNet, ChatGPT, and GPT4.0)
  • ...and 3 more figures