Table of Contents
Fetching ...

Automatic Bias Detection in Source Code Review

Yoseph Berhanu Alebachew, Chris Brown

TL;DR

This work addresses cognitive bias in code reviews and the need for automated detection to improve fairness and efficiency. It proposes a controlled gaze-tracking study grounded in the spotlight-attention model to detect biased reviewer interactions. The methodology envisions analyzing gaze sequences with Markov models, RNNs, CRFs, and transformer-based architectures, augmented by multi-modal data such as eye-tracking, video, and interaction logs. The anticipated contributions include a real-time bias feedback framework and a psychology-informed, multimodal approach to mitigate bias in software development.

Abstract

Bias is an inherent threat to human decision-making, including in decisions made during software development. Extensive research has demonstrated the presence of biases at various stages of the software development life-cycle. Notably, code reviews are highly susceptible to prejudice-induced biases, and individuals are often unaware of these biases as they occur. Developing methods to automatically detect these biases is crucial for addressing the associated challenges. Recent advancements in visual data analytics have shown promising results in detecting potential biases by analyzing user interaction patterns. In this project, we propose a controlled experiment to extend this approach to detect potentially biased outcomes in code reviews by observing how reviewers interact with the code. We employ the "spotlight model of attention", a cognitive framework where a reviewer's gaze is tracked to determine their focus areas on the review screen. This focus, identified through gaze tracking, serves as an indicator of the reviewer's areas of interest or concern. We plan to analyze the sequence of gaze focus using advanced sequence modeling techniques, including Markov Models, Recurrent Neural Networks (RNNs), and Conditional Random Fields (CRF). These techniques will help us identify patterns that may suggest biased interactions. We anticipate that the ability to automatically detect potentially biased interactions in code reviews will significantly reduce unnecessary push-backs, enhance operational efficiency, and foster greater diversity and inclusion in software development. This approach not only helps in identifying biases but also in creating a more equitable development environment by mitigating these biases effectively

Automatic Bias Detection in Source Code Review

TL;DR

This work addresses cognitive bias in code reviews and the need for automated detection to improve fairness and efficiency. It proposes a controlled gaze-tracking study grounded in the spotlight-attention model to detect biased reviewer interactions. The methodology envisions analyzing gaze sequences with Markov models, RNNs, CRFs, and transformer-based architectures, augmented by multi-modal data such as eye-tracking, video, and interaction logs. The anticipated contributions include a real-time bias feedback framework and a psychology-informed, multimodal approach to mitigate bias in software development.

Abstract

Bias is an inherent threat to human decision-making, including in decisions made during software development. Extensive research has demonstrated the presence of biases at various stages of the software development life-cycle. Notably, code reviews are highly susceptible to prejudice-induced biases, and individuals are often unaware of these biases as they occur. Developing methods to automatically detect these biases is crucial for addressing the associated challenges. Recent advancements in visual data analytics have shown promising results in detecting potential biases by analyzing user interaction patterns. In this project, we propose a controlled experiment to extend this approach to detect potentially biased outcomes in code reviews by observing how reviewers interact with the code. We employ the "spotlight model of attention", a cognitive framework where a reviewer's gaze is tracked to determine their focus areas on the review screen. This focus, identified through gaze tracking, serves as an indicator of the reviewer's areas of interest or concern. We plan to analyze the sequence of gaze focus using advanced sequence modeling techniques, including Markov Models, Recurrent Neural Networks (RNNs), and Conditional Random Fields (CRF). These techniques will help us identify patterns that may suggest biased interactions. We anticipate that the ability to automatically detect potentially biased interactions in code reviews will significantly reduce unnecessary push-backs, enhance operational efficiency, and foster greater diversity and inclusion in software development. This approach not only helps in identifying biases but also in creating a more equitable development environment by mitigating these biases effectively

Paper Structure

This paper contains 14 sections.