On (Mis)perceptions of testing effectiveness: an empirical study

Sira Vegas; Patricia Riofrio; Esperanza Marcos; Natalia Juristo

On (Mis)perceptions of testing effectiveness: an empirical study

Sira Vegas, Patricia Riofrio, Esperanza Marcos, Natalia Juristo

TL;DR

This work investigates whether developers’ perceptions of defect-detection technique effectiveness reflect reality in the absence of prior experience. Through an original controlled experiment and a replication with 46 students, the authors show that perceptions are often wrong and incur a measurable cost to software quality. The replicated study adds participants’ opinions to probe drivers behind misperceptions, finding that opinions do not reliably predict actual effectiveness and that perceived effectiveness is linked to how well techniques are applied. The results motivate the design of feedback tools and broader empirical work to reduce reliance on biased perceptions when selecting defect-detection strategies, with practical implications for practice and research.

Abstract

A recurring problem in software development is incorrect decision making on the techniques, methods and tools to be used. Mostly, these decisions are based on developers' perceptions about them. A factor influencing people's perceptions is past experience, but it is not the only one. In this research, we aim to discover how well the perceptions of the defect detection effectiveness of different techniques match their real effectiveness in the absence of prior experience. To do this, we conduct an empirical study plus a replication. During the original study, we conduct a controlled experiment with students applying two testing techniques and a code review technique. At the end of the experiment, they take a survey to find out which technique they perceive to be most effective. The results show that participants' perceptions are wrong and that this mismatch is costly in terms of quality. In order to gain further insight into the results, we replicate the controlled experiment and extend the survey to include questions about participants' opinions on the techniques and programs. The results of the replicated study confirm the findings of the original study and suggest that participants' perceptions might be based not on their opinions about complexity or preferences for techniques but on how well they think that they have applied the techniques.

On (Mis)perceptions of testing effectiveness: an empirical study

TL;DR

Abstract

Paper Structure (56 sections, 10 figures, 53 tables)

This paper contains 56 sections, 10 figures, 53 tables.

Introduction
Original Study: Research Questions and Methodology
Research Questions
Study Context and Ethics
Constructs Operationalization
Study Design
Experimental Objects
Participants
Data Analysis
Original Study: Validity Threats
Conclusion Validity
Internal Validity
Construct Validity
External Validity
Original Study: Results
...and 41 more sections

Figures (10)

Figure 1: Scatterplot for observed mismatch cost in the original study. Datapoints correspond to the mismatch cost in Table \ref{['tab:s1_cost']}
Figure 2: Scatterplot for observed mismatch cost in the replicated study
Figure 3: Boxplot for observed technique effectiveness in the original study
Figure 4: Normal Q-Q plot of residuals in the original study
Figure 5: Profile plot for program by technique interaction in the original study
...and 5 more figures

On (Mis)perceptions of testing effectiveness: an empirical study

TL;DR

Abstract

On (Mis)perceptions of testing effectiveness: an empirical study

Authors

TL;DR

Abstract

Table of Contents

Figures (10)