WEFix: Intelligent Automatic Generation of Explicit Waits for Efficient Web End-to-End Flaky Tests

Xinyue Liu; Zihe Song; Weike Fang; Wei Yang; Weihang Wang

TL;DR

WEFix tackles UI-based flakiness in web e2e tests by automatically generating explicit waits from observed DOM mutations. It combines a Mutation Recorder (test-code instrumentation and browser mutation capture) with an Oracle Generator (mutation pruning, a mutation FSM, and wait-oracle construction) to insert waits that fire only when necessary. In an evaluation on 122 reproduced flaky tests from seven real-world projects, WEFix fixed 98.4% of cases and reduced the average runtime overhead from 3.7x under implicit-wait baselines to about 1.25x, outperforming fixed implicit-wait strategies. The approach is available as an open-source npm tool, so developers can automatically stabilize flaky tests with minimal performance impact.

Abstract

Web end-to-end (e2e) testing evaluates the workflow of a web application. It simulates real-world user scenarios to ensure the application flows behave as expected. However, web e2e tests are notorious for being flaky, i.e., the tests can produce inconsistent results despite no changes to the code. One common type of flakiness is caused by nondeterministic execution orders between the test code and the client-side code under test. In particular, UI-based flakiness is notably prevalent and challenging to fix because the test code has limited knowledge about the client-side code execution. In this paper, we propose WEFix, a technique that can automatically generate fix code for UI-based flakiness in web e2e testing. The core of our approach is to leverage browser UI changes to predict the client-side code execution and generate proper wait oracles. We evaluate the effectiveness and efficiency of WEFix on 122 web e2e flaky tests from seven popular real-world projects. Our results show that WEFix dramatically reduces the overhead (from 3.7$\times$ to 1.25$\times$) while achieving high correctness (98%).
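
To make the idea of a wait oracle derived from UI changes more concrete, the following is a minimal sketch and not WEFix's actual algorithm. It assumes a hypothetical, simplified mutation record of the form `{ target, kind }`, where `kind` is `"added"` or `"removed"`, and it treats the last recorded mutation per target as the state the test should wait for. The function names `expectedFinalState` and `buildWaitOracle` are illustrative only.

```javascript
// Hypothetical simplified mutation record: { target: string, kind: "added" | "removed" }.
// This format is an assumption for illustration, not the format WEFix records.

// Collapse a sequence of mutations into the last observed state per target.
function expectedFinalState(mutations) {
  const state = new Map();
  for (const m of mutations) {
    state.set(m.target, m.kind); // later mutations override earlier ones
  }
  return state;
}

// Build a predicate that is true once the page matches the expected final state.
// `present` is a Set of selectors currently found in the DOM snapshot.
function buildWaitOracle(mutations) {
  const expected = expectedFinalState(mutations);
  return function oracle(present) {
    for (const [target, kind] of expected) {
      if (kind === "added" && !present.has(target)) return false;
      if (kind === "removed" && present.has(target)) return false;
    }
    return true;
  };
}

// Example: a spinner appears and disappears, then a result element is added.
const recorded = [
  { target: "#spinner", kind: "added" },
  { target: "#spinner", kind: "removed" },
  { target: "#result", kind: "added" },
];
const oracle = buildWaitOracle(recorded);
const whileLoading = oracle(new Set(["#spinner"])); // spinner still shown
const whenSettled = oracle(new Set(["#result"]));   // spinner gone, result shown
```

An explicit wait would poll such a predicate and let the next test command proceed as soon as it holds, rather than sleeping for a fixed interval regardless of page state.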

Paper Structure (23 sections, 2 equations, 6 figures, 4 tables, 2 algorithms)

Introduction
Background
Web E2e UI Test
UI-Based Flakiness
Flaky-Prone Commands
WEFix
Mutation Recorder
Test Code Runtime
Browser Runtime
Oracle Generator
Pruning Irrelevant Mutations
Mutation State Machine
Oracle Generation Algorithm
Adding Explicit Waits
Evaluation
...and 8 more sections

Figures (6)

Figure 1: The web page of the example shown in Listing \ref{lst:example}.
Figure 2: Interleaved mutations triggered by two consecutive commands.
Figure 3: The workflow of WEFix.
Figure 4: Mutation Recorder runtime.
Figure 5: Finite state machine (FSM) representing DOM tree status transitions via mutations $m_1, m_2, \ldots, m_j$.
...and 1 more figures
