Table of Contents
Fetching ...

Neural Embeddings for Web Testing

Kasun Kanaththage, Luigi Libero Lucio Starace, Matteo Biagiola, Paolo Tonella, Andrea Stocco

TL;DR

A model-based test generation approach that employs transformer-based Siamese neural networks (SNNs) to infer web application models more accurately, resulting in superior web app models with an average F-1 score improvement of 56%.

Abstract

Web test automation techniques often rely on crawlers to infer models of web applications for automated test generation. However, current crawlers rely on state equivalence algorithms that struggle to distinguish near-duplicate pages, often leading to redundant test cases and incomplete coverage of application functionality. In this paper, we present a model-based test generation approach that employs transformer-based Siamese neural networks (SNNs) to infer web application models more accurately. By learning similarity-based representations, SNNs capture structural and textual relationships among web pages, improving near-duplicate detection during crawling and enhancing the quality of inferred models, and thus, the effectiveness of generated test suites. Our evaluation across nine web apps shows that SNNs outperform state-of-the-art techniques in near-duplicate detection, resulting in superior web app models with an average F-1 score improvement of 56%. These enhanced models enable the generation of more effective test suites that achieve higher code coverage, with improvements ranging from 6% to 21% and averaging at 12%.

Neural Embeddings for Web Testing

TL;DR

A model-based test generation approach that employs transformer-based Siamese neural networks (SNNs) to infer web application models more accurately, resulting in superior web app models with an average F-1 score improvement of 56%.

Abstract

Web test automation techniques often rely on crawlers to infer models of web applications for automated test generation. However, current crawlers rely on state equivalence algorithms that struggle to distinguish near-duplicate pages, often leading to redundant test cases and incomplete coverage of application functionality. In this paper, we present a model-based test generation approach that employs transformer-based Siamese neural networks (SNNs) to infer web application models more accurately. By learning similarity-based representations, SNNs capture structural and textual relationships among web pages, improving near-duplicate detection during crawling and enhancing the quality of inferred models, and thus, the effectiveness of generated test suites. Our evaluation across nine web apps shows that SNNs outperform state-of-the-art techniques in near-duplicate detection, resulting in superior web app models with an average F-1 score improvement of 56%. These enhanced models enable the generation of more effective test suites that achieve higher code coverage, with improvements ranging from 6% to 21% and averaging at 12%.
Paper Structure (46 sections, 4 equations, 6 figures, 7 tables, 1 algorithm)

This paper contains 46 sections, 4 equations, 6 figures, 7 tables, 1 algorithm.

Figures (6)

  • Figure 1: Left: Hotel reservation web app: app states with actionable outlined. Right: An inferred model with near-duplicates.
  • Figure 2: BCE-based SNN Architecture.
  • Figure 3: Triplet-Based SNN Architecture
  • Figure 4: Overview of our approach.
  • Figure 5: Chunking with a 128-token overlap.
  • ...and 1 more figures