Table of Contents
Fetching ...

ATTest: Agent-Driven Tensor Testing for Deep Learning Library Modules

Zhengyu Zhan, Ye Shang, Jiawei Liu, Chunrong Fang, Quanjun Zhang, Zhenyu Chen

TL;DR

ATTest, an agent-driven tensor testing framework for module-level unit test generation, orchestrates a seven-stage pipeline, which encompasses constraint extraction and an iterative"generation-validation-repair"loop, to maintain testing stability and mitigate context-window saturation.

Abstract

The unit testing of Deep Learning (DL) libraries is challenging due to complex numerical semantics and implicit tensor constraints. Traditional Search-Based Software Testing (SBST) often suffers from semantic blindness, failing to satisfy the constraints of high-dimensional tensors, whereas Large Language Models (LLMs) struggle with cross-file context and unstable code modifications. This paper proposes ATTest, an agent-driven tensor testing framework for module-level unit test generation. ATTest orchestrates a seven-stage pipeline, which encompasses constraint extraction and an iterative "generation-validation-repair" loop, to maintain testing stability and mitigate context-window saturation. An evaluation on PyTorch and TensorFlow demonstrates that ATTest significantly outperforms state-of-the-art baselines such as PynguinML, achieving an average branch coverage of 55.60% and 54.77%, respectively. The results illustrate how agent-driven workflows bridge the semantic gap in numerical libraries while ensuring auditable test synthesis. Source code: https://github.com/iSEngLab/ATTest.git

ATTest: Agent-Driven Tensor Testing for Deep Learning Library Modules

TL;DR

ATTest, an agent-driven tensor testing framework for module-level unit test generation, orchestrates a seven-stage pipeline, which encompasses constraint extraction and an iterative"generation-validation-repair"loop, to maintain testing stability and mitigate context-window saturation.

Abstract

The unit testing of Deep Learning (DL) libraries is challenging due to complex numerical semantics and implicit tensor constraints. Traditional Search-Based Software Testing (SBST) often suffers from semantic blindness, failing to satisfy the constraints of high-dimensional tensors, whereas Large Language Models (LLMs) struggle with cross-file context and unstable code modifications. This paper proposes ATTest, an agent-driven tensor testing framework for module-level unit test generation. ATTest orchestrates a seven-stage pipeline, which encompasses constraint extraction and an iterative "generation-validation-repair" loop, to maintain testing stability and mitigate context-window saturation. An evaluation on PyTorch and TensorFlow demonstrates that ATTest significantly outperforms state-of-the-art baselines such as PynguinML, achieving an average branch coverage of 55.60% and 54.77%, respectively. The results illustrate how agent-driven workflows bridge the semantic gap in numerical libraries while ensuring auditable test synthesis. Source code: https://github.com/iSEngLab/ATTest.git
Paper Structure (9 sections, 1 equation, 3 figures, 1 table)

This paper contains 9 sections, 1 equation, 3 figures, 1 table.

Figures (3)

  • Figure 1: Workflow of ATTest.
  • Figure 2: Example of an analysis_plan.json.
  • Figure 3: Example of a generated unit test.