Agentic LLMs for REST API Test Amplification: A Comparative Study Across Cloud Applications

Jarne Besjes; Robbe Nooyens; Tolgahan Bardakci; Mutlu Beyazit; Serge Demeyer

Agentic LLMs for REST API Test Amplification: A Comparative Study Across Cloud Applications

Jarne Besjes, Robbe Nooyens, Tolgahan Bardakci, Mutlu Beyazit, Serge Demeyer

TL;DR

This work extends LLM-driven REST API test amplification to five cloud applications, evaluating both single-agent and multi-agent configurations to assess generalization, coverage, and efficiency. The authors combine a LangGraph-based workflow with OpenAPI retrieval and specialized planning/generation agents to produce executable, readable tests that expand endpoint, parameter, and input-space coverage while preserving semantics. Across varied architectures (production-grade and controlled environments), agentic LLM systems achieve high structural API coverage, with multi-agent setups offering broader input exploration at the cost of higher time, token, and energy expenditure. The study provides practical insights into trade-offs between accuracy, scalability, and sustainability, and outlines a path toward autonomous, CI-friendly REST API testing in complex cloud ecosystems.

Abstract

Representational State Transfer (REST) APIs are a cornerstone of modern cloud native systems. Ensuring their reliability demands automated test suites that exercise diverse and boundary level behaviors. Nevertheless, designing such test cases remains a challenging and resource intensive endeavor. This study extends prior work on Large Language Model (LLM) based test amplification by evaluating single agent and multi agent configurations across four additional cloud applications. The amplified test suites maintain semantic validity with minimal human intervention. The results demonstrate that agentic LLM systems can effectively generalize across heterogeneous API architectures, increasing endpoint and parameter coverage while revealing defects. Moreover, a detailed analysis of computational cost, runtime, and energy consumption highlights trade-offs between accuracy, scalability, and efficiency. These findings underscore the potential of LLM driven test amplification to advance the automation and sustainability of REST API testing in complex cloud environments.

Agentic LLMs for REST API Test Amplification: A Comparative Study Across Cloud Applications

TL;DR

Abstract

Agentic LLMs for REST API Test Amplification: A Comparative Study Across Cloud Applications

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (2)