Identifying Performance-Sensitive Configurations in Software Systems through Code Analysis with LLM Agents

Zehao Wang; Dong Jae Kim; Tse-Hsun Chen

Identifying Performance-Sensitive Configurations in Software Systems through Code Analysis with LLM Agents

Zehao Wang, Dong Jae Kim, Tse-Hsun Chen

TL;DR

PerfSense tackles the problem of identifying performance-sensitive configurations in large software systems by using an LLM-based two-agent setup (DevAgent and PerfAgent) that iteratively analyzes configuration-relevant code with prompt chaining and retrieval-augmented generation. The approach is zero-shot and unsupervised, designed to minimize manual effort while handling large codebases via call-graph analysis and document retrieval. Empirical evaluation on seven open-source Java systems shows PerfSense achieving an average accuracy of $64.77\%$, outperforming the state-of-the-art DiagConfig and a ChatGPT baseline, with notable gains in recall when using prompt chaining. The results also include a detailed misclassification analysis and discuss practical considerations for adopting LLM-based code analysis in software performance engineering.

Abstract

Configuration settings are essential for tailoring software behavior to meet specific performance requirements. However, incorrect configurations are widespread, and identifying those that impact system performance is challenging due to the vast number and complexity of possible settings. In this work, we present PerfSense, a lightweight framework that leverages Large Language Models (LLMs) to efficiently identify performance-sensitive configurations with minimal overhead. PerfSense employs LLM agents to simulate interactions between developers and performance engineers using advanced prompting techniques such as prompt chaining and retrieval-augmented generation (RAG). Our evaluation of seven open-source Java systems demonstrates that PerfSense achieves an average accuracy of 64.77% in classifying performance-sensitive configurations, outperforming both our LLM baseline (50.36%) and the previous state-of-the-art method (61.75%). Notably, our prompt chaining technique improves recall by 10% to 30% while maintaining similar precision levels. Additionally, a manual analysis of 362 misclassifications reveals common issues, including LLMs' misunderstandings of requirements (26.8%). In summary, PerfSense significantly reduces manual effort in classifying performance-sensitive configurations and offers valuable insights for future LLM-based code analysis research.

Identifying Performance-Sensitive Configurations in Software Systems through Code Analysis with LLM Agents

TL;DR

, outperforming the state-of-the-art DiagConfig and a ChatGPT baseline, with notable gains in recall when using prompt chaining. The results also include a detailed misclassification analysis and discuss practical considerations for adopting LLM-based code analysis in software performance engineering.

Abstract

Paper Structure (23 sections, 5 figures, 4 tables)

This paper contains 23 sections, 5 figures, 4 tables.

Introduction
Background
Performance-Sensitive Configurations
LLM-based Multi-agent Framework
Related Work
Performance Analysis of Configuration
Using LLMs to Analyze Configuration
Multi-Agent Based Code Analysis
Design of PerfSense
Agent Roles and Definition
Developer Agent: Retrieving Configuration-Related Code
Performance Expert Agent: Analyzing the Performance Sensitivity of Configuration
Multi-Agent Communications
Prompt Chaining to Iteratively Build Code Understanding
Retrieval Augmented Generation for Performance Classifier
...and 8 more sections

Figures (5)

Figure 1: Overview of PerfSense
Figure 2: An example of performance-sensitive configuration.
Figure 3: DevAgent's Performance-Aware Code Review.
Figure 4: PerfAgent's Prompt for Code Understanding.
Figure 5: Prompt Template 3: Retrieval Augmented Generation for Performance Classifier

Identifying Performance-Sensitive Configurations in Software Systems through Code Analysis with LLM Agents

TL;DR

Abstract

Identifying Performance-Sensitive Configurations in Software Systems through Code Analysis with LLM Agents

Authors

TL;DR

Abstract

Table of Contents

Figures (5)