Table of Contents
Fetching ...

MARE: Multi-Agents Collaboration Framework for Requirements Engineering

Dongming Jin, Zhi Jin, Xiaohong Chen, Chunhui Wang

TL;DR

This work tackles end-to-end Requirements Engineering (RE) automation by introducing MARE, a multi-agent, LLM-driven framework that coordinates elicitation, modeling, verification, and specification through a shared workspace and nine prompts. Five specialized agents collaborate to generate intermediate artifacts and final software requirements specifications, outperforming three state-of-the-art baselines in modeling tasks and achieving strong human-evaluated spec quality. An ablation study confirms the value of inter-agent collaboration over single-LLM approaches. The approach demonstrates practical potential for automated RE workflows and informs future multi-agent, prompt-engineered pipeline designs in software engineering.

Abstract

Requirements Engineering (RE) is a critical phase in the software development process that generates requirements specifications from stakeholders' needs. Recently, deep learning techniques have been successful in several RE tasks. However, obtaining high-quality requirements specifications requires collaboration across multiple tasks and roles. In this paper, we propose an innovative framework called MARE, which leverages collaboration among large language models (LLMs) throughout the entire RE process. MARE divides the RE process into four tasks: elicitation, modeling, verification, and specification. Each task is conducted by engaging one or two specific agents and each agent can conduct several actions. MARE has five agents and nine actions. To facilitate collaboration between agents, MARE has designed a workspace for agents to upload their generated intermediate requirements artifacts and obtain the information they need. We conduct experiments on five public cases, one dataset, and four new cases created by this work. We compared MARE with three baselines using three widely used metrics for the generated requirements models. Experimental results show that MARE can generate more correct requirements models and outperform the state-of-the-art approaches by 15.4%. For the generated requirements specifications, we conduct a human evaluation in three aspects and provide insights about the quality

MARE: Multi-Agents Collaboration Framework for Requirements Engineering

TL;DR

This work tackles end-to-end Requirements Engineering (RE) automation by introducing MARE, a multi-agent, LLM-driven framework that coordinates elicitation, modeling, verification, and specification through a shared workspace and nine prompts. Five specialized agents collaborate to generate intermediate artifacts and final software requirements specifications, outperforming three state-of-the-art baselines in modeling tasks and achieving strong human-evaluated spec quality. An ablation study confirms the value of inter-agent collaboration over single-LLM approaches. The approach demonstrates practical potential for automated RE workflows and informs future multi-agent, prompt-engineered pipeline designs in software engineering.

Abstract

Requirements Engineering (RE) is a critical phase in the software development process that generates requirements specifications from stakeholders' needs. Recently, deep learning techniques have been successful in several RE tasks. However, obtaining high-quality requirements specifications requires collaboration across multiple tasks and roles. In this paper, we propose an innovative framework called MARE, which leverages collaboration among large language models (LLMs) throughout the entire RE process. MARE divides the RE process into four tasks: elicitation, modeling, verification, and specification. Each task is conducted by engaging one or two specific agents and each agent can conduct several actions. MARE has five agents and nine actions. To facilitate collaboration between agents, MARE has designed a workspace for agents to upload their generated intermediate requirements artifacts and obtain the information they need. We conduct experiments on five public cases, one dataset, and four new cases created by this work. We compared MARE with three baselines using three widely used metrics for the generated requirements models. Experimental results show that MARE can generate more correct requirements models and outperform the state-of-the-art approaches by 15.4%. For the generated requirements specifications, we conduct a human evaluation in three aspects and provide insights about the quality
Paper Structure (15 sections, 4 figures, 7 tables)

This paper contains 15 sections, 4 figures, 7 tables.

Figures (4)

  • Figure 1: The overview of our MARE
  • Figure 2: The profile of the Collector agent.
  • Figure 3: The mechanism of shared workspace.
  • Figure 4: The action migration diagram in MARE