Automating Structural Engineering Workflows with Large Language Model Agents
Haoran Liang, Yufa Zhou, Mohammad Talebi Kalaleh, Qipei Mei
TL;DR
MASSE presents a first-of-its-kind Multi-Agent System for Structural Engineering that couples LLM-based agents with real-world workflows to automate end-to-end design, analysis, and verification. By organizing agents into Analyst, Engineer, and Management teams and embedding memory and structured tool use (e.g., FEM solvers, code documents), MASSE achieves large reductions in expert workload while maintaining reliability. The authors validate MASSE with a domain-specific racking system dataset and show performance competitive with larger models, highlighting a strong trade-off between speed, cost, and accuracy and demonstrating real-world deployment potential. The work advances the practical application of retrieval-augmented, tool-assisted reasoning in safety-critical engineering and suggests broader applicability to other verbalizable, tool-mediated domains.
Abstract
We introduce $\textbf{MASSE}$, the first Multi-Agent System for Structural Engineering, effectively integrating large language model (LLM)-based agents with real-world engineering workflows. Structural engineering is a fundamental yet traditionally stagnant domain, with core workflows remaining largely unchanged for decades despite its substantial economic impact and global market size. Recent advancements in LLMs have significantly enhanced their ability to perform complex reasoning, long-horizon planning, and precise tool utilization -- capabilities well aligned with structural engineering tasks such as interpreting design codes, executing load calculations, and verifying structural capacities. We present a proof-of-concept showing that most real-world structural engineering workflows can be fully automated through a training-free LLM-based multi-agent system. MASSE enables immediate deployment in professional environments, and our comprehensive validation on real-world case studies demonstrates that it can reduce expert workload from approximately two hours to mere minutes, while enhancing both reliability and accuracy in practical engineering scenarios.
