Retrieval Augmented (Knowledge Graph), and Large Language Model-Driven Design Structure Matrix (DSM) Generation of Cyber-Physical Systems
H. Sinan Bank, Daniel R. Herber
TL;DR
This work investigates automated DSM generation for cyber-physical systems using augmented language approaches (RAG) and graph-grounded retrieval (GraphRAG), validated on two representative use cases (a power screwdriver and a CubeSat). By comparing baseline LLMs, RAG, and GraphRAG across component-identification and relationship-extraction tasks, the study reveals that model architecture and prompt/reference configuration often drive performance more than sheer size, with context-grounded approaches yielding benefits in specific configurations. It also uncovers that simply aggregating external references can harm performance, underscoring the need for careful reference curation and alignment strategies. The authors provide open-source code to support reproducibility and further validation, highlighting both the potential and current limitations of grounding automated DSM generation in verified domain knowledge for CPS architecture design.
Abstract
We explore the potential of Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and Graph-based RAG (GraphRAG) for generating Design Structure Matrices (DSMs). We test these methods on two distinct use cases -- a power screwdriver and a CubeSat with known architectural references -- evaluating their performance on two key tasks: determining relationships between predefined components, and the more complex challenge of identifying components and their subsequent relationships. We measure the performance by assessing each element of the DSM and overall architecture. Despite design and computational challenges, we identify opportunities for automated DSM generation, with all code publicly available for reproducibility and further feedback from the domain experts.
