Workflow-Level Design Principles for Trustworthy GenAI in Automotive System Engineering

Chih-Hong Cheng; Brian Hsuan-Cheng Liao; Adam Molin; Hasan Esen

TL;DR

This work proposes workflow-level design principles for trustworthy GenAI integration and demonstrates them in an end-to-end automotive pipeline, from requirement delta identification to SysML v2 architecture update and re-testing.

Abstract

The adoption of large language models in safety-critical system engineering is constrained by trustworthiness, traceability, and alignment with established verification practices. We propose workflow-level design principles for trustworthy GenAI integration and demonstrate them in an end-to-end automotive pipeline, from requirement delta identification to SysML v2 architecture update and re-testing. First, we show that monolithic ("big-bang") prompting misses critical changes in large specifications, while section-wise decomposition with diversity sampling and lightweight NLP sanity checks improves completeness and correctness. Then, we propagate requirement deltas into SysML v2 models and validate updates via compilation and static analysis. Additionally, we ensure traceable regression testing by generating test cases through explicit mappings from specification variables to architectural ports and states, providing practical safeguards for GenAI used in safety-critical automotive engineering.
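The diversification-and-unification idea mentioned above can be illustrated with a minimal sketch. It assumes that "unification" means a plain set union over the section identifiers each model flags as changed, and that quality is scored with standard precision and recall against a manually labeled ground truth. The model names, section IDs, and function names below are hypothetical and are not taken from the paper.

```python
from typing import Dict, Set, Tuple


def unify(predictions: Dict[str, Set[str]]) -> Set[str]:
    """Return the union of section IDs flagged by any model (assumed unification rule)."""
    merged: Set[str] = set()
    for flagged in predictions.values():
        merged |= flagged
    return merged


def precision_recall(predicted: Set[str], relevant: Set[str]) -> Tuple[float, float]:
    """Compute precision and recall of predicted section IDs against ground truth."""
    true_positives = len(predicted & relevant)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(relevant) if relevant else 0.0
    return precision, recall


# Hypothetical per-model outputs: sections each model believes have changed.
predictions = {
    "model_a": {"SEC.1", "SEC.3"},
    "model_b": {"SEC.3", "SEC.4"},
    "model_c": {"SEC.2", "SEC.5"},
}
# Hypothetical ground truth: sections that actually changed between versions.
relevant = {"SEC.1", "SEC.2", "SEC.3", "SEC.4"}

union = unify(predictions)
single_p, single_r = precision_recall(predictions["model_a"], relevant)
union_p, union_r = precision_recall(union, relevant)
print(f"model_a: precision={single_p:.2f}, recall={single_r:.2f}")
print(f"union:   precision={union_p:.2f}, recall={union_r:.2f}")
```

In this toy data the union recovers every changed section at the cost of one false positive. That is the general trade-off of a union rule: recall cannot decrease, while precision may drop, which is why downstream checkers remain useful.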

Paper Structure (15 sections, 1 equation, 4 figures)

Introduction
Vision
Towards Trustworthy GenAI for System Engineering
Issues with GenAI
Design Principles for Trustworthy GenAI
Rigorous Workflow for Requirement Document Version Comparison
PDF Sectionization and Canonical Storage (A)
Decomposition into Neural and Classical Micro-Tasks (A,B)
Robustifying Neural Outputs via Diversification and Unification (C)
Designing Checkers (D)
Design Modification and Test Case Regeneration
From "big-bang" updates to incremental deltas (A, B)
Tool-supported validation for SysML v2 model updates (C, D)
Implications for test case regeneration under incremental design changes
Conclusion

Figures (4)

Figure 1: GenAI-assisted workflow from requirement updates to architecture changes and verification.
Figure 2: A revised workflow for comparing complicated documents while increasing trustworthiness.
Figure 3: Precision and recall per model setting in retrieving relevant sections across ASPICE v3.1 and v4. Union refers to combining the predictions of Qwen3:32B, Nemotron3:30B, and GPT-OSS:20B.
Figure 4: Last-mile syntax drift in requirement allocation: "::" is replaced by ".", which breaks compilation.
