From Engineering Diagrams to Graphs: Digitizing P&IDs with Transformers
Jan Marius Stürmer, Marius Graumann, Tobias Koch
TL;DR
This work tackles the challenge of digitizing P&ID diagrams by introducing a transformer-based Relationformer that jointly detects symbols and their interconnections, addressing limitations of modular pipelines. It provides PID2Graph, the first public dataset with real-world P&IDs annotated with full graph structures, enabling reproducible evaluation. Empirical results show the Relationformer outperforms the modular baseline across multiple datasets, with notable gains in node detection and edge prediction and robust performance on real-world diagrams. The study demonstrates the practical value of end-to-end relational modeling for complex engineering diagrams and outlines the importance of realistic training data and a standardized evaluation framework.
Abstract
Digitizing engineering diagrams like Piping and Instrumentation Diagrams (P&IDs) plays a vital role in maintainability and operational efficiency of process and hydraulic systems. Previous methods typically decompose the task into separate steps such as symbol detection and line detection, which can limit their ability to capture the structure in these diagrams. In this work, a transformer-based approach leveraging the Relationformer that addresses this limitation by jointly extracting symbols and their interconnections from P&IDs is introduced. To evaluate our approach and compare it to a modular digitization approach, we present the first publicly accessible benchmark dataset for P&ID digitization, annotated with graph-level ground truth. Experimental results on real-world diagrams show that our method significantly outperforms the modular baseline, achieving over 25% improvement in edge detection accuracy. This research contributes a reproducible evaluation framework and demonstrates the effectiveness of transformer models for structural understanding of complex engineering diagrams. The dataset is available under https://zenodo.org/records/14803338.
