WRT-SAM: Foundation Model-Driven Segmentation for Generalized Weld Radiographic Testing
Yunyi Zhou, Kun Shi, Gang Hao
TL;DR
The paper presents elsarticle.cls, a LaTeX document class optimized for Elsevier journal submissions. It emphasizes minimizing package conflicts by preserving kernel interfaces and leveraging standard packages. The class offers multiple formatting options, including preprint and final styles (models 1+, 3+, and 5+), along with improved front matter handling and theorem environments, compatible with natbib and amsthm. Installation guidance via CTAN and Elsevier resources is provided, detailing build steps and local TEXMF deployment. Collectively, these features streamline manuscript preparation and ensure consistent, journal-ready formatting across submissions.
Abstract
Radiographic testing is a fundamental non-destructive evaluation technique for identifying weld defects and assessing quality in industrial applications due to its high-resolution imaging capabilities. Over the past decade, deep learning techniques have significantly advanced weld defect identification in radiographic images. However, conventional approaches, which rely on training small-scale, task-specific models on single-scenario datasets, exhibit poor cross-scenario generalization. Recently, the Segment Anything Model (SAM), a pre-trained visual foundation model trained on large-scale datasets, has demonstrated exceptional zero-shot generalization capabilities. Fine-tuning SAM with limited domain-specific data has yielded promising results in fields such as medical image segmentation and anomaly detection. To the best of our knowledge, this work is the first to introduce SAM-based segmentation for general weld radiographic testing images. We propose WRT-SAM, a novel weld radiographic defect segmentation model that leverages SAM through an adapter-based integration with a specialized prompt generator architecture. To improve adaptability to grayscale weld radiographic images, we introduce a frequency prompt generator module, which enhances the model's sensitivity to frequency-domain information. Furthermore, to address the multi-scale nature of weld defects, we incorporate a multi-scale prompt generator module, enabling the model to effectively extract and encode defect information across varying scales. Extensive experimental evaluations demonstrate that WRT-SAM achieves a recall of 78.87%, a precision of 84.04%, and an AUC of 0.9746, setting a new state-of-the-art (SOTA) benchmark. Moreover, the model exhibits superior zero-shot generalization performance, highlighting its potential for practical deployment in diverse radiographic testing scenarios.
