In-Situ Melt Pool Characterization via Thermal Imaging for Defect Detection in Directed Energy Deposition Using Vision Transformers

Israt Zarin Era; Fan Zhou; Ahmed Shoyeb Raihan; Imtiaz Ahmed; Alan Abul-Haj; James Craig; Srinjoy Das; Zhichao Liu

In-Situ Melt Pool Characterization via Thermal Imaging for Defect Detection in Directed Energy Deposition Using Vision Transformers

Israt Zarin Era, Fan Zhou, Ahmed Shoyeb Raihan, Imtiaz Ahmed, Alan Abul-Haj, James Craig, Srinjoy Das, Zhichao Liu

TL;DR

Directed Energy Deposition melt-pool porosity detection is challenged by scarce labeled data. The authors propose a self-supervised learning pipeline using a Vision Transformer-based Masked Autoencoder (MAE) pretrained on unlabeled melt-pool images, then transfer-learned to two classifiers trained on a small labeled set. On in-situ thermal images from a custom DED setup, the approach achieves high accuracy (0.954–0.992) with an average accuracy of 98.2% and average F1 of 82.8% for the ViT-based classifier, demonstrating strong defect-detection performance with limited labels. The work highlights the practicality of SSL ViTs for scalable, low-labeled data quality control in DED melt-pool monitoring and points to future improvements with alternative losses and semi-supervised strategies.

Abstract

Directed Energy Deposition (DED) offers significant potential for manufacturing complex and multi-material parts. However, internal defects such as porosity and cracks can compromise mechanical properties and overall performance. This study focuses on in-situ monitoring and characterization of melt pools associated with porosity, aiming to improve defect detection and quality control in DED-printed parts. Traditional machine learning approaches for defect identification rely on extensive labeled datasets, often scarce and expensive to generate in real-world manufacturing. To address this, our framework employs self-supervised learning on unlabeled melt pool data using a Vision Transformer-based Masked Autoencoder (MAE) to produce highly representative embeddings. These fine-tuned embeddings are leveraged via transfer learning to train classifiers on a limited labeled dataset, enabling the effective identification of melt pool anomalies. We evaluate two classifiers: (1) a Vision Transformer (ViT) classifier utilizing the fine-tuned MAE Encoder's parameters and (2) the fine-tuned MAE Encoder combined with an MLP classifier head. Our framework achieves overall accuracy ranging from 95.44% to 99.17% and an average F1 score exceeding 80%, with the ViT Classifier slightly outperforming the MAE Encoder Classifier. This demonstrates the scalability and cost-effectiveness of our approach for automated quality control in DED, effectively detecting defects with minimal labeled data.

In-Situ Melt Pool Characterization via Thermal Imaging for Defect Detection in Directed Energy Deposition Using Vision Transformers

TL;DR

Abstract

In-Situ Melt Pool Characterization via Thermal Imaging for Defect Detection in Directed Energy Deposition Using Vision Transformers

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)