Reinforcement Learning on Reconfigurable Hardware: Overcoming Material Variability in Laser Material Processing
Giulio Masinelli, Chang Rajani, Patrik Hoffmann, Kilian Wasmer, David Atienza
TL;DR
Problem: variability in laser welding caused by material properties and surface conditions degrades weld quality. Approach: a real-time reinforcement learning controller implemented on an FPGA drives laser power using two optical sensor inputs, with server-side Soft Actor-Critic training and a simple reward based on the optical reflection signal $r(s_{t+1}) = \frac{OR(s_{t+1})}{10}$. Contributions: real-time closed-loop control on FPGA enabling microsecond-scale reactions, autonomous adaptation to surface variation without prior tuning, validation on 316L stainless steel samples across brushed, sandblasted, and mixed surfaces, and post-fabrication imaging showing larger melt pools and reduced porosity compared with constant power strategies. Significance: demonstrates rapid, automated optimization for high-speed manufacturing and suggests routes to extend to other laser processes using richer sensing and domain randomization.
Abstract
Ensuring consistent processing quality is challenging in laser processes due to varying material properties and surface conditions. Although some approaches have shown promise in solving this problem via automation, they often rely on predetermined targets or are limited to simulated environments. To address these shortcomings, we propose a novel real-time reinforcement learning approach for laser process control, implemented on a Field Programmable Gate Array to achieve real-time execution. Our experimental results from laser welding tests on stainless steel samples with a range of surface roughnesses validated the method's ability to adapt autonomously, without relying on reward engineering or prior setup information. Specifically, the algorithm learned the correct power profile for each unique surface characteristic, demonstrating significant improvements over hand-engineered optimal constant power strategies -- up to 23% better performance on rougher surfaces and 7% on mixed surfaces. This approach represents a significant advancement in automating and optimizing laser processes, with potential applications across multiple industries.
