Accelerating kinetic plasma simulations with machine learning generated initial conditions
Andrew T. Powis, Domenica Corona Rivera, Alexander Khrabry, Igor D. Kaganovich
TL;DR
This work addresses the computational bottleneck of reaching quasi-steady states in kinetic plasma simulations by introducing machine-learning–generated initial conditions (ICGs) that steer PIC simulations toward convergence more quickly. It develops and evaluates three ML approaches—MLP for ion density, PCA+MLP, and CNN for IVDF—trained on 1D-3V CCP simulations across varying frequency and pressure, and defines robust offline and online convergence criteria. The CNN-based IVDF ICG achieves the highest acceleration, with up to 17.1x offline and 4.4x online speedups, while PCA+MLP and MLP offer substantial but smaller gains, all while preserving final converged states. A workflow for iterative data-driven model improvement and digital-twin generation is outlined, highlighting practical implications for rapid design optimization and real-time control in CCP-based microelectronics manufacturing. The results demonstrate a viable path to hybrid ML+HPC acceleration of multi-time-scale plasma simulations and motivate further development toward larger parameter spaces and higher-fidelity, multi-dimensional simulations.
Abstract
Computer aided engineering of multi-time-scale plasma systems which exhibit a quasi-steady state solution are challenging due to the large number of time steps required to reach convergence. Machine learning techniques combined with traditional first-principles simulations and high-performance computing offer many interesting pathways towards resolving this challenge. We consider acceleration of kinetic plasma simulations via machine learning generated initial conditions. The approach is demonstrated through modeling of capacitively coupled plasma discharges relevant to the microelectronics industry. Three models are trained on simulations across a parameter space of device driving frequency and operating pressure. The models incorporate elements of a multi-layer perceptron, principal component analysis, and convolutional neural networks to predict the final time-averaged profiles of ion-density and velocity distribution functions. These data-driven initial condition generators (ICGs) provide a mean speedup of 17.1x in convergence time, when measured using an offline procedure, or a 4.4x speedup with an online procedure, with convolutional neural networks leading to the best performance. The paper also outlines a workflow for continuous data-driven model improvement and simulation speedup, with the aim of generating sufficient data for full device digital twins.
