Table of Contents
Fetching ...

Fully Convolutional Spatiotemporal Learning for Microstructure Evolution Prediction

Michael Trimboli, Mohammed Alsubaie, Sirani M. Perera, Ke-Gang Wang, Xianqi Li

TL;DR

This work establishes a robust baseline for spatiotemporal learning in materials science and offers a scalable, data-driven alternative for fast and reliable microstructure simulations.

Abstract

Understanding and predicting microstructure evolution is fundamental to materials science, as it governs the resulting properties and performance of materials. Traditional simulation methods, such as phase-field models, offer high-fidelity results but are computationally expensive due to the need to solve complex partial differential equations at fine spatiotemporal resolutions. To address this challenge, we propose a deep learning-based framework that accelerates microstructure evolution predictions while maintaining high accuracy. Our approach utilizes a fully convolutional spatiotemporal model trained in a self-supervised manner using sequential images generated from simulations of microstructural processes, including grain growth and spinodal decomposition. The trained neural network effectively learns the underlying physical dynamics and can accurately capture both short-term local behaviors and long-term statistical properties of evolving microstructures, while also demonstrating generalization to unseen spatiotemporal domains and variations in configuration and material parameters. Compared to recurrent neural architectures, our model achieves state-of-the-art predictive performance with significantly reduced computational cost in both training and inference. This work establishes a robust baseline for spatiotemporal learning in materials science and offers a scalable, data-driven alternative for fast and reliable microstructure simulations.

Fully Convolutional Spatiotemporal Learning for Microstructure Evolution Prediction

TL;DR

This work establishes a robust baseline for spatiotemporal learning in materials science and offers a scalable, data-driven alternative for fast and reliable microstructure simulations.

Abstract

Understanding and predicting microstructure evolution is fundamental to materials science, as it governs the resulting properties and performance of materials. Traditional simulation methods, such as phase-field models, offer high-fidelity results but are computationally expensive due to the need to solve complex partial differential equations at fine spatiotemporal resolutions. To address this challenge, we propose a deep learning-based framework that accelerates microstructure evolution predictions while maintaining high accuracy. Our approach utilizes a fully convolutional spatiotemporal model trained in a self-supervised manner using sequential images generated from simulations of microstructural processes, including grain growth and spinodal decomposition. The trained neural network effectively learns the underlying physical dynamics and can accurately capture both short-term local behaviors and long-term statistical properties of evolving microstructures, while also demonstrating generalization to unseen spatiotemporal domains and variations in configuration and material parameters. Compared to recurrent neural architectures, our model achieves state-of-the-art predictive performance with significantly reduced computational cost in both training and inference. This work establishes a robust baseline for spatiotemporal learning in materials science and offers a scalable, data-driven alternative for fast and reliable microstructure simulations.
Paper Structure (23 sections, 15 equations, 11 figures, 1 table)

This paper contains 23 sections, 15 equations, 11 figures, 1 table.

Figures (11)

  • Figure 1: Flowchart of the proposed fully convolutional spatiotemporal framework for microstructure evolution prediction. An input sequence of length $T$ is first processed by a convolutional spatial encoder to extract microstructural spatial correlations and project the data into a compact latent feature space. The latent representations are then passed through stacked Gated Spatiotemporal Attention (gSTA) modules, which serve as the temporal translator of the network and model the dynamic evolution patterns of the microstructure. Within this module, the encoded features are propagated and transformed to generate forecast frames corresponding to future time steps $T+1$ through $T+T'$. Finally, a convolutional spatial decoder reconstructs the predicted microstructure fields by upsampling the latent representations back to the original spatial resolution, producing the output sequence.
  • Figure 2: Grain growth prediction (10 input frames, 90 output frames) using the proposed fully convolutional spatiotemporal model: Two representative test samples are shown (indexed numerically). (A) Predicted and ground-truth microstructure frames at $t=11, 25, 50, 75,$ and $100$; (B) RMSE and SSIM evaluated every 5 frames over the prediction horizon; (C--D) Grain-size distributions (GSDs) for prediction and ground truth at $t=25$ and $t=100$, respectively.
  • Figure 3: Accuracy metrics on grain growth prediction (10 input frames, 90 output frames): (A) Dataset-averaged RMSE and SSIM evaluated every 5 frames over the prediction horizon; (B) Temporal evolution of the mean grain area for predicted and ground-truth sequences (computed every 10 frames); (C) Corresponding linear regression fits highlighting agreement in coarsening trends between prediction and ground truth.
  • Figure 4: Grain Growth prediction (10 input frames, 190 output frames) using the proposed fully convolutional spatiotemporal model: Two representative test samples are shown (indexed numerically). (A) Predicted and ground-truth microstructure frames at t = 11, 50, 100, 150, 200; (B) RMSE and SSIM evaluated every 5 frames over the prediction horizon; (C--D) Grain size distributions (GSD) for prediction and ground truth at t=50 and t=200, respectively.
  • Figure 5: Accuracy metrics on grain growth prediction (10 input frames, 190 output frames): (A) Dataset-averaged RMSE and SSIM evaluated every 5 frames over the prediction horizon; (B) Temporal evolution of the mean grain area for predicted and ground-truth sequences (computed every 10 frames); (C) Corresponding linear regression fits highlighting agreement in coarsening trends between prediction and ground truth.
  • ...and 6 more figures