Self-supervised feature distillation and design of experiments for efficient training of micromechanical deep learning surrogates
Patxi Fernandez-Zelaia, Jason Mayeur, Jiahao Cheng, Yousub Lee, Kevin Knipe, Kai Kadau
TL;DR
The paper addresses the challenge of efficiently training micromechanical deep learning surrogates by coupling intelligent microstructure representation with principled experimental design. It introduces three feature descriptors (VAE latent, self-supervised contrastive statistics, and classical GSH-based descriptors) and evaluates three space-filling designs (maximin, maxPro, and data twinning) within a CNN/U-net surrogate framework trained on elastic micromechanical simulations. Across a parametric study, the contrastive self-supervised features with a maximin-type design (contrastive cMm) offer robust improvements, with up to about 8% reduction in validation loss for moderate data sizes and reasonable stability across settings; data twinning provides a steady ~5% boost. The results demonstrate that careful experimental design and meaningful microstructure statistics substantially enhance surrogate performance and are likely to become more beneficial as problem sizes grow, enabling scalable micromechanical surrogates and informing broader image-based design tasks.
Abstract
Machine learning surrogate emulators are needed in engineering design and optimization tasks to rapidly emulate computationally expensive physics-based models. In micromechanics problems the local full-field response variables are desired at microstructural length scales. While there has been a great deal of work on establishing architectures for these tasks there has been relatively little work on establishing microstructural experimental design strategies. This work demonstrates that intelligent selection of microstructural volume elements for subsequent physics simulations enables the establishment of more accurate surrogate models. There exist two key challenges towards establishing a suitable framework: (1) microstructural feature quantification and (2) establishment of a criteria which encourages construction of a diverse training data set. Three feature extraction strategies are used as well as three design criteria. A novel contrastive feature extraction approach is established for automated self-supervised extraction of microstructural summary statistics. Results indicate that for the problem considered up to a 8\% improvement in surrogate performance may be achieved using the proposed design and training strategy. Trends indicate this approach may be even more beneficial when scaled towards larger problems. These results demonstrate that the selection of an efficient experimental design is an important consideration when establishing machine learning based surrogate models.
