Teacher-Student Guided Inverse Modeling for Steel Final Hardness Estimation
Ahmad Alsheikh, Andreas Fischer
TL;DR
The paper tackles inverse steel heat-treatment planning by addressing the many-to-one mapping that links input conditions to final hardness. It introduces a Teacher-Student framework where a fixed forward Teacher predicts hardness from $13$ inputs, while an inverse Student learns to generate plausible input configurations from a target hardness through iterative supervision. On a public tempered steel dataset, the approach achieves high $R^2$ (≈0.98) and rapid convergence, outperforming baseline regressors and a TD3 reinforcement learning agent in both accuracy and training time. This method offers a data-efficient, interpretable route for inverse process modeling with potential for real-time control and extensions to uncertainty and diversity in the predicted inputs.
Abstract
Predicting the final hardness of steel after heat treatment is a challenging regression task due to the many-to-one nature of the process -- different combinations of input parameters (such as temperature, duration, and chemical composition) can result in the same hardness value. This ambiguity makes the inverse problem, estimating input parameters from a desired hardness, particularly difficult. In this work, we propose a novel solution using a Teacher-Student learning framework. First, a forward model (Teacher) is trained to predict final hardness from 13 metallurgical input features. Then, a backward model (Student) is trained to infer plausible input configurations from a target hardness value. The Student is optimized by leveraging feedback from the Teacher in an iterative, supervised loop. We evaluate our method on a publicly available tempered steel dataset and compare it against baseline regression and reinforcement learning models. Results show that our Teacher-Student framework not only achieves higher inverse prediction accuracy but also requires significantly less computational time, demonstrating its effectiveness and efficiency for inverse process modeling in materials science.
