Few-Shot Test-Time Optimization Without Retraining for Semiconductor Recipe Generation and Beyond
Shangding Gu, Donghao Ying, Ming Jin, Yu Joe Lu, Jun Wang, Javad Lavaei, Costas Spanos
TL;DR
The paper tackles the challenge of adapting deployed models or hardware without retraining. It introduces Model Feedback Learning (MFL), a two-loop input-optimization framework that uses a fixed emulator $\mathcal{E}$ and a learnable reverse emulator $\mathcal{R}$ to map targets $Z'$ to inputs $X'$ for a fixed model $\mathcal{M}$, with Loop A aligning $\mathcal{R}$ to $\mathcal{E}$ and Loop B refining toward $\mathcal{M}$. A conservative learning strategy based on model sensitivities $s_{\mathcal{E}}(x)$ and $s_{\mathcal{M}}(x)$ stabilizes updates, and domain randomization improves robustness. Empirically, MFL achieves target semiconductor plasma-etching recipes in as few as five iterations, outperforming Bayesian optimization and human experts, and it generalizes to chemical vapor deposition and wire bonding while maintaining robustness to process variability. The work demonstrates that retraining-free input optimization can be both data-efficient and broadly applicable to real-world manufacturing and control tasks.
Abstract
We introduce Model Feedback Learning (MFL), a novel test-time optimization framework for optimizing inputs to pre-trained AI models or deployed hardware systems without requiring any retraining of the models or modifications to the hardware. In contrast to existing methods that rely on adjusting model parameters, MFL leverages a lightweight reverse model to iteratively search for optimal inputs, enabling efficient adaptation to new objectives under deployment constraints. This framework is particularly advantageous in real-world settings, such as semiconductor manufacturing recipe generation, where modifying deployed systems is often infeasible or cost-prohibitive. We validate MFL on semiconductor plasma etching tasks, where it achieves target recipe generation in just five iterations, significantly outperforming both Bayesian optimization and human experts. Beyond semiconductor applications, MFL also demonstrates strong performance in chemical processes (e.g., chemical vapor deposition) and electronic systems (e.g., wire bonding), highlighting its broad applicability. Additionally, MFL incorporates stability-aware optimization, enhancing robustness to process variations and surpassing conventional supervised learning and random search methods in high-dimensional control settings. By enabling few-shot adaptation, MFL provides a scalable and efficient paradigm for deploying intelligent control in real-world environments.
