Learning Electromagnetic Metamaterial Physics With ChatGPT
Darui Lu, Yang Deng, Jordan M. Malof, Willie J. Padilla
TL;DR
This work investigates whether a fine-tuned ChatGPT-3.5 can learn the physics of all-dielectric metamaterials by predicting the absorptivity spectrum from a 14-parameter unit-cell geometry. By encoding geometry and spectra as text, the authors train FT-LLM to perform forward predictions and compare it against traditional ML models across varying dataset sizes, finding that FT-LLM achieves competitive mean absolute relative error ($\text{MARE}$) with deep neural networks and, with larger datasets, comparable mean squared error ($\text{MSE}$). The study further probes interpretability and inverse design, finding that FT-LLM can provide textual explanations but does not outperform baselines in interpretability or inverse-design performance, especially when handling larger outputs. These results suggest LLMs can serve as data-efficient surrogates for forward metamaterial physics and data summarization, though practical deployment for design tasks requires further advances in reliability and inverse-design capabilities. Overall, the paper highlights the potential and current limitations of applying large language models to physics-based metamaterial research.
Abstract
Large language models (LLMs) such as ChatGPT, Gemini, LlaMa, and Claude are trained on massive quantities of text parsed from the internet and have shown a remarkable ability to respond to complex prompts in a manner often indistinguishable from humans. For all-dielectric metamaterials consisting of unit cells with four elliptical resonators, we present a LLM fine-tuned on up to 40,000 data that can predict the absorptivity spectrum given a text prompt that only specifies the metasurface geometry. Results are compared to conventional machine learning approaches including feed-forward neural networks, random forest, linear regression, and K-nearest neighbor (KNN). Remarkably, the fine-tuned LLM (FT-LLM) achieves a comparable performance across large dataset sizes with a deep neural network. We also explore inverse problems by asking the LLM to predict the geometry necessary to achieve a desired spectrum. LLMs possess several advantages over humans that may give them benefits for research, including the ability to process enormous amounts of data, find hidden patterns in data, and operate in higher-dimensional spaces. This suggests they may be able to leverage their general knowledge of the world to learn faster from training data than traditional models, making them valuable tools for research and analysis.
