Large Language Models for Superconductor Discovery
Suman Itani, Yibo Zhang, Ranjit Itani, Jiadong Zang
TL;DR
This work demonstrates an end-to-end workflow that uses large language models to (i) extract a comprehensive experimental database of superconductors from literature, (ii) fine-tune LLMs for superconductivity classification and $T_c$ regression, and (iii) perform inverse design to propose novel, chemically plausible superconducting compositions. The approach achieves competitive performance with traditional feature-based models, enables structure-aware predictions without hand-crafted descriptors, and produces novel materials and plausible candidates for experimental follow-up. By applying the trained predictors to external databases, the study showcases scalable discovery pipelines that surface unreported materials with $T_c$ above practical thresholds, highlighting the potential of LLM-driven data mining and materials design in superconductivity research. The work also identifies current limitations (e.g., CIF-based regression challenges, partial structural information) and outlines future paths, including multimodal fusion and physics-informed constraints, to further close the gap between language models and physics-driven materials predictions.
Abstract
Large language models (LLMs) offer new opportunities for automated data extraction and property prediction across materials science, yet their use in superconductivity research remains limited. Here we construct a large experimental database of 78,203 records, covering 19,058 unique compositions, extracted from scientific literature using an LLM-driven workflow. Each entry includes chemical composition, critical temperature, measurement pressure, structural descriptors, and critical fields. We fine-tune several open-source LLMs for three tasks: (i) classifying superconductors vs. non-superconductors, (ii) predicting the superconducting transition temperature directly from composition or structure-informed inputs, and (iii) inverse design of candidate compositions conditioned on target Tc. The fine-tuned LLMs achieve performance comparable to traditional feature-based models and in some cases exceed them, while substantially outperforming their base versions and capturing meaningful chemical and structural trends. The inverse-design model generates chemically plausible compositions, including 28% novel candidates not seen in training. Finally, applying the trained predictors to the GNoME database identifies unreported materials with predicted Tc > 10 K. Although unverified, these candidates illustrate how integrating an LLM-driven workflow can enable scalable hypothesis generation for superconductivity discovery.
