Foundational Large Language Models for Materials Research

Vaibhav Mishra; Somaditya Singh; Dhruv Ahlawat; Mohd Zaki; Vaibhav Bihani; Hargun Singh Grover; Biswajit Mishra; Santiago Miret; Mausam; N. M. Anoop Krishnan

Foundational Large Language Models for Materials Research

Vaibhav Mishra, Somaditya Singh, Dhruv Ahlawat, Mohd Zaki, Vaibhav Bihani, Hargun Singh Grover, Biswajit Mishra, Santiago Miret, Mausam, N. M. Anoop Krishnan

TL;DR

LLaMat introduces domain-adapted foundation models for materials science by three-stage development: continued pretraining on a materials-focused corpus with CIF data, followed by instruction tuning and task-specific finetuning. The two main variants, LLaMat-Chat and LLaMat-CIF, demonstrate strong performance across MatNLP, MatSIE, and crystal-structure generation benchmarks, often outperforming larger general-purpose LLMs. A notable finding is 'adaptation rigidity,' where some larger pre-trained models (e.g., LLaMA-3) underperform relative to smaller, domain-adapted counterparts on certain tasks, highlighting the nuanced relationship between pretraining scale and domain adaptation. Together, these results support the feasibility of deployable, specialized AI copilots for materials research and offer guidance on model selection, training methodology, and domain-specific performance considerations for scientific AI systems.

Abstract

Materials discovery and development are critical for addressing global challenges. Yet, the exponential growth in materials science literature comprising vast amounts of textual data has created significant bottlenecks in knowledge extraction, synthesis, and scientific reasoning. Large Language Models (LLMs) offer unprecedented opportunities to accelerate materials research through automated analysis and prediction. Still, their effective deployment requires domain-specific adaptation for understanding and solving domain-relevant tasks. Here, we present LLaMat, a family of foundational models for materials science developed through continued pretraining of LLaMA models on an extensive corpus of materials literature and crystallographic data. Through systematic evaluation, we demonstrate that LLaMat excels in materials-specific NLP and structured information extraction while maintaining general linguistic capabilities. The specialized LLaMat-CIF variant demonstrates unprecedented capabilities in crystal structure generation, predicting stable crystals with high coverage across the periodic table. Intriguingly, despite LLaMA-3's superior performance in comparison to LLaMA-2, we observe that LLaMat-2 demonstrates unexpectedly enhanced domain-specific performance across diverse materials science tasks, including structured information extraction from text and tables, more particularly in crystal structure generation, a potential adaptation rigidity in overtrained LLMs. Altogether, the present work demonstrates the effectiveness of domain adaptation towards developing practically deployable LLM copilots for materials research. Beyond materials science, our findings reveal important considerations for domain adaptation of LLMs, such as model selection, training methodology, and domain-specific performance, which may influence the development of specialized scientific AI systems.

Foundational Large Language Models for Materials Research

TL;DR

Abstract

Foundational Large Language Models for Materials Research

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)