Encoding and Understanding Astrophysical Information in Large Language Model-Generated Summaries
Kiera McCormick, Rafael Martínez-Galarza
TL;DR
The paper asks whether LLM-generated summaries can encode physical information derived from astrophysical measurements and how prompt design influences this encoding. It combines prompt engineering, LLM-derived summaries of Chandra X-ray source observations, and sparse autoencoders to probe the link between language and data-derived physics, showing that structured prompts significantly improve the alignment of textual embeddings with physical measurements. Notably, improvements in clustering purity for hardness ratio, power-law gamma, and variability index demonstrate that language can reflect underlying physics when prompts are carefully crafted, while SAEs reveal interpretable textual features associated with specific physical regimes. This work provides a pathway for extracting and interpreting physics from textual descriptions, enabling cross-modal analyses and informing future development of data-informed NLP tools in astrophysics.
Abstract
Large Language Models have demonstrated the ability to generalize well at many levels across domains, modalities, and even shown in-context learning capabilities. This enables research questions regarding how they can be used to encode physical information that is usually only available from scientific measurements, and loosely encoded in textual descriptions. Using astrophysics as a test bed, we investigate if LLM embeddings can codify physical summary statistics that are obtained from scientific measurements through two main questions: 1) Does prompting play a role on how those quantities are codified by the LLM? and 2) What aspects of language are most important in encoding the physics represented by the measurement? We investigate this using sparse autoencoders that extract interpretable features from the text.
