Mephisto: Self-Improving Large Language Model-Based Agents for Automated Interpretation of Multi-band Galaxy Observations

Zechang Sun; Yuan-Sen Ting; Yaobo Liang; Nan Duan; Song Huang; Zheng Cai

Mephisto: Self-Improving Large Language Model-Based Agents for Automated Interpretation of Multi-band Galaxy Observations

Zechang Sun, Yuan-Sen Ting, Yaobo Liang, Nan Duan, Song Huang, Zheng Cai

TL;DR

Mephisto introduces a large language model–driven multi‑agent framework that emulates human scientific reasoning to interpret multi‑band galaxy observations through SED modeling with CIGALE. By combining tree‑search exploration, temporal memory, and a distillable external knowledge base, it achieves near‑grid‑search accuracy using only ~1% of the models and demonstrates robust performance on both COSMOS2020 galaxies and JWST‑identified Little Red Dots, including frontier cases discovered after typical LLM knowledge cutoffs. Ablation studies show memory and knowledge components substantially improve fit quality and transfer across galaxies, while cross‑LLM evaluations indicate that cost‑effective backbones can be viable when guided by prior distilled knowledge. The work highlights a path toward transparent, AI‑augmented astronomical workflows capable of scaling to billions of sources, albeit with current limitations in autonomy, model scope, and computational cost that invite future improvements in both models and data infrastructure.

Abstract

Astronomical research has long relied on human expertise to interpret complex data and formulate scientific hypotheses. In this study, we introduce Mephisto -- a multi-agent collaboration framework powered by large language models (LLMs) that emulates human-like reasoning for analyzing multi-band galaxy observations. Mephisto interfaces with the CIGALE codebase (a library of spectral energy distribution, SED, models) to iteratively refine physical models against observational data. It conducts deliberate reasoning via tree search, accumulates knowledge through self-play, and dynamically updates its knowledge base. Validated across diverse galaxy populations -- including the James Webb Space Telescope's recently discovered "Little Red Dot" galaxies -- we show that Mephisto demonstrates proficiency in inferring the physical properties of galaxies from multi-band photometry, positioning it as a promising research copilot for astronomers. Unlike prior black-box machine learning approaches in astronomy, Mephisto offers a transparent, human-aligned reasoning process that integrates seamlessly with existing research practices. This work underscores the possibility of LLM-driven agent-based research for astronomy, establishes a foundation for fully automated, end-to-end artificial intelligence (AI)-powered scientific workflows, and unlocks new avenues for AI-augmented discoveries in astronomy.

Mephisto: Self-Improving Large Language Model-Based Agents for Automated Interpretation of Multi-band Galaxy Observations

TL;DR

Abstract

Mephisto: Self-Improving Large Language Model-Based Agents for Automated Interpretation of Multi-band Galaxy Observations

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (13)