Sequencing Silicates in the IRS Debris Disk Catalog I: Methodology for Unsupervised Clustering
Cicero X. Lu, Tushar Mittal, Christine H. Chen, Alexis Y. Li, Kadin Worthen, B. A. Sargent, Carey M. Lisse, G. C. Sloan, Dean C. Hines, Dan M. Watson, Isabel Rebollido, Bin B. Ren, Joel D. Green
TL;DR
The paper presents CLUES, a non-parametric, fully interpretable unsupervised framework for clustering Spitzer IRS debris-disk spectra to reveal mineralogical groupings. It combines a rigorous preprocessing pipeline (photosphere subtraction, emissivity normalization, continuum modeling, binning) with the Sequencer distance-based workflow, enabling MST and hierarchical clustering driven by a distance scale and the Earth-Mover Distance. The approach is demonstrated on a forsterite spectral library, a meteorite ensemble, and a debris-disk spectrum, laying groundwork for broader mineralogical demographics and follow-up studies with JWST and other observatories. By enabling objective, scalable extraction of end-member spectra, CLUES advances our understanding of debris-disk composition and its links to planetary formation processes, with potential applicability to protoplanetary disks and remote-sensing spectroscopy.
Abstract
Debris disks, which consist of dust, planetesimals, planets, and gas, offer a unique window into the mineralogical composition of their parent bodies, especially during the critical phase of terrestrial planet formation spanning 10 to a few hundred million years. Observations from the $\textit{Spitzer}$ Space Telescope have unveiled thousands of debris disks, yet systematic studies remain scarce, let alone those with unsupervised clustering techniques. This study introduces $\texttt{CLUES}$ (CLustering UnsupErvised with Sequencer), a novel, non-parametric, fully-interpretable machine-learning spectral analysis tool designed to analyze and classify the spectral data of debris disks. $\texttt{CLUES}$ combines multiple unsupervised clustering methods with multi-scale distance measures to discern new groupings and trends, offering insights into compositional diversity and geophysical processes within these disks. Our analysis allows us to explore a vast parameter space in debris disk mineralogy and also offers broader applications in fields such as protoplanetary disks and solar system objects. This paper details the methodology, implementation, and initial results of $\texttt{CLUES}$, setting the stage for more detailed follow-up studies focusing on debris disk mineralogy and demographics.
