Machine Phenomenology: A Simple Equation Classifying Fast Radio Bursts
Yang Liu, Yuhao Lu, Rahim Moradi, Bo Yang, Bing Zhang, Wenbin Lin, Yu Wang
TL;DR
This work tackles identifying whether fast radio bursts originate from two distinct physical classes by deriving simple, interpretable equations from CHIME FRB observables. It couples human-guided feature selection and Buckingham π-based dimensionless grouping with symbolic regression, yielding two classification approaches: a power-law multiplier and Neural Dimensionless Regression (NDR). The Power-Law Model achieves high accuracy on Catalog 1 but shows limited generalizability, while the NDR approach produces a stable, dimensionally consistent equation that partitions FRBs into two Gaussian populations and generalizes well to Catalog 2. The results suggest two underlying FRB processes with distinct spectral-temporal traits and demonstrate a principled framework for physics-informed, interpretable discovery in astrophysical data.
Abstract
This work shows how human physical reasoning can guide machine-driven symbolic regression toward discovering empirical laws from observations. As an example, we derive a simple equation that classifies fast radio bursts (FRBs) into two distinct Gaussian distributions, indicating the existence of two physical classes. This human-AI workflow integrates feature selection, dimensional analysis, and symbolic regression: deep learning first analyzes CHIME Catalog 1 and identifies six independent parameters that collectively provide a complete description of FRBs; guided by Buckingham-$π$ analysis and correlation analysis, humans then construct dimensionless groups; finally, symbolic regression performed by the machine discovers the governing equation. When applied to the newer CHIME Catalog, the equation produces consistent results, demonstrating that it captures the underlying physics. This framework is applicable to a broad range of scientific domains.
