Disk mass predictions for binary neutron star mergers: limitations of proposed symbolic regression models
Francois Foucart
TL;DR
The paper challenges the claim that symbolic regression (SR) automatically yields superior and more robust disk-mass fits for binary neutron star mergers. It shows that the apparent SR advantage in Darc et al. largely arises from using a different error metric than the one underlying traditional fits, and that when evaluated with a common metric, SR performance is comparable to existing, physics-inspired fits. The analysis reveals that many SR formulas extrapolate unphysically outside the training region, especially at high lower-star compactness $C_1$, making them risky for parameter estimation pipelines. It argues that physics-informed fits, such as Lund 2025, remain safer with current data, though SR could become useful with more data and careful physics-based vetting for higher-dimensional models. The study uses training data from Kruger 2020 and testing data from Nedora 2020, emphasizes the role of parameter space coverage and error modeling, and ultimately advocates a cautious, multi-model approach to quantify modeling uncertainty in EM counterparts of mergers.
Abstract
Modeling disk formation and mass ejection in binary neutron star systems is an important component in the construction of models for the electromagnetic signals powered by these events. Most models rely on analytical formulae for the disk mass and dynamical ejecta that are fitted to the results of numerical simulations, yet these fits have large uncertainties that significantly limit our ability to extract information from merger observations. In a recent manuscript, Darc et al claim that disk mass formulae constructed using symbolic regression outperform existing formulae and robustly extend to regions of the parameter space outside of the fitting region. I show here that the improvement over the most directly comparable existing model comes mostly from the use of different error measures in optimizing the fitting parameters. For the limited training data used so far, that existing fitting formula has a performance similar to symbolic regression models when optimized over the same error measure. More importantly, I show that many of the formulae obtained through symbolic regression provide unphysical results when used over the whole range of parameters relevant to the modeling of binary neutron star mergers, making them dangerous to use within parameter estimation pipelines. I conclude that fitting formulae with more physics input (e.g. Lund et al 2025), albeit certainly imperfect, remain safer to use in data analysis than these symbolic regression results. Symbolic regression results used in conjunction with careful physics-based vetting may however outperform them in the future.
