Neural network ensemble for computing cross sections for rotational transitions in H$_{2}$O + H$_{2}$O collisions
Bikramaditya Mandal, Dmitri Babikov, Phillip C. Stancil, Robert C. Forrey, Roman V. Krems, Naduvalath Balakrishnan
TL;DR
The paper tackles the computational bottleneck of obtaining rotationally inelastic cross sections for H$_2$O+H$_2$O collisions by developing a neural-network ensemble trained on MQCT data, covering 12 quantum-number inputs and both para/ortho symmetries. By transforming targets to $\log_{10}$ and carefully curating training data around the energy-gap $\Delta E$, the authors construct multiple NN models that interpolate state-to-state cross sections across collision energies. The approach achieves an average relative error around $4\times10^{-1}$ for cross sections (RMSE) and maintains thermally averaged cross sections accurate to roughly $13$–$14\%$ of MQCT values, while reducing computational costs by about a factor of 50. This enables rapid expansion of collision-rate databases for astrophysical modeling and is demonstrated to be robust enough to extend to other complex molecular systems and datasets.
Abstract
Water (H$_2$O) is one of the most abundant molecules in the universe and is found in a wide variety of astrophysical environments. Rotational transitions in H$_2$O + H$_2$O collisions are important in modeling environments rich in water molecules but they are computationally intractable using quantum mechanical methods. Here, we present a machine learning (ML) tool using an ensemble of neural networks (NNs) to predict cross sections to construct a database of rate coefficients for rotationally inelastic transitions in collisions of complex molecules such as water. The proposed methodology utilizes data computed with a mixed quantum-classical theory (MQCT). We illustrate that efficient ML models using NN can be built to accurately interpolate in the space of 12 quantum numbers for rotational transitions in two asymmetric top molecules, spanning both initial and final states. We examine various architectures of data corresponding to each collision energy, symmetry of water molecule, and excitation/de-excitation rotational transitions, and optimize the training/validation data sets. Using only about 10\% of the computed data for training, the NNs predict cross sections of state-to-state rotational transitions of H$_{2}$O + H$_{2}$O collision with average relative root mean square error of 0.409. Thermally averaged cross sections, computed using the predicted state-to-state cross sections ($\sim$90\%) and the data used for training and validation ($\sim$10\%) were compared against those obtained entirely from MQCT calculations. The agreement is found to be excellent with an average percent deviation of about $\sim$13.5\%. The methodology is robust, and thus, applicable to other complex molecular systems.
