Fast Low Energy Reconstruction using Convolutional Neural Networks
IceCube Collaboration
TL;DR
This work addresses the challenge of reconstructing sub-100 GeV neutrino interactions in IceCube-DeepCore for atmospheric oscillations. It develops five CNNs with a two-branch input architecture to estimate energy, direction, and vertex, plus two classifiers for track-like vs cascade-like events and for atmospheric muon background, trained on GENIE and MuonGun MC. The CNNs achieve competitive or improved energy and zenith resolution and have GPU-accelerated inference enabling large-scale analyses, with runtimes far faster than traditional RETRO methods while delivering comparable performance. These reconstructions support higher effective event yields and feed into precise $\nu_\mu$ disappearance measurements, with ongoing work exploring azimuth reconstruction, uncertainty estimation, and graph neural network approaches for further gains.
Abstract
IceCube is a Cherenkov detector instrumenting over a cubic kilometer of glacial ice deep under the surface of the South Pole. The DeepCore sub-detector lowers the detection energy threshold to a few GeV, enabling the precise measurements of neutrino oscillation parameters with atmospheric neutrinos. The reconstruction of neutrino interactions inside the detector is essential in studying neutrino oscillations. It is particularly challenging to reconstruct sub-100 GeV events with the IceCube detectors due to the relatively sparse detection units and detection medium. Convolutional neural networks (CNNs) are broadly used in physics experiments for both classification and regression purposes. This paper discusses the CNNs developed and employed for the latest IceCube-DeepCore oscillation measurements. These CNNs estimate various properties of the detected neutrinos, such as their energy, direction of arrival, interaction vertex position, flavor-related signature, and are also used for background classification.
