Detecting Moving Objects With Machine Learning
Wesley C. Fraser
TL;DR
The chapter surveys machine learning approaches to detecting moving objects in astronomical imagery, focusing on streak detection, moving-point sources in sequences, and shift-n-stack methods, with convolutional neural networks as the dominant tool and attention to overfitting. It juxtaposes classic non-ML pipelines (image differencing, tracklet linking, and digital tracking) with ML-enabled techniques across surveys and objects, including KBMOD, ATLAS, ZTF, and MOA applications. The discussion covers clustering-based filtering, direct-image CNN classifiers, and regression networks for photometry, emphasizing training data quality, augmentation, and cross-survey generalization, while highlighting practical metrics and potential failure modes. The analysis extends to big-data contexts such as Euclid and LSST, addressing non-linear shift-n-stack scalability and proposing architectural directions to maximize discovery depth, efficiency, and reliability in future Solar System surveys, with a call for robust validation practices. $O(t^4)$ is cited as the scaling for non-linear shift-n-stack with time baseline, underscoring the need for advanced ML solutions to harness the full potential of upcoming datasets.
Abstract
The scientific study of the Solar System's minor bodies ultimately starts with a search for those bodies. This chapter presents a review of the use of machine learning techniques to find moving objects, both natural and artificial, in astronomical imagery. After a short review of the classical non-machine learning techniques that are historically used, I review the relatively nascent machine learning literature, which can broadly be summarized into three categories: streak detection, detection of moving point sources in image sequences, and detection of moving sources in shift and stack searches. In most cases, convolutional neural networks are utilized, which is the obvious choice given the imagery nature of the inputs. In this chapter I present two example networks: a Residual Network I designed which is in use in various shift and stack searches, and a convolutional neural network that was designed for prediction of source brightnesses and their uncertainties in those same shift-stacks. In discussion of the literature and example networks, I discuss various pitfalls with the use of machine learning techniques, including a discussion on the important issue of overfitting. I discuss various pitfall associated with the use of machine learning techniques, and what I consider best practices to follow in the application of machine learning to a new problem, including methods for the creation of robust training sets, validation, and training to avoid overfitting.
