Deep Learning in Astrophysics
Yuan-Sen Ting
TL;DR
The paper surveys how deep learning can complement classical statistics in astronomy by embedding physical priors and symmetries into neural architectures, enabling learning from massive but sparsely labeled data. It surveys methodological foundations (universal approximation, architecture-induced inductive biases, and physics-informed constraints), then details cross-cutting techniques (multi-scale surrogates, simulation-based inference, anomaly detection, foundation models, RL, and LLM-based agents). It emphasizes practical benefits—data efficiency, robustness to domain shifts, and field-level inference—while cautioning about black-box critiques, the need for rigorous validation, and the risks of overclaiming super-resolution or universal transfer. The review argues for a balanced, physics-aware integration of DL with traditional methods to enhance information extraction, simulation fidelity, and autonomous scientific workflows across observational and theoretical astrophysics, while outlining concrete directions for future development and evaluation.
Abstract
Deep learning has generated diverse perspectives in astronomy, with ongoing discussions between proponents and skeptics motivating this review. We examine how neural networks complement classical statistics, extending our data analytical toolkit for modern surveys. Astronomy offers unique opportunities through encoding physical symmetries, conservation laws, and differential equations directly into architectures, creating models that generalize beyond training data. Yet challenges persist as unlabeled observations number in billions while confirmed examples with known properties remain scarce and expensive. This review demonstrates how deep learning incorporates domain knowledge through architectural design, with built-in assumptions guiding models toward physically meaningful solutions. We evaluate where these methods offer genuine advances versus claims requiring careful scrutiny. - Neural architectures overcome trade-offs between scalability, expressivity, and data efficiency by encoding physical symmetries and conservation laws into network structure, enabling learning from limited labeled data. - Simulation-based inference and anomaly detection extract information from complex, non-Gaussian distributions where analytical likelihoods fail, enabling field-level cosmological analysis and systematic discovery of rare phenomena. - Multi-scale neural modeling bridges resolution gaps in astronomical simulations, learning effective subgrid physics from expensive high-fidelity runs to enhance large-volume calculations where direct computation remains prohibitive. - Emerging paradigms-reinforcement learning for telescope operations, foundation models learning from minimal examples, and large language model agents for research automation-show promise though are still developing in astronomical applications.
