A Survey of Deep Learning for Scientific Discovery

Maithra Raghu; Eric Schmidt

A Survey of Deep Learning for Scientific Discovery

Maithra Raghu, Eric Schmidt

TL;DR

This survey maps the landscape of deep learning for scientific discovery, detailing models, tasks, and training paradigms across visuals, sequences, and graphs while emphasizing data efficiency and interpretability for scientific use. It presents a practical workflow and a taxonomy of methods, including self- and semi-supervised learning, transfer learning, and domain adaptation, complemented by extensive resources and tutorials to accelerate adoption. By synthesizing standard architectures with domain-specific considerations and providing implementation tips, the paper offers a concrete guide for scientists to select promising approaches and avoid common pitfalls. The work highlights community-driven assets, such as pretrained models and open-source tools, that enable rapid ramp-up and reproducible research in diverse scientific domains.

Abstract

Over the past few years, we have seen fundamental breakthroughs in core problems in machine learning, largely driven by advances in deep neural networks. At the same time, the amount of data collected in a wide array of scientific domains is dramatically increasing in both size and complexity. Taken together, this suggests many exciting opportunities for deep learning applications in scientific settings. But a significant challenge to this is simply knowing where to start. The sheer breadth and diversity of different deep learning techniques makes it difficult to determine what scientific problems might be most amenable to these methods, or which specific combination of methods might offer the most promising first approach. In this survey, we focus on addressing this central issue, providing an overview of many widely used deep learning models, spanning visual, sequential and graph structured data, associated tasks and different training methods, along with techniques to use deep learning with less data and better interpret these complex models --- two central considerations for many scientific use cases. We also include overviews of the full design process, implementation tips, and links to a plethora of tutorials, research summaries and open-sourced deep learning pipelines and pretrained models, developed by the community. We hope that this survey will help accelerate the use of deep learning across different scientific domains.

A Survey of Deep Learning for Scientific Discovery

TL;DR

Abstract

A Survey of Deep Learning for Scientific Discovery

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (14)