Large Language Models for Scientific Idea Generation: A Creativity-Centered Survey
Fatemeh Shahhosseini, Arash Marioriyad, Ali Momen, Mahdieh Soleymani Baghshah, Mohammad Hossein Rohban, Shaghayegh Haghjooy Javanmard
TL;DR
This paper surveys Large Language Models (LLMs) for scientific idea generation, emphasizing the dual goals of novelty and valueness. It organizes methods into five families—knowledge augmentation, prompt-driven creativity, inference-time search, collaborative multi-agent systems, and parameter-level adaptations—and interprets them through Boden's combinatorial/exploratory/transformational creativity and Rhodes' 4Ps (Person, Process, Press, Product). The work highlights how grounding, prompting, search-time reasoning, agent collaboration, and training-time alignment collectively advance creative scientific ideation, while also identifying evaluation bottlenecks and gaps in standard benchmarks. It argues that current progress largely achieves combinatorial and exploratory creativity, with transformational shifts still elusive, and calls for open-ended benchmarks, richer simulators, and architectural innovations to push toward reliable, transformative AI-assisted discovery.
Abstract
Scientific idea generation lies at the heart of scientific discovery and has driven human progress-whether by solving unsolved problems or proposing novel hypotheses to explain unknown phenomena. Unlike standard scientific reasoning or general creative generation, idea generation in science is a multi-objective and open-ended task, where the novelty of a contribution is as essential as its empirical soundness. Large language models (LLMs) have recently emerged as promising generators of scientific ideas, capable of producing coherent and factual outputs with surprising intuition and acceptable reasoning, yet their creative capacity remains inconsistent and poorly understood. This survey provides a structured synthesis of methods for LLM-driven scientific ideation, examining how different approaches balance creativity with scientific soundness. We categorize existing methods into five complementary families: External knowledge augmentation, Prompt-based distributional steering, Inference-time scaling, Multi-agent collaboration, and Parameter-level adaptation. To interpret their contributions, we employ two complementary frameworks: Boden's taxonomy of Combinatorial, Exploratory and Transformational creativity to characterize the level of ideas each family expected to generate, and Rhodes' 4Ps framework-Person, Process, Press, and Product-to locate the aspect or source of creativity that each method emphasizes. By aligning methodological advances with creativity frameworks, this survey clarifies the state of the field and outlines key directions toward reliable, systematic, and transformative applications of LLMs in scientific discovery.
