What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen, Gaël Varoquaux
TL;DR
<3-5 sentence high-level summary>This survey investigates the role of Small Models (SMs) in the era of Large Language Models (LLMs) by analyzing how SMs can collaborate with LLMs to improve efficiency, robustness, and interpretability, and how LLMs can, in turn, enhance SMs via distillation and data synthesis. It presents a dual framework of Collaboration and Competition, detailing concrete methods in data curation, efficient inference, augmented reasoning, distillation, and synthetic data generation, as well as deployment considerations across computation-, task-, and interpretability-constrained environments. The work emphasizes practical strategies—such as data-quality focused curation, model cascading/routing, Retrieval Augmented Generation, and weak-to-strong supervision—that enable resource-constrained practitioners to deploy effective AI systems. By outlining actionable directions and pitfalls, it highlights the ecological niche of SMs as cost-effective, interpretable allies that synergize with LLMs to enable scalable, responsible AI deployments.
Abstract
Large Language Models (LLMs) have made significant progress in advancing artificial general intelligence (AGI), leading to the development of increasingly large models such as GPT-4 and LLaMA-405B. However, scaling up model sizes results in exponentially higher computational costs and energy consumption, making these models impractical for academic researchers and businesses with limited resources. At the same time, Small Models (SMs) are frequently used in practical settings, although their significance is currently underestimated. This raises important questions about the role of small models in the era of LLMs, a topic that has received limited attention in prior research. In this work, we systematically examine the relationship between LLMs and SMs from two key perspectives: Collaboration and Competition. We hope this survey provides valuable insights for practitioners, fostering a deeper understanding of the contribution of small models and promoting more efficient use of computational resources. The code is available at https://github.com/tigerchen52/role_of_small_models
