Are Emergent Abilities in Large Language Models just In-Context Learning?
Sheng Lu, Irina Bigoulaeva, Rachneet Sachdeva, Harish Tayyar Madabushi, Iryna Gurevych
TL;DR
This paper scrutinizes so-called emergent abilities in large language models and argues that, once in-context learning ($ICL$) is removed, these abilities largely disappear. Through over 1000 experiments across 20 models, 22 tasks, and multiple prompting settings, the authors show that instruction-tuned performance largely stems from an implicit $ICL$ mechanism rather than intrinsic functional linguistic capabilities. They demonstrate a substantial overlap between tasks solvable by non-instruction-tuned models in few-shot settings and instruction-tuned models in zero-shot settings, supporting the view that instruction-tuning enables $ICL$-like behavior. The work provides a theoretical framework for interpreting LLM capabilities, emphasizes safer and more efficient usage, and challenges the notion that scaling alone yields genuinely new abilities beyond prompting dynamics and memory.
Abstract
Large language models, comprising billions of parameters and pre-trained on extensive web-scale corpora, have been claimed to acquire certain capabilities without having been specifically trained on them. These capabilities, referred to as "emergent abilities," have been a driving force in discussions regarding the potentials and risks of language models. A key challenge in evaluating emergent abilities is that they are confounded by model competencies that arise through alternative prompting techniques, including in-context learning, which is the ability of models to complete a task based on a few examples. We present a novel theory that explains emergent abilities, taking into account their potential confounding factors, and rigorously substantiate this theory through over 1000 experiments. Our findings suggest that purported emergent abilities are not truly emergent, but result from a combination of in-context learning, model memory, and linguistic knowledge. Our work is a foundational step in explaining language model performance, providing a template for their efficient use and clarifying the paradox of their ability to excel in some instances while faltering in others. Thus, we demonstrate that their capabilities should not be overestimated.
