Auto-ICL: In-Context Learning without Human Supervision
Jinghan Yang, Shuming Ma, Furu Wei
TL;DR
This work tackles the bottleneck of human-crafted in-context learning by introducing Automatic In-Context Learning (Auto-ICL), where the model itself generates demonstrations and instructions to solve problems. It formalizes two generation modes (retrieving vs generating) and two forms of instructional content (1-to-1 and N-to-1), enabling flexible construction of problem-solving context. Empirical results across diverse reasoning domains and multiple models show that Auto-ICL-generated contexts can outperform human-annotated contexts and existing self-generation approaches like Zero-CoT and Auto-CoT, with nuanced trade-offs between efficiency and accuracy depending on the retrieval setup. The framework reduces reliance on human labeling, expands applicability to tasks challenging for humans, and highlights the value of combining demonstrations with high-level instructions for robust reasoning across datasets such as Theory of Mind, symbolic reasoning, arithmetic, and others.
Abstract
With in-context learning ability, the performance of large language models can be significantly boosted when provided with appropriate context. However, existing in-context learning methods mainly rely on human-provided contexts, such as labeled examples and explicit instructions. Writing context by humans is labor-intensive on various tasks and limits the model to tasks manageable by humans. To overcome these limitations, we propose Automatic In-Context Learning framework that enables the model to autonomously generate examples and instructions for problem-solving. With experiments across various models and datasets, results show that model-generated contexts outperform human-annotated contexts, including Few-Shot and Few-Shot-CoT methods, and surpass existing self-generated context methods like Zero-CoT and Auto-CoT.
