Improving In-Context Learning with Reasoning Distillation
Nafis Sadeq, Xin Xu, Zhouhang Xie, Julian McAuley, Byungkyu Kang, Prarit Lamba, Xiang Gao
TL;DR
This work tackles the challenge of inductive reasoning in in-context learning by proposing ReDis, a black-box reasoning distillation pipeline that uses a teacher model to generate candidate hypotheses, evaluates their fitness through noisy rule-following, and then applies supervised fine-tuning and preference alignment to produce a student capable of efficient hypothesis search. The method achieves substantial performance gains across four diverse inductive-reasoning tasks (List Function, 1D ARC, ACRE, MiniSCAN) on multiple backbones, and even matches or surpasses GPT-4o in some settings, while significantly reducing inference-time token costs. A key contribution is the ORPO alignment strategy, which optimizes a combined loss $\mathcal{L}=\mathcal{L}_{\mathrm{sft}}+\lambda\mathcal{L}_{\mathrm{or}}$ to favor high-quality rule generation within a smaller search space. The results demonstrate that distilling inductive reasoning rules, rather than just inputs and outputs, yields superior generalization to novel inputs and improves efficiency, suggesting practical benefits for deploying open-weight models in complex reasoning tasks. The work also highlights the importance of evaluating hypothesis quality via demonstrated rule satisfaction and offers a framework for data augmentation, SFT, and alignment that could extend to other reasoning-intensive domains.
Abstract
Language models rely on semantic priors to perform in-context learning, which leads to poor performance on tasks involving inductive reasoning. Instruction-tuning methods based on imitation learning can superficially enhance the in-context learning performance of language models, but they often fail to improve the model's understanding of the underlying rules that connect inputs and outputs in few-shot demonstrations. We propose ReDis, a reasoning distillation technique designed to improve the inductive reasoning capabilities of language models. Through a careful combination of data augmentation, filtering, supervised fine-tuning, and alignment, ReDis achieves significant performance improvements across a diverse range of tasks, including 1D-ARC, List Function, ACRE, and MiniSCAN. Experiments on three language model backbones show that ReDis outperforms equivalent few-shot prompting baselines across all tasks and even surpasses the teacher model, GPT-4o, in some cases. ReDis, based on the LLaMA-3 backbone, achieves relative improvements of 23.2%, 2.8%, and 66.6% over GPT-4o on 1D-ARC, ACRE, and MiniSCAN, respectively, within a similar hypothesis search space. The code, dataset, and model checkpoints will be made available at https://github.com/NafisSadeq/reasoning-distillation.git.
