Self-AMPLIFY: Improving Small Language Models with Self Post Hoc Explanations
Milan Bhan, Jean-Noel Vittaut, Nicolas Chesneau, Marie-Jeanne Lesot
TL;DR
The paper tackles improving small language models by enabling them to generate and use their own rationales without human annotations or auxiliary proxy models. It introduces Self-AMPLIFY, a 3-step pipeline that selects informative samples, derives rationales via post hoc explanations applied to the SLM itself, and composes augmented ICL prompts for improved reasoning. Across five reasoning-heavy datasets and multiple 7B-scale SLMs, Self-AMPLIFY—especially with Ph-CoT rationales—consistently outperforms standard prompting and proxy-based baselines, demonstrating the viability of fully automated self-improvement. The work underscores the potential of post hoc explanations as self-improvement signals for autoregressive SLMs and points to future directions in rationale faithfulness, efficiency, and broader validation across models and tasks.
Abstract
Incorporating natural language rationales in the prompt and In-Context Learning (ICL) have led to a significant improvement of Large Language Models (LLMs) performance. However, generating high-quality rationales require human-annotation or the use of auxiliary proxy models. In this work, we propose Self-AMPLIFY to automatically generate rationales from post hoc explanation methods applied to Small Language Models (SLMs) to improve their own performance. Self-AMPLIFY is a 3-step method that targets samples, generates rationales and builds a final prompt to leverage ICL. Self-AMPLIFY performance is evaluated on four SLMs and five datasets requiring strong reasoning abilities. Self-AMPLIFY achieves good results against competitors, leading to strong accuracy improvement. Self-AMPLIFY is the first method to apply post hoc explanation methods to autoregressive language models to generate rationales to improve their own performance in a fully automated manner.
