Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models

Xudong Han; Junjie Yang; Tianyang Wang; Ziqian Bi; Xinyuan Song; Junfeng Hao; Junhao Song

Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models

Xudong Han, Junjie Yang, Tianyang Wang, Ziqian Bi, Xinyuan Song, Junfeng Hao, Junhao Song

TL;DR

This survey provides a comprehensive overview of the full pipeline of instruction tuning techniques, encompassing data collection methodologies, full-parameter and parameter-efficient fine-tuning strategies, and evaluation protocols, and discusses promising directions for automated data generation, adaptive optimization, and robust evaluation frameworks.

Abstract

Instruction tuning is a pivotal technique for aligning large language models (LLMs) with human intentions, safety constraints, and domain-specific requirements. This survey provides a comprehensive overview of the full pipeline, encompassing (i) data collection methodologies, (ii) full-parameter and parameter-efficient fine-tuning strategies, and (iii) evaluation protocols. We categorized data construction into three major paradigms: expert annotation, distillation from larger models, and self-improvement mechanisms, each offering distinct trade-offs between quality, scalability, and resource cost. Fine-tuning techniques range from conventional supervised training to lightweight approaches, such as low-rank adaptation (LoRA) and prefix tuning, with a focus on computational efficiency and model reusability. We further examine the challenges of evaluating faithfulness, utility, and safety across multilingual and multimodal scenarios, highlighting the emergence of domain-specific benchmarks in healthcare, legal, and financial applications. Finally, we discuss promising directions for automated data generation, adaptive optimization, and robust evaluation frameworks, arguing that a closer integration of data, algorithms, and human feedback is essential for advancing instruction-tuned LLMs. This survey aims to serve as a practical reference for researchers and practitioners seeking to design LLMs that are both effective and reliably aligned with human intentions.

Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models

TL;DR

Abstract

Towards Alignment-Centric Paradigm: A Survey of Instruction Tuning in Large Language Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)