Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies
Benjue Weng
TL;DR
This paper surveys how Transformer-based large language models are fine-tuned across paradigms and tasks, from in-context learning and chain-of-thought to agent-based and retrieval-augmented methods. It provides a comprehensive taxonomy of fine-tuning strategies, emphasizing parameter-efficient approaches such as LoRA, Prefix-, and Adapter-tuning, along with instruction- and alignment-tuning (RLHF, DPO, NLHF). Empirical results on six text-classification benchmarks highlight the competitiveness of PEFT techniques, particularly LoRA, while showing the impact of model size and data regime on performance. The work offers practical guidance for deploying scalable, efficient fine-tuning in industry and research, outlining challenges and future directions in this rapidly evolving landscape.
Abstract
With the surge of ChatGPT, the use of large models has increased significantly, rising rapidly to prominence across the industry and the internet. This paper is a comprehensive review of fine-tuning methods for large models. It investigates the latest technological advancements and the application of advanced methods in task-adaptive fine-tuning, domain-adaptive fine-tuning, few-shot learning, knowledge distillation, multi-task learning, parameter-efficient fine-tuning, and dynamic fine-tuning.
