Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning

Yongkang Liu; Xingle Xu; Ercong Nie; Zijing Wang; Shi Feng; Daling Wang; Qian Li; Hinrich Schütze

Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning

Yongkang Liu, Xingle Xu, Ercong Nie, Zijing Wang, Shi Feng, Daling Wang, Qian Li, Hinrich Schütze

TL;DR

This work theoretically and empirically analyzes the trade-offs between Parameter-Efficient Fine-Tuning (PEFT) and Full Fine-Tuning (FFT). It proves PEFT occupies a strict, low-dimensional subspace of the FFT parameter space and derives an upper bound on its representational capacity, along with a diminishing return on additional parameters, heightened sensitivity to perturbations, and smaller data-driven gains. Across 15 datasets and 11 adversarial test sets, FFT consistently outperforms PEFT on complex tasks and exhibits stronger robustness, while PEFT can match or exceed FFT on simpler, data-scarce scenarios. The findings suggest FFT as the generally more reliable choice when resources permit, with PEFT offering benefits primarily in low-data or resource-constrained settings, and they highlight theoretical avenues for improving PEFT reliability and performance.

Abstract

Parameter-Efficient Fine-Tuning (PEFT) methods achieve performance comparable to Full Fine-Tuning (FFT) while requiring significantly fewer computing resources, making it the go-to choice for researchers. We find that although PEFT can achieve competitive results on some benchmarks, its performance falls short of FFT in complex tasks, such as reasoning and instruction-based fine-tuning. In this paper, we compare the characteristics of PEFT and FFT in terms of representational capacity and robustness based on optimization theory. We theoretically demonstrate that PEFT is a strict subset of FFT. By providing theoretical upper bounds for PEFT, we show that the limited parameter space constrains the model's representational ability, making it more susceptible to perturbations. Experiments on 15 datasets encompassing classification, generation, reasoning, instruction fine-tuning tasks and 11 adversarial test sets validate our theories. We hope that these results spark further research beyond the realms of well established PEFT. The source code is in the anonymous Github repository\footnote{https://github.com/misonsky/PEFTEval}.

Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning

TL;DR

Abstract

Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (9)