NEFTune: Noisy Embeddings Improve Instruction Finetuning
Neel Jain, Ping-yeh Chiang, Yuxin Wen, John Kirchenbauer, Hong-Min Chu, Gowthami Somepalli, Brian R. Bartoldson, Bhavya Kailkhura, Avi Schwarzschild, Aniruddha Saha, Micah Goldblum, Jonas Geiping, Tom Goldstein
TL;DR
NEFTune presents a simple yet powerful regularization technique for instruction fine-tuning by adding random embedding noise during training. This non-adversarial perturbation substantially improves conversational quality, achieving large gains on AlpacaEval and consistent improvements across diverse instruction datasets, including Evol-Instruct, ShareGPT, and OpenPlatypus; it also benefits RLHF-refined chat models and remains effective under QLORA. Analyses indicate reduced overfitting and longer, more detailed outputs without sacrificing text diversity or core capabilities. Human studies corroborate increased perceived quality, underscoring NEFTune's practical impact as a low-cost enhancement for instruction-tuning pipelines.
Abstract
We show that language model finetuning can be improved, sometimes dramatically, with a simple augmentation. NEFTune adds noise to the embedding vectors during training. Standard finetuning of LLaMA-2-7B using Alpaca achieves 29.79% on AlpacaEval, which rises to 64.69% using noisy embeddings. NEFTune also improves over strong baselines on modern instruction datasets. Models trained with Evol-Instruct see a 10% improvement, with ShareGPT an 8% improvement, and with OpenPlatypus an 8% improvement. Even powerful models further refined with RLHF such as LLaMA-2-Chat benefit from additional training with NEFTune.
