Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning

Xinyue Liu; Harshita Diddee; Daphne Ippolito

Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning

Xinyue Liu, Harshita Diddee, Daphne Ippolito

TL;DR

This paper uses the PEFT with Low-Rank Adaptation method to customize LLaMA-2 to ten different authors and shows that the generated text has lexical, syntactic, and surface alignment with the target author but struggles with content memorization.

Abstract

One-size-fits-all large language models (LLMs) are increasingly being used to help people with their writing. However, the style these models are trained to write in may not suit all users or use cases. LLMs would be more useful as writing assistants if their idiolect could be customized to match each user. In this paper, we explore whether parameter-efficient finetuning (PEFT) with Low-Rank Adaptation can effectively guide the style of LLM generations. We use this method to customize LLaMA-2 to ten different authors and show that the generated text has lexical, syntactic, and surface alignment with the target author but struggles with content memorization. Our findings highlight the potential of PEFT to support efficient, user-level customization of LLMs.

Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning

TL;DR

Abstract

Paper Structure (34 sections, 5 figures, 9 tables)

This paper contains 34 sections, 5 figures, 9 tables.

Introduction
Methods
StyleTunedLM
Baselines
Masking out Named Entities
Merging LoRA Modules
Experimental Design
Author Dataset
Evaluation Dataset
Evaluating Generation Style
Perplexity
Style-embedding Alignment
Linguistic Alignment
Experiment Results
Perplexity
...and 19 more sections

Figures (5)

Figure 1: Perplexity (PPL) comparison between pre-trained and fine-tuned models across different authors. The number on top of each set of bars indicates the reduction percentage in PPL after fine-tuning. Finetuned models achieve lower scores across all authors.
Figure 2: Average cosine similarity of baselines and our method between generations and average embeddings across all authors. StyleTunedLM archives the highest average similarity.
Figure 3: t-SNE on training, test, and generation of our method with pairwise loss.
Figure 4: Cosine similarity of baselines and our method between generation and the average embedding for each author. StyleTunedLM archives the highest similarities with most authors.
Figure 5: Confusion matrices of baselines and our method. StyleTunedLM achieves the highest classification accuracies across all authors.

Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning

TL;DR

Abstract

Customizing Large Language Model Generation Style using Parameter-Efficient Finetuning

Authors

TL;DR

Abstract

Table of Contents

Figures (5)