Table of Contents
Fetching ...

Tuning-Free Personalized Alignment via Trial-Error-Explain In-Context Learning

Hyundong Cho, Karishma Sharma, Nicolaas Jedema, Leonardo F. R. Ribeiro, Alessandro Moschitti, Ravi Krishnan, Jonathan May

TL;DR

The paper tackles the problem that large language models generalize toward a generic, collective voice, limiting personalization for individual users. It proposes Trial-Error-Explain In-Context Learning (TICL), a tuning-free method that personalizes text generation by expanding the in-context learning prompt with model-generated negative samples and explanations, avoiding any parameter updates. Empirical results across two author-style datasets show TICL achieves high win rates against the prior state-of-the-art and competitive baselines, with explanations contributing the largest gains. The approach front-loads computation to construct a user-specific prompt, offering a practical path to personalization in black-box LLM settings, albeit with increased test-time cost and dependence on long-context understanding.

Abstract

Language models are aligned to the collective voice of many, resulting in generic outputs that do not align with specific users' styles. In this work, we present Trial-Error-Explain In-Context Learning (TICL), a tuning-free method that personalizes language models for text generation tasks with fewer than 10 examples per user. TICL iteratively expands an in-context learning prompt via a trial-error-explain process, adding model-generated negative samples and explanations that provide fine-grained guidance towards a specific user's style. TICL achieves favorable win rates on pairwise comparisons with LLM-as-a-judge up to 91.5% against the previous state-of-the-art and outperforms competitive tuning-free baselines for personalized alignment tasks of writing emails, essays and news articles. Both lexical and qualitative analyses show that the negative samples and explanations enable language models to learn stylistic context more effectively and overcome the bias towards structural and formal phrases observed in their zero-shot outputs. By front-loading inference compute to create a user-specific in-context learning prompt that does not require extra generation steps at test time, TICL presents a novel yet simple approach for personalized alignment.

Tuning-Free Personalized Alignment via Trial-Error-Explain In-Context Learning

TL;DR

The paper tackles the problem that large language models generalize toward a generic, collective voice, limiting personalization for individual users. It proposes Trial-Error-Explain In-Context Learning (TICL), a tuning-free method that personalizes text generation by expanding the in-context learning prompt with model-generated negative samples and explanations, avoiding any parameter updates. Empirical results across two author-style datasets show TICL achieves high win rates against the prior state-of-the-art and competitive baselines, with explanations contributing the largest gains. The approach front-loads computation to construct a user-specific prompt, offering a practical path to personalization in black-box LLM settings, albeit with increased test-time cost and dependence on long-context understanding.

Abstract

Language models are aligned to the collective voice of many, resulting in generic outputs that do not align with specific users' styles. In this work, we present Trial-Error-Explain In-Context Learning (TICL), a tuning-free method that personalizes language models for text generation tasks with fewer than 10 examples per user. TICL iteratively expands an in-context learning prompt via a trial-error-explain process, adding model-generated negative samples and explanations that provide fine-grained guidance towards a specific user's style. TICL achieves favorable win rates on pairwise comparisons with LLM-as-a-judge up to 91.5% against the previous state-of-the-art and outperforms competitive tuning-free baselines for personalized alignment tasks of writing emails, essays and news articles. Both lexical and qualitative analyses show that the negative samples and explanations enable language models to learn stylistic context more effectively and overcome the bias towards structural and formal phrases observed in their zero-shot outputs. By front-loading inference compute to create a user-specific in-context learning prompt that does not require extra generation steps at test time, TICL presents a novel yet simple approach for personalized alignment.

Paper Structure

This paper contains 44 sections, 5 equations, 11 figures, 11 tables, 1 algorithm.

Figures (11)

  • Figure 1: Top: The author's text contains rhetorical questions and more colloquial and informal phrases. Middle: Few-shot in-context learning is insufficient for adapting to an author's style and model biases for formal phrases persist.Bottom:TICL helps overcome model biases and applies the author's style more consistently. Model outputs are with gpt-4o-2024-0806
  • Figure 2: TICL methodology overview. Instead of SFT, TICL starts with a few-shot ICL for behavior cloning. Then, TICL repeatedly generates an output for a task and a corresponding explanation that critiques the stylistic difference between the output and the user's text. If the output is considered not stylistically consistent, it and its explanation are added to the prompt.
  • Figure 3: Simplified illustration of our evaluation setup for pairwise comparison of stylistic similarity using LLM-as-a-judge. The full prompt is show in \ref{['fig:eval_template']}.
  • Figure 4: Prompt template $\mathcal{E}$ for generating explanations on the difference between the reference output and the generated output.
  • Figure 5: Prompt template $\mathcal{P}$ for the few-shot in-context learning baseline.
  • ...and 6 more figures