Meta-Tuning LLMs to Leverage Lexical Knowledge for Generalizable Language Style Understanding
Ruohao Guo, Wei Xu, Alan Ritter
TL;DR
This work targets cross-style generalization in language style understanding by introducing lexicon-based instructions and meta-tuning of LLMs on source styles. The method leverages style lexicons to elicit latent stylistic knowledge, with class randomization to prevent memorization and promote genuine lexicon use. A benchmark of 13 established styles plus 63 LL-generated styles demonstrates consistent zero-shot gains across multiple models, and the approach improves few-shot stability by reducing sensitivity to example selection. The combined use of lexicons and meta-learning advances the practical ability to analyze and generate stylistic text, with potential applications in toxicity detection, readability assessment, and nuanced language analysis, while acknowledging limitations around lexicon quality, language scope, and dataset biases.
Abstract
Language style is often used by writers to convey their intentions, identities, and mastery of language. In this paper, we show that current large language models struggle to capture some language styles without fine-tuning. To address this challenge, we investigate whether LLMs can be meta-trained based on representative lexicons to recognize new styles they have not been fine-tuned on. Experiments on 13 established style classification tasks, as well as 63 novel tasks generated using LLMs, demonstrate that meta-training with style lexicons consistently improves zero-shot transfer across styles. We release the code and data at http://github.com/octaviaguo/Style-LLM .
