Challenging Assumptions in Learning Generic Text Style Embeddings
Phil Ostheimer, Marius Kloft, Sophie Fellenz
TL;DR
This work investigates whether generic, sentence‑level text style embeddings can be learned by assuming that low‑level stylistic changes compose high‑level styles. The authors fine‑tune a general text encoder on StylePTB using contrastive and cross‑entropy objectives to produce style‑discriminative representations and evaluate transfer to multiple high‑level style datasets. Results show that cross‑entropy fine‑tuning yields selective improvements, while contrastive learning often degrades performance, challenging the central assumption and indicating that current contrastive setups may misalign with style learning. The study highlights the need for more nuanced training signals and dataset diversity when pursuing generalizable style representations with practical implications for style classification and transfer tasks.
Abstract
Recent advancements in language representation learning primarily emphasize language modeling for deriving meaningful representations, often neglecting style-specific considerations. This study addresses this gap by creating generic, sentence-level style embeddings crucial for style-centric tasks. Our approach is grounded on the premise that low-level text style changes can compose any high-level style. We hypothesize that applying this concept to representation learning enables the development of versatile text style embeddings. By fine-tuning a general-purpose text encoder using contrastive learning and standard cross-entropy loss, we aim to capture these low-level style shifts, anticipating that they offer insights applicable to high-level text styles. The outcomes prompt us to reconsider the underlying assumptions as the results do not always show that the learned style representations capture high-level text styles.
