Towards Human Understanding of Paraphrase Types in Large Language Models
Dominik Meier, Jan Philip Wahle, Terry Ruas, Bela Gipp
TL;DR
The paper addresses the challenge of interpreting paraphrase variation by introducing Atomic Paraphrase Types (APTs) and two human-annotated datasets (APTYBase and APTYRanked) to capture fine-grained linguistic changes. It combines generation of paraphrase candidates using ChatGPT across multiple prompting techniques with detailed human annotations and preference rankings, and evaluates LLama 7B variants trained with DPO on the ranked data. Findings show that simple APTs are approachable for current LLMs, while complex structural changes remain difficult, though prompting strategy and human preferences reveal important differences between generation success and perceived quality. The work provides datasets and insights to steer RLHF/DPO-driven improvements and to benchmark models on explicit linguistic capabilities, with broader implications for robust paraphrase generation and evaluation.
Abstract
Paraphrases represent a human's intuitive ability to understand expressions presented in various different ways. Current paraphrase evaluations of language models primarily use binary approaches, offering limited interpretability of specific text changes. Atomic paraphrase types (APT) decompose paraphrases into different linguistic changes and offer a granular view of the flexibility in linguistic expression (e.g., a shift in syntax or vocabulary used). In this study, we assess the human preferences towards ChatGPT in generating English paraphrases with ten APTs and five prompting techniques. We introduce APTY (Atomic Paraphrase TYpes), a dataset of 800 sentence-level and word-level annotations by 15 annotators. The dataset also provides a human preference ranking of paraphrases with different types that can be used to fine-tune models with RLHF and DPO methods. Our results reveal that ChatGPT and a DPO-trained LLama 7B model can generate simple APTs, such as additions and deletions, but struggle with complex structures (e.g., subordination changes). This study contributes to understanding which aspects of paraphrasing language models have already succeeded at understanding and what remains elusive. In addition, we show how our curated datasets can be used to develop language models with specific linguistic capabilities.
