Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training

Qingyan Guo; Rui Wang; Junliang Guo; Xu Tan; Jiang Bian; Yujiu Yang

Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training

Qingyan Guo, Rui Wang, Junliang Guo, Xu Tan, Jiang Bian, Yujiu Yang

TL;DR

Semantic-aware Permutation Training (SPT) is proposed, which addresses the root cause of the reversal curse of causal language models by segmenting the training sentences into semantic units with an assistant language model and permuting these units before feeding into the model.

Abstract

While large language models (LLMs) have achieved impressive performance across diverse tasks, recent studies showcase that causal LLMs suffer from the "reversal curse". It is a typical example that the model knows "A's father is B", but is unable to reason "B's child is A". This limitation poses a challenge to the advancement of artificial general intelligence (AGI), as it suggests a gap in the models' ability to comprehend and apply bidirectional reasoning. In this paper, we first conduct substantial evaluation and identify that the root cause of the reversal curse lies in the different word order between the training and inference stage, namely, the poor ability of causal language models to predict antecedent words within the training data. Accordingly, permutation on the training data is considered as a potential solution, since this can make the model predict antecedent words or tokens. However, previous permutation methods may disrupt complete phrases or entities, thereby posing challenges for the model to comprehend and learn from training data. To address this issue, we propose Semantic-aware Permutation Training (SPT), which addresses this issue by segmenting the training sentences into semantic units (i.e., entities or phrases) with an assistant language model and permuting these units before feeding into the model. Extensive experiments demonstrate that SPT effectively mitigates the reversal curse since the performance on reversed questions approximates that on the forward ones, and significantly advances the performance of existing works.

Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training

TL;DR

Abstract

Paper Structure (25 sections, 1 equation, 4 figures, 13 tables)

This paper contains 25 sections, 1 equation, 4 figures, 13 tables.

Introduction
Related Works
Reversal Curse
Permutation Training / Inference
Analysis on the Reversal Curse
Settings
Analysis on the Root Cause
Discussion on the Potential Solutions
Can the reversal curse be mitigated by a lightweight method?
Can the reversal curse be mitigated by permutations?
Semantic-Aware Permutation Training
Experiments
Settings
Results
Celebrity Relation
...and 10 more sections

Figures (4)

Figure 1: Semantic-aware permutation. An assistant model segments the original training sentence into several semantic chunks. Then, we re-order the chunks (including original, permuting or reversal) with a certain probability.
Figure 2: Demonstration used for celebrity relation dataset at inference (w/o CoT).
Figure 3: An example CoT demonstration used for Celebrity Relation dataset at inference for model M1 when tested on question Q1 (w/ CoT, corresponding to Table \ref{['tab:app-cot-demon']}).
Figure 4: Demonstration used for segmenting the sentence into smallest semantic units. The input examples are randomly sampled from Pile pile.

Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training

TL;DR

Abstract

Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training

Authors

TL;DR

Abstract

Table of Contents

Figures (4)