Enhancing SPARQL Generation by Triplet-order-sensitive Pre-training

Chang Su; Jiexing Qi; He Yan; Kai Zou; Zhouhan Lin

TL;DR

This work proposes an additional pre-training stage that pairs a new objective, Triplet Order Correction (TOC), with the commonly used Masked Language Modeling (MLM) to enhance the model's sensitivity to triplet order and SPARQL syntax.

Abstract

Semantic parsing that translates natural language queries into SPARQL is of great importance for Knowledge Graph Question Answering (KGQA) systems. Although pre-trained language models such as T5 have achieved significant success on the Text-to-SPARQL task, their outputs still exhibit notable errors specific to the SPARQL language, such as triplet flips. To address this challenge and further improve performance, we propose an additional pre-training stage that pairs a new objective, Triplet Order Correction (TOC), with the commonly used Masked Language Modeling (MLM) to enhance the model's sensitivity to triplet order and SPARQL syntax. Our method achieves state-of-the-art performance on three widely used benchmarks.
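
To make the "triplet flip" error concrete, the sketch below swaps the subject and object of a triple pattern and builds a (corrupted, original) query pair of the kind a correction objective like TOC could train on. It is only an illustration under stated assumptions: the abstract does not say how TOC inputs are constructed, so the random-flip scheme, the `flip_prob` parameter, the helper names, and the `ex:` identifiers are all hypothetical and not the authors' procedure.

```python
import random
from typing import List, Tuple

# A triple pattern as (subject, predicate, object).
Triple = Tuple[str, str, str]


def flip(triple: Triple) -> Triple:
    """Swap subject and object: the 'triplet flip' error named in the abstract."""
    s, p, o = triple
    return (o, p, s)


def render(select_var: str, triples: List[Triple]) -> str:
    """Serialize triple patterns into a minimal SELECT query string."""
    body = " . ".join(" ".join(t) for t in triples)
    return f"SELECT {select_var} WHERE {{ {body} }}"


def make_toc_pair(
    select_var: str,
    triples: List[Triple],
    flip_prob: float = 0.5,  # hypothetical corruption rate, not from the paper
    seed: int = 0,
) -> Tuple[str, str]:
    """Return (corrupted_query, original_query).

    Assumption: each triple is flipped independently with probability
    flip_prob, and the model would be trained to map the corrupted
    query back to the original one.
    """
    rng = random.Random(seed)
    corrupted = [flip(t) if rng.random() < flip_prob else t for t in triples]
    return render(select_var, corrupted), render(select_var, triples)


if __name__ == "__main__":
    # Hypothetical identifiers: "who is the author of ex:Dune?"
    source, target = make_toc_pair(
        "?x", [("ex:Dune", "ex:author", "?x")], flip_prob=1.0
    )
    print(source)  # subject and object swapped
    print(target)  # original order
```

Both strings are syntactically valid SPARQL, but the flipped one asks a different question (which things have `ex:Dune` as their author), which is why a model has to be sensitive to triplet order rather than syntax alone.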

Paper Structure (13 sections, 1 equation, 2 figures, 3 tables)

Introduction
Preliminaries
METHOD
Triplet-order-sensitive Pre-training
Fine-tuning on Downstream Task
Verbalizing IRIs
EXPERIMENT
Basic settings
Main Results
Error Analysis
Ablation Study
Tough Scenario
CONCLUSION

Figures (2)

Figure 1: The overview of our approach. The TosT5 model first undergoes the triplet-order-sensitive pre-training stage and then is fine-tuned on the downstream task.
Figure 2: Error analysis of the models' predictions. "tfe" is short for "triplet-flip error", and "te" is short for "triplet error".
