InsertGNN: Can Graph Neural Networks Outperform Humans in TOEFL Sentence Insertion Problem?

Fang Wu; Stan Z. Li

InsertGNN: Can Graph Neural Networks Outperform Humans in TOEFL Sentence Insertion Problem?

Fang Wu, Stan Z. Li

TL;DR

This work tackles the sentence insertion problem by reformulating SI as a directed sentence graph and introducing InsertGNN, a global-local fused Graph Neural Network designed to capture both cross-sentence coherence and local cueing. The model combines a Global Graph Attention Network for global context with a Local Graph Network that focuses on nearby sentence interactions, and fuses these signals to predict insertion points with a triple BCE loss. On TOEFL data, InsertGNN achieves up to about 71.5% accuracy, rivaling average human performance, and demonstrates cross-domain transfer to arXiv with competitive TOEFL results, outperforming several baselines and some LLM prompts. The results validate the graph-based SI paradigm and indicate strong potential for cross-domain generalization, while also highlighting dataset limitations and avenues for future work with larger datasets and more capable LLMs.

Abstract

The integration of sentences poses an intriguing challenge within the realm of NLP, but it has not garnered the attention it deserves. Existing methods that focus on sentence arrangement, textual consistency, and question answering are inadequate in addressing this issue. To bridge this gap, we introduce InsertGNN, which conceptualizes the problem as a graph and employs a hierarchical Graph Neural Network (GNN) to comprehend the interplay between sentences. Our approach was rigorously evaluated on a TOEFL dataset, and its efficacy was further validated on the expansive arXiv dataset using cross-domain learning. Thorough experimentation unequivocally establishes InsertGNN's superiority over all comparative benchmarks, achieving an impressive 70% accuracy, a performance on par with average human test scores.

InsertGNN: Can Graph Neural Networks Outperform Humans in TOEFL Sentence Insertion Problem?

TL;DR

Abstract

InsertGNN: Can Graph Neural Networks Outperform Humans in TOEFL Sentence Insertion Problem?

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)