Table of Contents
Fetching ...

Reaching Human-level Performance in Automatic Grammatical Error Correction: An Empirical Study

Tao Ge, Furu Wei, Ming Zhou

TL;DR

This study tackles data scarcity and single-pass limitations in grammatical error correction by introducing fluency boost learning and fluency boost inference. It presents back-boost, self-boost, and dual-boost strategies to augment training with diverse, fluent–disfluent sentence pairs, and extends to large-scale native data. It further proposes multi-round and round-way inference using left-to-right and right-to-left decoders to incrementally improve fluency and recall. The resulting convolutional seq2seq GEC system achieves state-of-the-art results and reaches human-level performance on CoNLL-2014 and JFLEG, demonstrating the practical impact of data-augmented training and iterative editing for GEC.

Abstract

Neural sequence-to-sequence (seq2seq) approaches have proven to be successful in grammatical error correction (GEC). Based on the seq2seq framework, we propose a novel fluency boost learning and inference mechanism. Fluency boosting learning generates diverse error-corrected sentence pairs during training, enabling the error correction model to learn how to improve a sentence's fluency from more instances, while fluency boosting inference allows the model to correct a sentence incrementally with multiple inference steps. Combining fluency boost learning and inference with convolutional seq2seq models, our approach achieves the state-of-the-art performance: 75.72 (F_{0.5}) on CoNLL-2014 10 annotation dataset and 62.42 (GLEU) on JFLEG test set respectively, becoming the first GEC system that reaches human-level performance (72.58 for CoNLL and 62.37 for JFLEG) on both of the benchmarks.

Reaching Human-level Performance in Automatic Grammatical Error Correction: An Empirical Study

TL;DR

This study tackles data scarcity and single-pass limitations in grammatical error correction by introducing fluency boost learning and fluency boost inference. It presents back-boost, self-boost, and dual-boost strategies to augment training with diverse, fluent–disfluent sentence pairs, and extends to large-scale native data. It further proposes multi-round and round-way inference using left-to-right and right-to-left decoders to incrementally improve fluency and recall. The resulting convolutional seq2seq GEC system achieves state-of-the-art results and reaches human-level performance on CoNLL-2014 and JFLEG, demonstrating the practical impact of data-augmented training and iterative editing for GEC.

Abstract

Neural sequence-to-sequence (seq2seq) approaches have proven to be successful in grammatical error correction (GEC). Based on the seq2seq framework, we propose a novel fluency boost learning and inference mechanism. Fluency boosting learning generates diverse error-corrected sentence pairs during training, enabling the error correction model to learn how to improve a sentence's fluency from more instances, while fluency boosting inference allows the model to correct a sentence incrementally with multiple inference steps. Combining fluency boost learning and inference with convolutional seq2seq models, our approach achieves the state-of-the-art performance: 75.72 (F_{0.5}) on CoNLL-2014 10 annotation dataset and 62.42 (GLEU) on JFLEG test set respectively, becoming the first GEC system that reaches human-level performance (72.58 for CoNLL and 62.37 for JFLEG) on both of the benchmarks.

Paper Structure

This paper contains 16 sections, 8 equations, 4 figures, 5 tables, 3 algorithms.

Figures (4)

  • Figure 1: (a) an error-corrected sentence pair; (b) if the sentence becomes slightly different, the model fails to correct it perfectly; (c) single-round seq2seq inference cannot perfectly correct the sentence, but multi-round inference can.
  • Figure 2: Fluency boost learning and inference: (a) given a training instance (i.e., an error-corrected sentence pair), fluency boost learning establishes multiple fluency boost sentence pairs from the seq2seq's n-best outputs during training. The fluency boost sentence pairs will be used as training instances in subsequent training epochs, which helps expand the training set and accordingly benefits model learning; (b) fluency boost inference allows an error correction model to correct a sentence incrementally through multi-round seq2seq inference as long as its fluency can be improved.
  • Figure 3: Three fluency boost learning strategies: (a) back-boost, (b) self-boost, (c) dual-boost; all of them generate fluency boost sentence pairs (the pairs in the dashed boxes) to help model learning during training. The numbers in this figure are fluency scores of their corresponding sentences.
  • Figure 4: Round-way error correction: some types of errors (e.g., articles) are easier to be corrected by a right-to-left seq2seq model, while some (e.g., subject verb agreement) are more likely to be corrected by a left-to-right seq2seq model. Round-way error correction makes left-to-right and right-to-left seq2seq models well complement each other, enabling it to correct more grammatical errors than an individual model.