Neural Automated Writing Evaluation with Corrective Feedback

Izia Xiaoxiao Wang; Xihan Wu; Edith Coates; Min Zeng; Jiexin Kuang; Siliang Liu; Mengyang Qiu; Jungyeul Park

Neural Automated Writing Evaluation with Corrective Feedback

Izia Xiaoxiao Wang, Xihan Wu, Edith Coates, Min Zeng, Jiexin Kuang, Siliang Liu, Mengyang Qiu, Jungyeul Park

TL;DR

The paper addresses the need for efficient, immediate feedback in second language writing by integrating neural automated writing evaluation (AWE) with grammatical error correction (GEC). It combines a denoised ASAP/ASAP++ based AWE component with a seq2seq/ensemble GEC module trained on BEA2019 data to produce inline corrections and multi-rubric scores, enabling simulated examination practice. The system supports teacher involvement, cross-language expansion through multilingual rubrics, and planned perception studies to assess usability and fairness. It achieves competitive GEC performance (best BERT-based model with $F_{0.5}=65.29$ on BEA2019) and demonstrates a concrete path toward scalable, language-agnostic evaluation with corrective feedback that reduces instructor workload.

Abstract

The utilization of technology in second language learning and teaching has become ubiquitous. For the assessment of writing specifically, automated writing evaluation (AWE) and grammatical error correction (GEC) have become immensely popular and effective methods for enhancing writing proficiency and delivering instant and individualized feedback to learners. By leveraging the power of natural language processing (NLP) and machine learning algorithms, AWE and GEC systems have been developed separately to provide language learners with automated corrective feedback and more accurate and unbiased scoring that would otherwise be subject to examiners. In this paper, we propose an integrated system for automated writing evaluation with corrective feedback as a means of bridging the gap between AWE and GEC results for second language learners. This system enables language learners to simulate the essay writing tests: a student writes and submits an essay, and the system returns the assessment of the writing along with suggested grammatical error corrections. Given that automated scoring and grammatical correction are more efficient and cost-effective than human grading, this integrated system would also alleviate the burden of manually correcting innumerable essays.

Neural Automated Writing Evaluation with Corrective Feedback

TL;DR

on BEA2019) and demonstrates a concrete path toward scalable, language-agnostic evaluation with corrective feedback that reduces instructor workload.

Abstract

Paper Structure (14 sections, 3 figures, 2 tables)

This paper contains 14 sections, 3 figures, 2 tables.

Introduction
Previous Work
AWE
GEC
System Description
AWE
GEC
Discussion and Future Perspectives
Features and metrics
Multilingual aspects for GEC
Multilingual aspects for AWE and universal rubrics
Teacher's view
Perception problem
Conclusion

Figures (3)

Figure 1: System workflow of integrated AWE and GEC for language learners. A user flow can be applied to simulate examination situations: the language learners receive instant objective scoring results and corrective feedback.
Figure 2: Interface screenshot for AWE and GEC results: tokens in the faded red rectangles have been deleted; tokens in the green rectangles are corresponding grammatical corrections inserted by GEC; AWE scores are scaled to 0-100.
Figure 3: Future system workflow for the language learners and the instructors

Neural Automated Writing Evaluation with Corrective Feedback

TL;DR

Abstract

Neural Automated Writing Evaluation with Corrective Feedback

Authors

TL;DR

Abstract

Table of Contents

Figures (3)