RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation

Yue Zhang¹ Leyang Cui² Enbo Zhao² Wei Bi² Shuming Shi²

¹Soochow University, Suzhou, China

²Tencent AI Lab

Introduction

Grammatical Error Correction (GEC) systems play a vital role in assisting people with their daily writing tasks. However, users may sometimes come across a GEC system that initially performs well but fails to correct errors when the inputs are slightly modified. To ensure an ideal user experience, a reliable GEC system should have the ability to provide consistent and accurate suggestions when encountering irrelevant context perturbations, which we refer to as context robustness. In this paper, we introduce RobustGEC, a benchmark designed to evaluate the context robustness of GEC systems. RobustGEC comprises 5,000 GEC cases, each with one original error-correct sentence pair and five variants carefully devised by human annotators. Utilizing RobustGEC, we reveal that state-of-the-art GEC systems still lack sufficient robustness against context perturbations. Moreover, we propose a simple yet effective method for remitting this issue.

If you are interested in our work, please cite:

@inproceedings{zhang2023robustgec,
  title={RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation},
  author={Zhang, Yue and Cui, Leyang and Zhao, Enbo and Bi, Wei and Shi, Shuming},
  booktitle={Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing},
  pages={16780--16793},
  year={2023}
}

How to Install

You can use the following commands to install the environment for RobustGEC:

conda create -n robustgec python==3.8
conda activate robustgec
cd ./errant
pip install --editable ./
python3 -m spacy download en_core_web_sm

Evaluation on RobustGEC

First, you can make predictions with your own GEC models on the input file in RobustGEC, such as ./benchmark/bea19/input.txt.

Then, you should convert the output file to the specific format for evaluation with convert.py.

O-S My answer is no .
O-T My answer is no .
O-P My answer is no .
A1-S My response is no .
A1-T My response is no .
A1-P My response is no .
A2-S My consequence is no .
A2-T My consequence is no .
A2-P My consequence is no .
A3-S My answer is equivocal .
A3-T My answer is equivocal .
A3-P My answer is equivocal .
A4-S My answer is ambiguous .
A4-T My answer is ambiguous .
A4-P My answer is ambiguous .
A5-S My answer may be no .
A5-T My answer may be no .
A5-P My answer may be no .

Finally, you can use errant_robustgec command for the final robustness evaluation:

errant_robustgec -file <file> -evallog <evallog>

Contact

If you have any questions, please feel free to email me or drop me an issue.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
benchmark		benchmark
errant		errant
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
convert.py		convert.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation

Introduction

How to Install

Evaluation on RobustGEC

Contact

About

Releases

Packages

Languages

License

HillZhang1999/RobustGEC

Folders and files

Latest commit

History

Repository files navigation

RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation

Introduction

How to Install

Evaluation on RobustGEC

Contact

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages