Beyond Error Categories: A Contextual Approach of Evaluating Emerging Spell and Grammar Checkers

Þórunn Arnardóttir, Svanhvít Lilja Ingólfsdóttir, Haukur Barri Símonarson, Hafsteinn Einarsson, Anton Karl Ingason, Vilhjálmur Þorsteinsson


Abstract
Automatic spell and grammar checking can be done using various system architectures, and large language models have recently been used to solve the task with promising results. Here we describe a new method of creating test data to measure the performance of spell and grammar checkers, including large language models. Three types of test data represent different approaches to evaluation, from basic error detection to error correction with natural language explanations of the corrections made and error severity scores, which is the main novelty of this approach. These additions are especially useful when evaluating large language models. We present a spell and grammar checking test set for Icelandic in which the described approach is applied. The data consists of whole texts instead of discrete sentences, which facilitates evaluating context awareness of models. The resulting test set can be used to compare different spell and grammar checkers and is published under permissive licenses.
Anthology ID:
2024.sigul-1.6
Volume:
Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024
Month:
May
Year:
2024
Address:
Torino, Italia
Editors:
Maite Melero, Sakriani Sakti, Claudia Soria
Venues:
SIGUL | WS
SIG:
Publisher:
ELRA and ICCL
Note:
Pages:
45–52
Language:
URL:
https://aclanthology.org/2024.sigul-1.6
DOI:
Bibkey:
Cite (ACL):
Þórunn Arnardóttir, Svanhvít Lilja Ingólfsdóttir, Haukur Barri Símonarson, Hafsteinn Einarsson, Anton Karl Ingason, and Vilhjálmur Þorsteinsson. 2024. Beyond Error Categories: A Contextual Approach of Evaluating Emerging Spell and Grammar Checkers. In Proceedings of the 3rd Annual Meeting of the Special Interest Group on Under-resourced Languages @ LREC-COLING 2024, pages 45–52, Torino, Italia. ELRA and ICCL.
Cite (Informal):
Beyond Error Categories: A Contextual Approach of Evaluating Emerging Spell and Grammar Checkers (Arnardóttir et al., SIGUL-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.sigul-1.6.pdf