Transformer-Based Real-Word Spelling Error Feedback with Configurable Confusion Sets

Torsten Zesch, Dominic Gardner, Marie Bexte


Abstract
Real-word spelling errors (RWSEs) pose special challenges for detection methods, as they ‘hide’ in the form of another existing word and in many cases even fit in syntactically. We present a modern Transformer-based implementation of earlier probabilistic methods based on confusion sets and show that RWSEs can be detected with a good balance between missing errors and raising too many falsealarms. The confusion sets are dynamically configurable, allowing teachers to easily adjust which errors trigger feedback.
Anthology ID:
2025.bea-1.29
Volume:
Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Ekaterina Kochmar, Bashar Alhafni, Marie Bexte, Jill Burstein, Andrea Horbach, Ronja Laarmann-Quante, Anaïs Tack, Victoria Yaneva, Zheng Yuan
Venues:
BEA | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
375–383
Language:
URL:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bea-1.29/
DOI:
Bibkey:
Cite (ACL):
Torsten Zesch, Dominic Gardner, and Marie Bexte. 2025. Transformer-Based Real-Word Spelling Error Feedback with Configurable Confusion Sets. In Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025), pages 375–383, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Transformer-Based Real-Word Spelling Error Feedback with Configurable Confusion Sets (Zesch et al., BEA 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/acl25-workshop-ingestion/2025.bea-1.29.pdf