Context-aware Swedish Lexical Simplification

Emil Graichen, Arne Jonsson


Abstract
We present results from the development and evaluation of context-aware Lexical simplification (LS) systems for the Swedish language. Three versions of LS models, LäsBERT, LäsBERT-baseline, and LäsGPT, were created and evaluated on a newly constructed Swedish LS evaluation dataset. The LS systems demonstrated promising potential in aiding audiences with reading difficulties by providing context-aware word replacements. While there were areas for improvement, particularly in complex word identification, the systems showed agreement with human annotators on word replacements.
Anthology ID:
2023.tsar-1.2
Volume:
Proceedings of the Second Workshop on Text Simplification, Accessibility and Readability
Month:
September
Year:
2023
Address:
Varna, Bulgaria
Editors:
Sanja Štajner, Horacio Saggio, Matthew Shardlow, Fernando Alva-Manchego
Venues:
TSAR | WS
SIG:
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Note:
Pages:
11–20
Language:
URL:
https://aclanthology.org/2023.tsar-1.2
DOI:
Bibkey:
Cite (ACL):
Emil Graichen and Arne Jonsson. 2023. Context-aware Swedish Lexical Simplification. In Proceedings of the Second Workshop on Text Simplification, Accessibility and Readability, pages 11–20, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
Context-aware Swedish Lexical Simplification (Graichen & Jonsson, TSAR-WS 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2023.tsar-1.2.pdf