Benchmarking Lexical Simplification Systems

Gustavo Paetzold, Lucia Specia


Abstract
Lexical Simplification is the task of replacing complex words in a text with simpler alternatives. A variety of strategies have been devised for this challenge, yet there has been little effort in comparing their performance. In this contribution, we present a benchmarking of several Lexical Simplification systems. By combining resources created in previous work with automatic spelling and inflection correction techniques, we introduce BenchLS: a new evaluation dataset for the task. Using BenchLS, we evaluate the performance of solutions for various steps in the typical Lexical Simplification pipeline, both individually and jointly. This is the first time Lexical Simplification systems are compared in such fashion on the same data, and the findings introduce many contributions to the field, revealing several interesting properties of the systems evaluated.
Anthology ID:
L16-1491
Volume:
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:
May
Year:
2016
Address:
Portorož, Slovenia
Editors:
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3074–3080
Language:
URL:
https://aclanthology.org/L16-1491
DOI:
Bibkey:
Cite (ACL):
Gustavo Paetzold and Lucia Specia. 2016. Benchmarking Lexical Simplification Systems. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 3074–3080, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):
Benchmarking Lexical Simplification Systems (Paetzold & Specia, LREC 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/ml4al-ingestion/L16-1491.pdf