Abstract
In this paper we present our contribution to the TSAR-2022 Shared Task on Lexical Simplification of the EMNLP 2022 Workshop on Text Simplification, Accessibility, and Readability. Our approach builds on and extends the unsupervised lexical simplification system with pretrained encoders (LSBert) system introduced in Qiang et al. (2020) in the following ways: For the subtask of simplification candidate selection, it utilizes a RoBERTa transformer language model and expands the size of the generated candidate list. For subsequent substitution ranking, it introduces a new feature weighting scheme and adopts a candidate filtering method based on textual entailment to maximize semantic similarity between the target word and its simplification. Our best-performing system improves LSBert by 5.9% accuracy and achieves second place out of 33 ranked solutions.- Anthology ID:
- 2022.tsar-1.27
- Volume:
- Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022)
- Month:
- December
- Year:
- 2022
- Address:
- Abu Dhabi, United Arab Emirates (Virtual)
- Venue:
- TSAR
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 243–250
- Language:
- URL:
- https://aclanthology.org/2022.tsar-1.27
- DOI:
- Cite (ACL):
- Xiaofei Li, Daniel Wiechmann, Yu Qiao, and Elma Kerz. 2022. MANTIS at TSAR-2022 Shared Task: Improved Unsupervised Lexical Simplification with Pretrained Encoders. In Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022), pages 243–250, Abu Dhabi, United Arab Emirates (Virtual). Association for Computational Linguistics.
- Cite (Informal):
- MANTIS at TSAR-2022 Shared Task: Improved Unsupervised Lexical Simplification with Pretrained Encoders (Li et al., TSAR 2022)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/2022.tsar-1.27.pdf