Archaeology at MLSP 2024: Machine Translation for Lexical Complexity Prediction and Lexical Simplification

Petru Cristea, Sergiu Nisioi


Abstract
We present the submissions of team Archaeology for the Lexical Simplification and Lexical Complexity Prediction Shared Tasks at BEA2024. Our approach for this shared task consists in creating two pipelines for generating lexical substitutions and estimating the complexity: one using machine translation texts into English and one using the original language.For the LCP subtask, our xgb regressor is trained with engineered features (based primarily on English language resources) and shallow word structure features. For the LS subtask we use a locally-executed quantized LLM to generate candidates and sort them by complexity score computed using the pipeline designed for LCP.These pipelines provide distinct perspectives on the lexical simplification process, offering insights into the efficacy and limitations of employing Machine Translation versus direct processing on the original language data.
Anthology ID:
2024.bea-1.55
Volume:
Proceedings of the 19th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2024)
Month:
June
Year:
2024
Address:
Mexico City, Mexico
Editors:
Ekaterina Kochmar, Marie Bexte, Jill Burstein, Andrea Horbach, Ronja Laarmann-Quante, Anaïs Tack, Victoria Yaneva, Zheng Yuan
Venue:
BEA
SIG:
SIGEDU
Publisher:
Association for Computational Linguistics
Note:
Pages:
610–617
Language:
URL:
https://aclanthology.org/2024.bea-1.55
DOI:
Bibkey:
Cite (ACL):
Petru Cristea and Sergiu Nisioi. 2024. Archaeology at MLSP 2024: Machine Translation for Lexical Complexity Prediction and Lexical Simplification. In Proceedings of the 19th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2024), pages 610–617, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):
Archaeology at MLSP 2024: Machine Translation for Lexical Complexity Prediction and Lexical Simplification (Cristea & Nisioi, BEA 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/jeptaln-2024-ingestion/2024.bea-1.55.pdf