Edition 2.0 of the PARSEME shared task on multilingual identification and paraphrasing of multiword expressions
Manon Scholivet, Agata Savary, Carlos Ramisch, Eric Bilinski, Takuya Nakamura, Maria Mitrofan, Vasile Pais
Abstract
Multiword expressions (MWEs) have been a major challenge in NLP for decades and research on MWEs was driven notably by shared tasks, including those organized by the PARSEME community. We report the organisation and the results of edition 2.0 of the PARSEME shared task. For the first time, all syntactic categories are covered: verbal, nominal, adjectival, adverbial and functional. We rely on edition 2.0 of the PARSEME corpus, annotated for all these categories in 17 languages. We create a new dataset with paraphrases of sentences containing idioms in 14 languages, and defining a new subtask dedicated to MWE paraphrasing. We extend our evaluation protocol by measuring both performance and diversity of systems, and including manual evaluation in paraphrasing. 10 systems, including the baseline, participated in the MWE identification subtask and 5 in the paraphrasing subtask. Results are promising, but known MWE identification challenges remain unsolved. Performance correlates positively with diversity in MWE identification, and negatively in MWE paraphrasing.- Anthology ID:
- 2026.mwe-1.33
- Volume:
- Proceedings of the 22nd Workshop on Multiword Expressions (MWE 2026)
- Month:
- March
- Year:
- 2026
- Address:
- Rabat, Marocco
- Editors:
- Atul Kr. Ojha, Verginica Barbu Mititelu, Mathieu Constant, Ivelina Stoyanova, A. Seza Doğruöz, Alexandre Rademaker
- Venues:
- MWE | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 254–275
- Language:
- URL:
- https://preview.aclanthology.org/ingest-eacl/2026.mwe-1.33/
- DOI:
- Cite (ACL):
- Manon Scholivet, Agata Savary, Carlos Ramisch, Eric Bilinski, Takuya Nakamura, Maria Mitrofan, and Vasile Pais. 2026. Edition 2.0 of the PARSEME shared task on multilingual identification and paraphrasing of multiword expressions. In Proceedings of the 22nd Workshop on Multiword Expressions (MWE 2026), pages 254–275, Rabat, Marocco. Association for Computational Linguistics.
- Cite (Informal):
- Edition 2.0 of the PARSEME shared task on multilingual identification and paraphrasing of multiword expressions (Scholivet et al., MWE 2026)
- PDF:
- https://preview.aclanthology.org/ingest-eacl/2026.mwe-1.33.pdf