Maya Rudich
2020
A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment
Sina Ahmadi | John P. McCrae | Sanni Nimb | Fahad Khan | Monica Monachini | Bolette S. Pedersen | Thierry Declerck | Tanja Wissik | Andrea Bellandi | Irene Pisani | Thomas Troelsgård | Sussi Olsen | Simon Krek | Veronika Lipp | Tamás Váradi | László Simon | András Győrffy | Carole Tiberius | Tanneke Schoonheim | Yifat Ben Moshe | Maya Rudich | Raya Abu Ahmad | Dorielle Lonke | Kira Kovalenko | Margit Langemets | Jelena Kallas | Oksana Dereza | Theodorus Fransen | David Cillessen | David Lindemann | Mikel Alonso | Ana Salgado | José Luis Sancho | Rafael-J. Ureña-Ruiz | Jordi Porta Zamorano | Kiril Simov | Petya Osenova | Zara Kancheva | Ivaylo Radev | Ranka Stanković | Andrej Perdih | Dejan Gabrovšek
Proceedings of the Twelfth Language Resources and Evaluation Conference
Sina Ahmadi | John P. McCrae | Sanni Nimb | Fahad Khan | Monica Monachini | Bolette S. Pedersen | Thierry Declerck | Tanja Wissik | Andrea Bellandi | Irene Pisani | Thomas Troelsgård | Sussi Olsen | Simon Krek | Veronika Lipp | Tamás Váradi | László Simon | András Győrffy | Carole Tiberius | Tanneke Schoonheim | Yifat Ben Moshe | Maya Rudich | Raya Abu Ahmad | Dorielle Lonke | Kira Kovalenko | Margit Langemets | Jelena Kallas | Oksana Dereza | Theodorus Fransen | David Cillessen | David Lindemann | Mikel Alonso | Ana Salgado | José Luis Sancho | Rafael-J. Ureña-Ruiz | Jordi Porta Zamorano | Kiril Simov | Petya Osenova | Zara Kancheva | Ivaylo Radev | Ranka Stanković | Andrej Perdih | Dejan Gabrovšek
Proceedings of the Twelfth Language Resources and Evaluation Conference
Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment is carried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships such as broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide range of languages and resources and focuses on the more challenging task of linking general-purpose language. We believe that our data will pave the way for further advances in alignment and evaluation of word senses by creating new solutions, particularly those notoriously requiring data such as neural networks. Our resources are publicly available at https://github.com/elexis-eu/MWSA.
Search
Fix author
Co-authors
- Raya Abu Ahmad 1
- Sina Ahmadi 1
- Mikel Alonso 1
- Andrea Bellandi 1
- Yifat Ben Moshe 1
- David Cillessen 1
- Thierry Declerck 1
- Oksana Dereza 1
- Theodorus Fransen 1
- Dejan Gabrovšek 1
- András Győrffy 1
- Jelena Kallas 1
- Zara Kancheva 1
- Fahad Khan 1
- Kira Kovalenko 1
- Simon Krek 1
- Margit Langemets 1
- David Lindemann 1
- Veronika Lipp 1
- Dorielle Lonke 1
- John Philip McCrae 1
- Monica Monachini 1
- Sanni Nimb 1
- Sussi Olsen 1
- Petya Osenova 1
- Bolette Sandford Pedersen 1
- Andrej Perdih 1
- Irene Pisani 1
- Ivaylo Radev 1
- Ana Salgado 1
- José-Luis Sancho 1
- Tanneke Schoonheim 1
- László Simon 1
- Kiril Simov 1
- Ranka Stanković 1
- Carole Tiberius 1
- Thomas Troelsgård 1
- Rafael-J. Ureña-Ruiz 1
- Tamás Váradi 1
- Tanja Wissik 1
- Jordi Porta Zamorano 1
Venues
- lrec1