Abstract
In this paper we discuss the usefulness of applying a checking procedure to existing thesauri. The procedure is based on the analysis of discrepancies of corpus-based and thesaurus-based word similarities. We applied the procedure to more than 30 thousand words of the Russian wordnet and found some serious errors in word sense description, including inaccurate relationships and missing senses of ambiguous words.- Anthology ID:
- P19-1577
- Volume:
- Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
- Month:
- July
- Year:
- 2019
- Address:
- Florence, Italy
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 5773–5779
- Language:
- URL:
- https://aclanthology.org/P19-1577
- DOI:
- 10.18653/v1/P19-1577
- Cite (ACL):
- Natalia Loukachevitch. 2019. Corpus-based Check-up for Thesaurus. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 5773–5779, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- Corpus-based Check-up for Thesaurus (Loukachevitch, ACL 2019)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/P19-1577.pdf
- Data
- SemEval-2018 Task 9: Hypernym Discovery