Abstract
Word sense disambiguation (WSD) is the task of identifying the intended sense of a word in context. While prior work on unsupervised WSD has leveraged lexical knowledge bases, such as WordNet and BabelNet, these resources have proven to be less effective for Chinese. Instead, the most widely used lexical knowledge base for Chinese is HowNet. Previous HowNet-based WSD methods have not exploited contextual translation information. In this paper, we present the first HowNet-based WSD system which combines monolingual contextual information from a pretrained neural language model with bilingual information obtained via machine translation and sense translation information from HowNet. The results of our evaluation experiment on a test set from prior work demonstrate that our new method achieves a new state of the art for unsupervised Chinese WSD.- Anthology ID:
- 2022.findings-emnlp.331
- Volume:
- Findings of the Association for Computational Linguistics: EMNLP 2022
- Month:
- December
- Year:
- 2022
- Address:
- Abu Dhabi, United Arab Emirates
- Editors:
- Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 4530–4536
- Language:
- URL:
- https://aclanthology.org/2022.findings-emnlp.331
- DOI:
- 10.18653/v1/2022.findings-emnlp.331
- Cite (ACL):
- Xiang Zhang, Bradley Hauer, and Grzegorz Kondrak. 2022. Improving HowNet-Based Chinese Word Sense Disambiguation with Translations. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 4530–4536, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
- Cite (Informal):
- Improving HowNet-Based Chinese Word Sense Disambiguation with Translations (Zhang et al., Findings 2022)
- PDF:
- https://preview.aclanthology.org/naacl-24-ws-corrections/2022.findings-emnlp.331.pdf