Infinite SCAN: An Infinite Model of Diachronic Semantic Change

Seiichi Inoue, Mamoru Komachi, Toshinobu Ogiso, Hiroya Takamura, Daichi Mochihashi


Abstract
In this study, we propose a Bayesian model that can jointly estimate the number of senses of words and their changes through time. The model combines a dynamic topic model on Gaussian Markov random fields with a logistic stick-breaking process that realizes Dirichlet process. In the experiments, we evaluated the proposed model in terms of interpretability, accuracy in estimating the number of senses, and tracking their changes using both artificial data and real data. We quantitatively verified that the model behaves as expected through evaluation using artificial data. Using the CCOHA corpus, we showed that our model outperforms the baseline model and investigated the semantic changes of several well-known target words.
Anthology ID:
2022.emnlp-main.104
Volume:
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates
Editors:
Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1605–1616
Language:
URL:
https://aclanthology.org/2022.emnlp-main.104
DOI:
10.18653/v1/2022.emnlp-main.104
Bibkey:
Cite (ACL):
Seiichi Inoue, Mamoru Komachi, Toshinobu Ogiso, Hiroya Takamura, and Daichi Mochihashi. 2022. Infinite SCAN: An Infinite Model of Diachronic Semantic Change. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 1605–1616, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
Cite (Informal):
Infinite SCAN: An Infinite Model of Diachronic Semantic Change (Inoue et al., EMNLP 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-2023-videos/2022.emnlp-main.104.pdf