Mia-Marie Hammarlin
2023
Investigating the Effects of MWE Identification in Structural Topic Modelling
Dimitrios Kokkinakis
|
Ricardo Muñoz Sánchez
|
Sebastianus Bruinsma
|
Mia-Marie Hammarlin
Proceedings of the 19th Workshop on Multiword Expressions (MWE 2023)
Multiword expressions (MWEs) are common word combinations which exhibit idiosyncrasies in various linguistic levels. For various downstream natural language processing applications and tasks, the identification and discovery of MWEs has been proven to be potentially practical and useful, but still challenging to codify. In this paper we investigate various, relevant to MWE, resources and tools for Swedish, and, within a specific application scenario, namely ‘vaccine skepticism’, we apply structural topic modelling to investigate whether there are any interpretative advantages of identifying MWEs.
Scaling-up the Resources for a Freely Available Swedish VADER (svVADER)
Dimitrios Kokkinakis
|
Ricardo Muñoz Sánchez
|
Mia-Marie Hammarlin
Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
With widespread commercial applications in various domains, sentiment analysis has become a success story for Natural Language Processing (NLP). Still, although sentiment analysis has rapidly progressed during the last years, mainly due to the application of modern AI technologies, many approaches apply knowledge-based strategies, such as lexicon-based, to the task. This is particularly true for analyzing short social media content, e.g., tweets. Moreover, lexicon-based sentiment analysis approaches are usually preferred over learning-based methods when training data is unavailable or insufficient. Therefore, our main goal is to scale-up and apply a lexicon-based approach which can be used as a baseline to Swedish sentiment analysis. All scaled-up resources are made available, while the performance of this enhanced tool is evaluated on two short datasets, achieving adequate results.
Search