Parser Evaluation for Analyzing Swedish 19th-20th Century Literature

Sara Stymne, Carin Östman, David Håkansson


Abstract
In this study, we aim to find a parser that accurately identifies different types of subordinate clauses, and related phenomena, in 19th–20th-century Swedish literature. Since no parsing test set is available for this time period, we propose a lightweight annotation scheme in which a single relation of interest is annotated per sentence. We train a variety of parsers for Swedish and compare their results on standard modern test sets and on our targeted test set. We find clear trends in which parser types perform best on the standard test sets, whereas performance is considerably more varied on the targeted test set. We believe that our proposed annotation scheme can usefully complement standard evaluations at a low annotation cost.
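The idea of scoring a parser on a single annotated relation per sentence can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not specify the annotation format or the metric, so the record layout (`TargetedGold`), the prediction structure, and the two scores (`head_acc`, `labeled_acc`) are hypothetical and not the authors' implementation.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TargetedGold:
    """One annotated relation of interest for one sentence (hypothetical layout)."""
    sent_id: str
    dependent: int  # 1-based index of the dependent token
    head: int       # 1-based index of its gold head (0 = root)
    label: str      # gold relation label, e.g. "advcl"


def targeted_scores(gold, predictions):
    """Score parser output on the annotated relations only.

    predictions maps sent_id -> {dependent index -> (head, label)}.
    A missing sentence or token counts as an error.
    """
    total = len(gold)
    if total == 0:
        return {"head_acc": 0.0, "labeled_acc": 0.0}

    head_ok = 0
    both_ok = 0
    for g in gold:
        pred = predictions.get(g.sent_id, {}).get(g.dependent)
        if pred is None:
            continue
        pred_head, pred_label = pred
        if pred_head == g.head:
            head_ok += 1
            if pred_label == g.label:
                both_ok += 1

    return {"head_acc": head_ok / total, "labeled_acc": both_ok / total}


# Toy data, invented for illustration only.
gold = [
    TargetedGold("s1", dependent=5, head=2, label="advcl"),
    TargetedGold("s2", dependent=4, head=1, label="acl:relcl"),
]
predictions = {
    "s1": {5: (2, "advcl")},
    "s2": {4: (1, "ccomp")},  # correct head, wrong label
}
scores = targeted_scores(gold, predictions)
```

Because only one token per sentence is scored, a test set of this kind needs far less annotation than a full treebank, which is the trade-off the abstract describes.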
Anthology ID:
2023.nodalida-1.35
Volume:
Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
Month:
May
Year:
2023
Address:
Tórshavn, Faroe Islands
Editors:
Tanel Alumäe, Mark Fishel
Venue:
NoDaLiDa
Publisher:
University of Tartu Library
Pages:
335–346
URL:
https://aclanthology.org/2023.nodalida-1.35
Cite (ACL):
Sara Stymne, Carin Östman, and David Håkansson. 2023. Parser Evaluation for Analyzing Swedish 19th-20th Century Literature. In Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa), pages 335–346, Tórshavn, Faroe Islands. University of Tartu Library.
Cite (Informal):
Parser Evaluation for Analyzing Swedish 19th-20th Century Literature (Stymne et al., NoDaLiDa 2023)
PDF:
https://preview.aclanthology.org/nschneid-patch-5/2023.nodalida-1.35.pdf