Personal noun detection for German

Carla Sökefeld, Melanie Andresen, Johanna Binnewitt, Heike Zinsmeister


Abstract
Personal nouns, i.e. common nouns denoting human beings, play an important role in manifesting gender and gender stereotypes in texts, especially for languages with grammatical gender like German. Automatically detecting and extracting personal nouns can thus be of interest to a myriad of different tasks such as minimizing gender bias in language models and researching gender stereotypes or gender-fair language, but is complicated by the morphological heterogeneity and homonymy of personal and non-personal nouns, which restrict lexicon-based approaches. In this paper, we introduce a classifier created by fine-tuning a transformer model that detects personal nouns in German. Although some phenomena like homonymy and metalinguistic uses are still problematic, the model is able to classify personal nouns with robust accuracy (f1-score: 0.94).
Anthology ID:
2023.isa-1.5
Volume:
Proceedings of the 19th Joint ACL-ISO Workshop on Interoperable Semantics (ISA-19)
Month:
June
Year:
2023
Address:
Nancy, France
Editor:
Harry Bunt
Venues:
ISA | WS
SIG:
SIGSEM
Publisher:
Association for Computational Linguistics
Note:
Pages:
33–39
Language:
URL:
https://aclanthology.org/2023.isa-1.5
DOI:
Bibkey:
Cite (ACL):
Carla Sökefeld, Melanie Andresen, Johanna Binnewitt, and Heike Zinsmeister. 2023. Personal noun detection for German. In Proceedings of the 19th Joint ACL-ISO Workshop on Interoperable Semantics (ISA-19), pages 33–39, Nancy, France. Association for Computational Linguistics.
Cite (Informal):
Personal noun detection for German (Sökefeld et al., ISA-WS 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-2024-clasp/2023.isa-1.5.pdf