A Framenet and Frame Annotator for German Social Media

Eckhard Bick


Abstract
This paper presents PFN-DE, a new, parsing- and annotation-oriented framenet for German, with almost 15,000 frames, covering 11,300 verb lemmas. The resource was developed in the context of a Danish/German social-media study on hate speech and has a strong focus on coverage, robustness and cross-language comparability. A simple annotation scheme for argument roles meshes directly with the output of a syntactic parser, facilitating frame disambiguation through slot-filler conditions based on valency, syntactic function and semantic noun class. We discuss design principles for the framenet and the frame tagger using it, and present statistics for frame and role distribution at both the lexicon (type) and corpus (token) levels. In an evaluation run on Twitter data, the parser-based frame annotator achieved an overall F-score for frame senses of 93.6%.
Anthology ID:
2022.lrec-1.419
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Venue:
LREC
Publisher:
European Language Resources Association
Pages:
3942–3949
URL:
https://aclanthology.org/2022.lrec-1.419
Cite (ACL):
Eckhard Bick. 2022. A Framenet and Frame Annotator for German Social Media. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 3942–3949, Marseille, France. European Language Resources Association.
Cite (Informal):
A Framenet and Frame Annotator for German Social Media (Bick, LREC 2022)
PDF:
https://preview.aclanthology.org/ingestion-script-update/2022.lrec-1.419.pdf
Data
FrameNet