Abstract
This paper presents a simple method for German compound splitting that combines a basic frequency-based approach with a form-to-lemma mapping to approximate morphological operations. With the exception of a small set of hand-crafted rules for modeling transitional elements, this approach is resource-poor. In our evaluation, the simple splitter outperforms a splitter relying on rich morphological resources.- Anthology ID:
- W17-1722
- Volume:
- Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017)
- Month:
- April
- Year:
- 2017
- Address:
- Valencia, Spain
- Editors:
- Stella Markantonatou, Carlos Ramisch, Agata Savary, Veronika Vincze
- Venue:
- MWE
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 161–166
- Language:
- URL:
- https://aclanthology.org/W17-1722
- DOI:
- 10.18653/v1/W17-1722
- Cite (ACL):
- Marion Weller-Di Marco. 2017. Simple Compound Splitting for German. In Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), pages 161–166, Valencia, Spain. Association for Computational Linguistics.
- Cite (Informal):
- Simple Compound Splitting for German (Weller-Di Marco, MWE 2017)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/W17-1722.pdf