Abstract
Modeling derivational morphology to generate words with particular semantics is useful in many text generation tasks, such as machine translation or abstractive question answering. In this work, we tackle the task of derived word generation. That is, we attempt to generate the word “runner” for “someone who runs.” We identify two key problems in generating derived words from root words and transformations. We contribute a novel aggregation model of derived word generation that learns derivational transformations both as orthographic functions using sequence-to-sequence models and as functions in distributional word embedding space. The model then learns to choose between the hypothesis of each system. We also present two ways of incorporating corpus information into derived word generation.- Anthology ID:
- P18-1180
- Volume:
- Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
- Month:
- July
- Year:
- 2018
- Address:
- Melbourne, Australia
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 1938–1947
- Language:
- URL:
- https://aclanthology.org/P18-1180
- DOI:
- 10.18653/v1/P18-1180
- Cite (ACL):
- Daniel Deutsch, John Hewitt, and Dan Roth. 2018. A Distributional and Orthographic Aggregation Model for English Derivational Morphology. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1938–1947, Melbourne, Australia. Association for Computational Linguistics.
- Cite (Informal):
- A Distributional and Orthographic Aggregation Model for English Derivational Morphology (Deutsch et al., ACL 2018)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/P18-1180.pdf
- Code
- danieldeutsch/acl2018