Abstract
We propose the first generative models for three types of extra-grammatical word formation phenomena abounding in slang: Blends, Clippings, and Reduplicatives. Adopting a data-driven approach coupled with linguistic knowledge, we propose simple models with state of the art performance on human annotated gold standard datasets. Overall, our models reveal insights into the generative processes of word formation in slang – insights which are increasingly relevant in the context of the rising prevalence of slang and non-standard varieties on the Internet- Anthology ID:
- N18-1129
- Volume:
- Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers)
- Month:
- June
- Year:
- 2018
- Address:
- New Orleans, Louisiana
- Editors:
- Marilyn Walker, Heng Ji, Amanda Stent
- Venue:
- NAACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 1424–1434
- Language:
- URL:
- https://aclanthology.org/N18-1129
- DOI:
- 10.18653/v1/N18-1129
- Cite (ACL):
- Vivek Kulkarni and William Yang Wang. 2018. Simple Models for Word Formation in Slang. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 1424–1434, New Orleans, Louisiana. Association for Computational Linguistics.
- Cite (Informal):
- Simple Models for Word Formation in Slang (Kulkarni & Wang, NAACL 2018)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/N18-1129.pdf
- Code
- viveksck/simplicity