Creative language explorations through a high-expressivity N-grams query language
Carlo Strapparava, Lorenzo Gatti, Marco Guerini, Oliviero Stock
Abstract
In computation linguistics a combination of syntagmatic and paradigmatic features is often exploited. While the first aspects are typically managed by information present in large n-gram databases, domain and ontological aspects are more properly modeled by lexical ontologies such as WordNet and semantic similarity spaces. This interconnection is even stricter when we are dealing with creative language phenomena, such as metaphors, prototypical properties, puns generation, hyperbolae and other rhetorical phenomena. This paper describes a way to focus on and accomplish some of these tasks by exploiting NgramQuery, a generalized query language on Google N-gram database. The expressiveness of this query language is boosted by plugging semantic similarity acquired both from corpora (e.g. LSA) and from WordNet, also integrating operators for phonetics and sentiment analysis. The paper reports a number of examples of usage in some creative language tasks.- Anthology ID:
- L14-1408
- Volume:
- Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
- Month:
- May
- Year:
- 2014
- Address:
- Reykjavik, Iceland
- Editors:
- Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Hrafn Loftsson, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis
- Venue:
- LREC
- SIG:
- Publisher:
- European Language Resources Association (ELRA)
- Note:
- Pages:
- 4326–4330
- Language:
- URL:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/486_Paper.pdf
- DOI:
- Cite (ACL):
- Carlo Strapparava, Lorenzo Gatti, Marco Guerini, and Oliviero Stock. 2014. Creative language explorations through a high-expressivity N-grams query language. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 4326–4330, Reykjavik, Iceland. European Language Resources Association (ELRA).
- Cite (Informal):
- Creative language explorations through a high-expressivity N-grams query language (Strapparava et al., LREC 2014)
- PDF:
- http://www.lrec-conf.org/proceedings/lrec2014/pdf/486_Paper.pdf