PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English
Michael Kranzlein, Emma Manning, Siyao Peng, Shira Wein, Aryaman Arora, Nathan Schneider
Abstract
We present the Prepositions Annotated with Supsersense Tags in Reddit International English (“PASTRIE”) corpus, a new dataset containing manually annotated preposition supersenses of English data from presumed speakers of four L1s: English, French, German, and Spanish. The annotations are comprehensive, covering all preposition types and tokens in the sample. Along with the corpus, we provide analysis of distributional patterns across the included L1s and a discussion of the influence of L1s on L2 preposition choice.- Anthology ID:
- 2020.law-1.10
- Volume:
- Proceedings of the 14th Linguistic Annotation Workshop
- Month:
- December
- Year:
- 2020
- Address:
- Barcelona, Spain
- Venue:
- LAW
- SIG:
- SIGANN
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 105–116
- Language:
- URL:
- https://aclanthology.org/2020.law-1.10
- DOI:
- Cite (ACL):
- Michael Kranzlein, Emma Manning, Siyao Peng, Shira Wein, Aryaman Arora, and Nathan Schneider. 2020. PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English. In Proceedings of the 14th Linguistic Annotation Workshop, pages 105–116, Barcelona, Spain. Association for Computational Linguistics.
- Cite (Informal):
- PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English (Kranzlein et al., LAW 2020)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2020.law-1.10.pdf
- Code
- nert-nlp/pastrie
- Data
- PASTRIE