English-Turkish Parallel Semantic Annotation of Penn-Treebank
Bilge Nas Arıcan, Özge Bakay, Begüm Avar, Olcay Taner Yıldız, Özlem Ergelen
Abstract
This paper reports our efforts in constructing a sense-labeled English-Turkish parallel corpus using the traditional method of manual tagging. We tagged a pre-built parallel treebank which was translated from the Penn Treebank corpus. This approach allowed us to generate a resource combining syntactic and semantic information. We provide statistics about the corpus itself as well as information regarding its development process.- Anthology ID:
- 2019.gwc-1.38
- Volume:
- Proceedings of the 10th Global Wordnet Conference
- Month:
- July
- Year:
- 2019
- Address:
- Wroclaw, Poland
- Editors:
- Piek Vossen, Christiane Fellbaum
- Venue:
- GWC
- SIG:
- SIGLEX
- Publisher:
- Global Wordnet Association
- Note:
- Pages:
- 298–306
- Language:
- URL:
- https://aclanthology.org/2019.gwc-1.38
- DOI:
- Cite (ACL):
- Bilge Nas Arıcan, Özge Bakay, Begüm Avar, Olcay Taner Yıldız, and Özlem Ergelen. 2019. English-Turkish Parallel Semantic Annotation of Penn-Treebank. In Proceedings of the 10th Global Wordnet Conference, pages 298–306, Wroclaw, Poland. Global Wordnet Association.
- Cite (Informal):
- English-Turkish Parallel Semantic Annotation of Penn-Treebank (Arıcan et al., GWC 2019)
- PDF:
- https://preview.aclanthology.org/ml4al-ingestion/2019.gwc-1.38.pdf