Abstract
In this paper we present a new annotation scheme for the Sejong part-of-speech tagged corpus based on Universal Dependencies style annotation. By using a new annotation scheme, we can produce Sejong-style morphological analysis and part-of-speech tagging results which have been the de facto standard for Korean language processing. We also explore the possibility of doing named-entity recognition and semantic-role labelling for Korean using the new annotation scheme.- Anthology ID:
- W19-4022
- Volume:
- Proceedings of the 13th Linguistic Annotation Workshop
- Month:
- August
- Year:
- 2019
- Address:
- Florence, Italy
- Editors:
- Annemarie Friedrich, Deniz Zeyrek, Jet Hoek
- Venue:
- LAW
- SIG:
- SIGANN
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 195–202
- Language:
- URL:
- https://aclanthology.org/W19-4022
- DOI:
- 10.18653/v1/W19-4022
- Cite (ACL):
- Jungyeul Park and Francis Tyers. 2019. A New Annotation Scheme for the Sejong Part-of-speech Tagged Corpus. In Proceedings of the 13th Linguistic Annotation Workshop, pages 195–202, Florence, Italy. Association for Computational Linguistics.
- Cite (Informal):
- A New Annotation Scheme for the Sejong Part-of-speech Tagged Corpus (Park & Tyers, LAW 2019)
- PDF:
- https://preview.aclanthology.org/dois-2013-emnlp/W19-4022.pdf
- Code
- jungyeul/sjmorph