A New Annotation Scheme for the Sejong Part-of-speech Tagged Corpus

Jungyeul Park, Francis Tyers


Abstract
In this paper we present a new annotation scheme for the Sejong part-of-speech tagged corpus based on Universal Dependencies style annotation. By using a new annotation scheme, we can produce Sejong-style morphological analysis and part-of-speech tagging results which have been the de facto standard for Korean language processing. We also explore the possibility of doing named-entity recognition and semantic-role labelling for Korean using the new annotation scheme.
Anthology ID:
W19-4022
Volume:
Proceedings of the 13th Linguistic Annotation Workshop
Month:
August
Year:
2019
Address:
Florence, Italy
Editors:
Annemarie Friedrich, Deniz Zeyrek, Jet Hoek
Venue:
LAW
SIG:
SIGANN
Publisher:
Association for Computational Linguistics
Note:
Pages:
195–202
Language:
URL:
https://aclanthology.org/W19-4022
DOI:
10.18653/v1/W19-4022
Bibkey:
Cite (ACL):
Jungyeul Park and Francis Tyers. 2019. A New Annotation Scheme for the Sejong Part-of-speech Tagged Corpus. In Proceedings of the 13th Linguistic Annotation Workshop, pages 195–202, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):
A New Annotation Scheme for the Sejong Part-of-speech Tagged Corpus (Park & Tyers, LAW 2019)
Copy Citation:
PDF:
https://preview.aclanthology.org/dois-2013-emnlp/W19-4022.pdf
Code
 jungyeul/sjmorph