2020
pdf
Enhancing Quality of Corpus Annotation: Construction of the Multi-Layer Corpus Annotation and Simplified Validation of the Corpus Annotation
Youngbin Noh
|
Kuntae Kim
|
Minho Lee
|
Cheolhun Heo
|
Yongbin Jeong
|
Yoosung Jeong
|
Younggyun Hahm
|
Taehwan Oh
|
Hyonsu Choe
|
Seokwon Park
|
Jin-Dong Kim
|
Key-Sun Choi
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
pdf
abs
Crowdsourcing in the Development of a Multilingual FrameNet: A Case Study of Korean FrameNet
Younggyun Hahm
|
Youngbin Noh
|
Ji Yoon Han
|
Tae Hwan Oh
|
Hyonsu Choe
|
Hansaem Kim
|
Key-Sun Choi
Proceedings of the Twelfth Language Resources and Evaluation Conference
Using current methods, the construction of multilingual resources in FrameNet is an expensive and complex task. While crowdsourcing is a viable alternative, it is difficult to include non-native English speakers in such efforts as they often have difficulty with English-based FrameNet tools. In this work, we investigated cross-lingual issues in crowdsourcing approaches for multilingual FrameNets, specifically in the context of the newly constructed Korean FrameNet. To accomplish this, we evaluated the effectiveness of various crowdsourcing settings whereby certain types of information are provided to workers, such as English definitions in FrameNet or translated definitions. We then evaluated whether the crowdsourced results accurately captured the meaning of frames both cross-culturally and cross-linguistically, and found that by allowing the crowd workers to make intuitive choices, they achieved a quality comparable to that of trained FrameNet experts (F1 > 0.75). The outcomes of this work are now publicly available as a new release of Korean FrameNet 1.1.
2018
pdf
abs
Enhancing Universal Dependencies for Korean
Youngbin Noh
|
Jiyoon Han
|
Tae Hwan Oh
|
Hansaem Kim
Proceedings of the Second Workshop on Universal Dependencies (UDW 2018)
In this paper, for the purpose of enhancing Universal Dependencies for the Korean language, we propose a modified method for mapping Korean Part-of-Speech(POS) tagset in relation to Universal Part-of-Speech (UPOS) tagset in order to enhance the Universal Dependencies for the Korean Language. Previous studies suggest that UPOS reflects several issues that influence dependency annotation by using the POS of Korean predicates, particularly the distinctiveness in using verb, adjective, and copula.