Kyonghee Paik


2004

A Comparison of Two Variant Corpora: The Same Content with Different Source
Kyonghee Paik | Kiyonori Ohtake | Kazuhide Yamamoto
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

To investigate the effect of source language on translations, we compare two variants of a Korean translation corpus. The first variant consists of Korean translations of 162,308 Japanese sentences from the ATR BTEC (Basic Travel Expression Corpus). The second variant was made by translating the English translations of the Japanese sentences into Korean. We show that the source language text has a large influence on the target text. Even after normalizing orthographic differences, fewer than 8.3% of the sentences in the two variants were identical. We describe in general which phenomena differ and then discuss how our analysis can be used in natural language processing.

Automatic Construction of a Transfer Dictionary Considering Directionality
Kyonghee Paik | Satoshi Shirai | Hiromi Nakaiwa
Proceedings of the Workshop on Multilingual Linguistic Resources

Bilingual Knowledge Extraction Using Chunk Alignment
Young-Sook Hwang | Kyonghee Paik | Yutaka Sasaki
Proceedings of the 18th Pacific Asia Conference on Language, Information and Computation

2000

Reusing an ontology to generate numeral classifiers
Francis Bond | Kyonghee Paik
COLING 2000 Volume 1: The 18th International Conference on Computational Linguistics