Young-Gil Kim
Also published as: Young-Kill Kim, Young Kil Kim, Young-Kil Kim, YoungKil Kim, Youngkil Kim, Young-gil Kim
2020
POSTECH-ETRI’s Submission to the WMT2020 APE Shared Task: Automatic Post-Editing with Cross-lingual Language Model
Jihyung Lee | WonKee Lee | Jaehun Shin | Baikjin Jung | Young-Kil Kim | Jong-Hyeok Lee
Proceedings of the Fifth Conference on Machine Translation
This paper describes POSTECH-ETRI’s submission to WMT2020 for the shared task on automatic post-editing (APE) for 2 language pairs: English-German (En-De) and English-Chinese (En-Zh). We propose APE systems based on a cross-lingual language model, which jointly adopts translation language modeling (TLM) and masked language modeling (MLM) training objectives in the pre-training stage; the APE models then utilize jointly learned language representations between the source language and the target language. In addition, we created 19 million new synthetic triplets as additional training data for our final ensemble model. According to experimental results on the WMT2020 APE development data set, our models showed an improvement over the baseline by TER of -3.58 and a BLEU score of +5.3 for the En-De subtask; and TER of -5.29 and a BLEU score of +7.32 for the En-Zh subtask.
2019
JBNU at MRP 2019: Multi-level Biaffine Attention for Semantic Dependency Parsing
Seung-Hoon Na | Jinwoon Min | Kwanghyeon Park | Jong-Hun Shin | Young-Kil Kim
Proceedings of the Shared Task on Cross-Framework Meaning Representation Parsing at the 2019 Conference on Natural Language Learning
This paper describes Jeonbuk National University (JBNU)’s system for the 2019 shared task on Cross-Framework Meaning Representation Parsing (MRP 2019) at the Conference on Computational Natural Language Learning. Of the five frameworks, we address only the DELPH-IN MRS Bi-Lexical Dependencies (DP), Prague Semantic Dependencies (PSD), and Universal Conceptual Cognitive Annotation (UCCA) frameworks. We propose a unified parsing model using biaffine attention (Dozat and Manning, 2017), consisting of 1) a BERT-BiLSTM encoder and 2) a biaffine attention decoder. First, the BERT-BiLSTM sentence encoder uses BERT to compose a sentence’s wordpieces into word-level embeddings and subsequently applies BiLSTM to word-level representations. Second, the biaffine attention decoder determines the scores for an edge’s existence and its labels based on biaffine attention functions between role-dependent representations. We also present multi-level biaffine attention models by combining all the role-dependent representations that appear at multiple intermediate layers.
Data Augmentation by Data Noising for Open-vocabulary Slots in Spoken Language Understanding
Hwa-Yeon Kim | Yoon-Hyung Roh | Young-Kil Kim
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop
One of the main challenges in Spoken Language Understanding (SLU) is dealing with ‘open-vocabulary’ slots. Recently, SLU models based on neural networks were proposed, but it is still difficult to recognize the slots of unknown words or ‘open-vocabulary’ slots because of the high cost of creating a manually tagged SLU dataset. This paper proposes data noising, which reflects the characteristics of the ‘open-vocabulary’ slots, for data augmentation. We applied it to an attention-based bi-directional recurrent neural network (Liu and Lane, 2016) and experimented with three datasets: Airline Travel Information System (ATIS), Snips, and MIT-Restaurant. We achieved performance improvements of up to 0.57% and 3.25 in intent prediction (accuracy) and slot filling (f1-score), respectively. Our method is advantageous because it does not require additional memory and it can be applied simultaneously with the training process of the model.
2018
Improving a Multi-Source Neural Machine Translation Model with Corpus Extension for Low-Resource Languages
Gyu-Hyeon Choi | Jong-Hun Shin | Young-Kil Kim
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Two-Step Training and Mixed Encoding-Decoding for Implementing a Generative Chatbot with a Small Dialogue Corpus
Jintae Kim | Hyeon-Gu Lee | Harksoo Kim | Yeonsoo Lee | Young-Gil Kim
Proceedings of the Workshop on Intelligent Interactive Systems and Language Generation (2IS&NLG)
Automatic Evaluation of English-to-Korean and Korean-to-English Neural Machine Translation Systems by Linguistic Test Points
Sung-Kwon Choi | Gyu-Hyeun Choi | Youngkil Kim
Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation
2015
Semi-automatic Filtering of Translation Errors in Triangle Corpus
Sung-Kwon Choi | Jong-Hun Shin | Young-Gil Kim
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation: Posters
2013
Patent translation as technical document translation: customizing a Chinese-Korean MT system to patent domain
Yun Jin | Oh-Woog Kwon | Seung-Hoon Na | Young-Gil Kim
Proceedings of the 5th Workshop on Patent Translation
2012
Applying Statistical Post-Editing to English-to-Korean Rule-based Machine Translation System
Ki-Young Lee | Young-Gil Kim
Proceedings of the 26th Pacific Asia Conference on Language, Information, and Computation
2011
Improving PP Attachment Disambiguation in a Rule-based Parser
Yoon-Hyung Roh | Ki-Young Lee | Young-Gil Kim
Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation
2009
Incorporating Statistical Information of Lexical Dependency into a Rule-Based Parser
Yoon-Hyung Roh | Ki-Young Lee | Young-Gil Kim
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, Volume 2
Effective Use of Chinese Structural Auxiliaries for Chinese Parsing
Yun Jin | Qing Li | Yingshun Wu | Young-Gil Kim
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, Volume 2
Customizing an English-Korean Machine Translation System for Patent/Technical Documents Translation
Oh-Woog Kwon | Sung-Kwon Choi | Ki-Young Lee | Yoon-Hyung Roh | Young-Gil Kim
Proceedings of the 23rd Pacific Asia Conference on Language, Information and Computation, Volume 2
2008
Paraphrasing Depending on Bilingual Context Toward Generalization of Translation Knowledge
Young-Sook Hwang | YoungKil Kim | Sangkyu Park
Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-I
How to Overcome the Domain Barriers in Pattern-Based Machine Translation System
Sung-Kwon Choi | Ki-Young Lee | Yoon-Hyung Roh | Oh-Woog Kwon | Young-Gil Kim
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation
What is Needed the Most in MT-Supported Paper Writing
Chang Hyun Kim | Oh-Woog Kwon | Young Kil Kim
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation
Recognizing Coordinate Structures for Machine Translation of English Patent Documents
Yoon-Hyung Roh | Ki-Young Lee | Sung-Kwon Choi | Oh-Woog Kwon | Young-Gil Kim
Proceedings of the 22nd Pacific Asia Conference on Language, Information and Computation
2007
Getting professional translation through user interaction
Young-Ae Seo | Chang-Hyun Kim | Seong-Il Yang | Young-gil Kim
Proceedings of Machine Translation Summit XI: Papers
English-Korean patent system: fromTo-EK/PAT
Oh-Woog Kwon | Sung-Kwon Choi | Ki-Young Lee | Yoon-Hyung Roh | Young-Gil Kim | Munpyo Hong
Proceedings of the Workshop on Patent translation
Customizing an English-Korean Machine Translation System for Patent Translation
Sung-Kwon Choi | Young-Gil Kim
Proceedings of the 21st Pacific Asia Conference on Language, Information and Computation
Semi-Automatic Annotation Tool to Build Large Dependency Tree-Tagged Corpus
Eun-Jin Park | Jae-Hoon Kim | Chang-Hyun Kim | Young-Kill Kim
Proceedings of the 21st Pacific Asia Conference on Language, Information and Computation
2005
Customizing a Korean-English MT System for Patent Translation
Munpyo Hong | Young-Gil Kim | Chang-Hyun Kim | Seong-Il Yang | Young-Ae Seo | Cheol Ryu | Sang-Kyu Park
Proceedings of Machine Translation Summit X: Papers
This paper addresses a customization process of a Korean-English MT system for patent translation. The major customization steps include terminology construction, linguistic study, and the modification of the existing analysis and generation modules. To our knowledge, this is the first worth-mentioning large-scale customization effort of an MT system for Korean and English. This research was performed under the auspices of the MIC (Ministry of Information and Communication) of the Korean government. A prototype patent MT system for the electronics domain was installed and is being tested in the Korean Intellectual Property Office.
Terminology Construction Workflow for Korean-English Patent MT
Young-Gil Kim | Seong-Il Yang | Munpyo Hong | Chang-Hyun Kim | Young-Ae Seo | Cheol Ryu | Sang-Kyu Park | Se-Young Park
Workshop on patent translation
This paper addresses the workflow for terminology construction for a Korean-English patent MT system. The workflow consists of the stage for setting lexical goals and the semi-automatic terminology construction stage. As there is no comparable system, it is difficult to determine how many terms are needed. To estimate the number of needed terms, we analyzed 45,000 patent documents. Given the limited time and budget, we resorted to semi-automatic methods to create the bilingual term dictionary in the electronics domain. We will show that parenthesis information in Korean patent documents and a bilingual title corpus can be successfully used to build a bilingual term dictionary.
2004
Semi-Automatic Construction of Korean-Chinese Verb Patterns Based on Translation Equivalency
Munpyo Hong | Young-Kil Kim | Sang-Kyu Park | Young-Jik Lee
Proceedings of the Workshop on Multilingual Linguistic Resources
2002
Korean-Chinese machine translation based on verb patterns
Changhyun Kim | Munpyo Hong | Yinxia Huang | Young Kil Kim | Sung Il Yang | Young Ae Seo | Sung-Kwon Choi
Proceedings of the 5th Conference of the Association for Machine Translation in the Americas: Technical Papers
This paper describes our ongoing project “Korean-Chinese Machine Translation System”. The main knowledge of our system is verb patterns. Each verb can have several meanings and each meaning of a verb is represented by a verb pattern. A verb pattern consists of a source language pattern part for the analysis and the corresponding target language pattern part for the generation. Each pattern part, according to the degree of generality, contains lexical or semantic information for the arguments or adjuncts of each verb meaning. In this approach, accurate analysis can directly lead to natural and correct generation. Furthermore, as the transfer mainly depends upon verb patterns, the translation rate is expected to go higher as the size of the verb pattern set grows larger.
2001
Verb Pattern Based Korean-Chinese Machine Translation System
Changhyun Kim | Young Kil Kim | Munpyo Hong | Young Ae Seo | Sung Il Yang | Sung-Kwon Choi
Proceedings of the 16th Pacific Asia Conference on Language, Information and Computation
1999
From To K/E: a Korean-English machine translation system based on idiom recognition and fail softening
Byong-Rae Ryu | Youngkil Kim | Sanghwa Yuh | Sangkyu Park
Proceedings of Machine Translation Summit VII
In this paper we describe and experimentally evaluate FromTo K/E, a rule-based Korean-English machine translation system adapting transfer methodology. In accordance with the view that a successful Korean-English machine translation system presumes a highly efficient robust Korean parser, we develop a parser reinforced with "Fail Softening", i.e. the long sentence segmentation and the recovery of failed parse trees. To overcome the language-typological differences between Korean and English, we adopt a powerful module for processing Korean multi-word lexemes and Korean idiomatic expressions. Furthermore, prior to parsing Korean sentences, we try to resolve the ambiguity of words with unknown grammatical functions on the basis of the collocation and subcategorization information. The results of the experimental evaluation show that the degree of understandability for a sample of 2000 sentences amounts to 2.67, indicating that the meaning of the translated English sentences is almost clear to users, but the sentences still include minor grammatical or stylistic errors in up to 30% of all words.
Co-authors
- Sung-Kwon Choi 9
- Chang-Hyun Kim 7
- Ki-Young Lee 7
- Yoon-Hyung Roh 7
- Munpyo Hong 6
- Oh-Woog Kwon 6
- Sang-Kyu Park 5
- Young-Ae Seo 5
- Jong-hun Shin 3
- Seong-Il Yang 3
- Yun Jin 2
- Seung-Hoon Na 2
- Cheol Ryu 2
- Sung Il Yang 2
- GyuHyeon Choi 1
- Gyu-Hyeun Choi 1
- Yinxia Huang 1
- Young-Sook Hwang 1
- Baikjin Jung 1
- Hwa-Yeon Kim 1
- Jintae Kim 1
- Harksoo Kim 1
- Jae-Hoon Kim 1
- Jihyung Lee 1
- Wonkee Lee 1
- Jong-Hyeok Lee 1
- Young-Jik Lee 1
- Hyeon-gu Lee 1
- Yeonsoo Lee 1
- Qing Li 1
- Jinwoon Min 1
- Se-Young Park 1
- Kwanghyeon Park 1
- Eun-Jin Park 1
- Byong-Rae Ryu 1
- Jaehun Shin 1
- Yingshun Wu 1
- Sanghwa Yuh 1