@inproceedings{matteson-etal-2018-rich,
    title = "Rich Character-Level Information for {K}orean Morphological Analysis and Part-of-Speech Tagging",
    author = "Matteson, Andrew  and
      Lee, Chanhee  and
      Kim, Youngbum  and
      Lim, Heuiseok",
    editor = "Bender, Emily M.  and
      Derczynski, Leon  and
      Isabelle, Pierre",
    booktitle = "Proceedings of the 27th International Conference on Computational Linguistics",
    month = aug,
    year = "2018",
    address = "Santa Fe, New Mexico, USA",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/iwcs-25-ingestion/C18-1210/",
    pages = "2482--2492",
    abstract = "Due to the fact that Korean is a highly agglutinative, character-rich language, previous work on Korean morphological analysis typically employs the use of sub-character features known as graphemes or otherwise utilizes comprehensive prior linguistic knowledge (i.e., a dictionary of known morphological transformation forms, or actions). These models have been created with the assumption that character-level, dictionary-less morphological analysis was intractable due to the number of actions required. We present, in this study, a multi-stage action-based model that can perform morphological transformation and part-of-speech tagging using arbitrary units of input and apply it to the case of character-level Korean morphological analysis. Among models that do not employ prior linguistic knowledge, we achieve state-of-the-art word and sentence-level tagging accuracy with the Sejong Korean corpus using our proposed data-driven Bi-LSTM model."
}Markdown (Informal)
[Rich Character-Level Information for Korean Morphological Analysis and Part-of-Speech Tagging](https://preview.aclanthology.org/iwcs-25-ingestion/C18-1210/) (Matteson et al., COLING 2018)
ACL