Jackson Lee
2022
PyCantonese: Cantonese Linguistics and NLP in Python
Jackson Lee
|
Litong Chen
|
Charles Lam
|
Chaak Ming Lau
|
Tsz-Him Tsui
Proceedings of the Thirteenth Language Resources and Evaluation Conference
This paper introduces PyCantonese, an open-source Python library for Cantonese linguistics and natural language processing. After the library design, implementation, corpus data format, and key datasets included are introduced, the paper provides an overview of the currently implemented functionality: stop words, handling Jyutping romanization, word segmentation, part-of-speech tagging, and parsing Cantonese text.
2016
Linguistica 5: Unsupervised Learning of Linguistic Structure
Jackson Lee
|
John Goldsmith
Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations
2015
Morphological Paradigms: Computational Structure and Unsupervised Learning
Jackson Lee
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop
Search