Charles Lam
2022
PyCantonese: Cantonese Linguistics and NLP in Python
Jackson Lee
|
Litong Chen
|
Charles Lam
|
Chaak Ming Lau
|
Tsz-Him Tsui
Proceedings of the Thirteenth Language Resources and Evaluation Conference
This paper introduces PyCantonese, an open-source Python library for Cantonese linguistics and natural language processing. After the library design, implementation, corpus data format, and key datasets included are introduced, the paper provides an overview of the currently implemented functionality: stop words, handling Jyutping romanization, word segmentation, part-of-speech tagging, and parsing Cantonese text.
2020
Forms and Meanings of Lexical Reduplications in Cantonese: a corpus study
Charles Lam
Proceedings of the 34th Pacific Asia Conference on Language, Information and Computation
2014
A Unified Analysis to Surpass Comparative and Experiential Aspect
Charles Lam
Proceedings of the 28th Pacific Asia Conference on Language, Information and Computing
2013
Reduplication across Categories in Cantonese
Charles Lam
Proceedings of the 27th Pacific Asia Conference on Language, Information, and Computation (PACLIC 27)
Search