Chi-Hsin Yu


2014

Chinese Word Ordering Errors Detection and Correction for Non-Native Chinese Language Learners
Shuk-Man Cheng | Chi-Hsin Yu | Hsin-Hsi Chen
Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers

2013

Analyses of the Association between Discourse Relation and Sentiment Polarity with a Chinese Human-Annotated Corpus
Hen-Hsen Huang | Chi-Hsin Yu | Tai-Wei Chang | Cong-Kai Lin | Hsin-Hsi Chen
Proceedings of the 7th Linguistic Annotation Workshop and Interoperability with Discourse

2012

廣義知網詞彙意見極性的預測 (Predicting the Semantic Orientation of Terms in E-HowNet) [In Chinese]
Cheng-Ru Li | Chi-Hsin Yu | Hsin-Hsi Chen
International Journal of Computational Linguistics & Chinese Language Processing, Volume 17, Number 2, June 2012-Special Issue on Selected Papers from ROCLING XXIII

Detecting Word Ordering Errors in Chinese Sentences for Learning Chinese as a Foreign Language
Chi-Hsin Yu | Hsin-Hsi Chen
Proceedings of COLING 2012

Chinese Web Scale Linguistic Datasets and Toolkit
Chi-Hsin Yu | Hsin-Hsi Chen
Proceedings of COLING 2012: Demonstration Papers

Development of a Web-Scale Chinese Word N-gram Corpus with Parts of Speech Information
Chi-Hsin Yu | Yi-jie Tang | Hsin-Hsi Chen
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)

The Web provides a large-scale corpus for researchers to study language usage in the real world. Developing a web-scale corpus requires not only substantial computational resources but also great effort to handle the large variations in web texts, such as character encoding when processing Chinese web texts. In this paper, we aim to develop a web-scale Chinese word N-gram corpus with part-of-speech information, called the NTU PN-Gram corpus, using the ClueWeb09 dataset. We focus on character encoding and some Chinese-specific issues. Statistics about the dataset are reported. We will make the resulting corpus a publicly available resource to boost Chinese language processing.

2011

廣義知網詞彙意見極性的預測 (Predicting the Semantic Orientation of Terms in E-HowNet) [In Chinese]
Cheng-Ru Li | Chi-Hsin Yu | Hsin-Hsi Chen
Proceedings of the 23rd Conference on Computational Linguistics and Speech Processing (ROCLING 2011)