Chao-Lin Liu


2022

pdf
Introducing a Large Corpus of Tokenized Classical Chinese Poems of Tang and Song Dynasties
Chao-Lin Liu | Ti-Yong Zheng | Kuan-Chun Chen | Meng-Han Chung
Proceedings of the 2nd International Workshop on Natural Language Processing for Digital Humanities

Classical Chinese poems of Tang and Song dynasties are an important part for the studies of Chinese literature. To thoroughly understand the poems, properly segmenting the verses is an important step for human readers and software agents. Yet, due to the availability of data and the costs of annotation, there are still no known large and useful sources that offer classical Chinese poems with annotated word boundaries. In this project, annotators with Chinese literature background labeled 32399 poems. We analyzed the annotated patterns and conducted inter-rater agreement studies about the annotations. The distributions of the annotated patterns for poem lines are very close to some well-known professional heuristics, i.e., that the 2-2-1, 2-1-2, 2-2-1-2, and 2-2-2-1 patterns are very frequent. The annotators agreed well at the line level, but agreed on the segmentations of a whole poem only 43% of the time. We applied a traditional machine-learning approach to segment the poems, and achieved promising results at the line level as well. Using the annotated data as the ground truth, these methods could segment only about 18% of the poems completely right under favorable conditions. Switching to deep-learning methods helped us achieved better than 30%.

pdf
Using Machine Learning and Pattern-Based Methods for Identifying Elements in Chinese Judgment Documents of Civil Cases
Hong-Ren Lin | Wei-Zhi Liu | Chao-Lin Liu | Chieh Yang
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)

Providing structural information about civil cases for judgement prediction systems or recommendation systems can enhance the efficiency of the inference procedures and the justifiability of produced results. In this research, we focus on the civil cases about alimony, which is a relatively uncommon choice in current applications of artificial intelligence in law. We attempt to identify the statements for four types of legal functions in judgement documents, i.e., the pleadings of the applicants, the responses of the opposite parties, the opinions of the courts, and uses of laws to reach the final decisions. In addition, we also try to identify the conflicting issues between the plaintiffs and the defendants in the judgement documents.

pdf
Predicting Judgments and Grants for Civil Cases of Alimony for the Elderly
Wei-Zhi Liu | Po-Hsien Wu | Hong-Ren Lin | Chao-Lin Liu
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)

The needs for mediation are increasing rapidly along with the increasing number of cases of the alimony for the elderly in recent years. Offering a prediction mechanism for predicting the outcomes of some prospective lawsuits may alleviate the workload of the mediation courts. This research aims to offer the predictions for the judgments and the granted alimony for the plaintiffs of such civil cases in Chinese, based on our analysis of results of the past lawsuits. We hope that the results can be helpful for both the involved parties and the courts. To build the current system, we segment and vectorize the texts of the judgement documents, and apply the logistic regression and model tree models for predicting the judgments and for estimating the granted alimony of the cases, respectively.

pdf
Clustering Issues in Civil Judgments for Recommending Similar Cases
Yi-Fan Liu | Chao-Lin Liu | Chieh Yang
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)

Similar judgments search is an important task in legal practice, from which valuable legal insights can be obtained. Issues are disputes between both parties in civil litigation, which represents the core topics to be considered in the trials. Many studies calculate the similarity between judgments from different perspectives and methods. We first cluster the issues in the judgments, and then encode the judgments with vectors for whether or not the judgments contain issues in the corresponding clusters. The similarity between the judgments are evaluated based on the encoded messages. We verify the effectiveness of the system with a human scoring process by a legal background assistant, while comparing the effects of several combinations of preprocessing steps and selections of clustering strategies.

pdf
MIGBaseline at ROCLING 2022 Shared Task: Report on Named Entity Recognition Using Chinese Healthcare Datasets
Hsing-Yuan Ma | Wei-Jie Li | Chao-Lin Liu
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)

Named Entity Recognition (NER) tools have been in development for years, yet few have been aimed at medical documents. The increasing needs for analyzing medical data makes it crucial to build a sophisticated NER model for this missing area. In this paper, W2NER, the state-of-the-art NER model, which has excelled in English and Chinese tasks, is run through selected inputs, several pretrained language models, and training strategies. The objective was to build an NER model suitable for healthcare corpora in Chinese. The best model managed to achieve an F1 score at 81.93%, which ranked first in the ROCLING 2022 shared task.

2020

pdf
Natural Language Processing for Digital Humanities
Chao-Lin Liu | Jen-Joe Hung | Su-bing Chang | Wan-yi Wu
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing (ROCLING 2020)

pdf
Optical Character Recognition, Word Segmentation, Sentence Segmentation, and Information Extraction for Historical and Literature Texts in Classical Chinese
Chao-Lin Liu
Proceedings of the 32nd Conference on Computational Linguistics and Speech Processing (ROCLING 2020)

2019

pdf
基於語境特徵及分群模型之中文多義詞消歧(Using Contextual Information in Clustering Methods for Chinese Word Disambiguation)
Yu-Yuan Lee | Tzu-Hao Chou | Chao-Lin Liu
Proceedings of the 31st Conference on Computational Linguistics and Speech Processing (ROCLING 2019)

2016

pdf
Tracking Words in Chinese Poetry of Tang and Song Dynasties with the China Biographical Database
Chao-Lin Liu | Kuo-Feng Luo
Proceedings of the Workshop on Language Technology Resources and Tools for Digital Humanities (LT4DH)

(This is the abstract for the submission.) Large-scale comparisons between the poetry of Tang and Song dynasties shed light on how words and expressions were used and shared among the poets. That some words were used only in the Tang poetry and some only in the Song poetry could lead to interesting research in linguistics. That the most frequent colors are different in the Tang and Song poetry provides a trace of the changing social circumstances in the dynasties. Results of the current work link to research topics of lexicography, semantics, and social transitions. We discuss our findings and present our algorithms for efficient comparisons among the poems, which are crucial for completing billion times of comparisons within acceptable time.

2015

pdf
Toward Algorithmic Discovery of Biographical Information in Local Gazetteers of Ancient China
Chao-Lin Liu | Chih-Kai Huang | Hongsu Wang | Peter K. Bol
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation

pdf
Color Aesthetics and Social Networks in Complete Tang Poems: Explorations and Discoveries
Chao-Lin Liu | Hongsu Wang | Wen-Huei Cheng | Chu-Ting Hsu | Wei-Yun Chiu
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation: Posters

pdf
《全唐詩》的分析、探勘與應用-風格、對仗、社會網路與對聯(Textual Analysis of Complete Tang Poems for Discoveries and Applications - Style, Antitheses, Social Networks, and Couplets)[In Chinese]
Chao-Lin Liu | Chun-Ning Chang | Chu-Ting Hsu | Wen-Hui Cheng | Hongsu Wang | Wei-Yun Chiu
Proceedings of the 27th Conference on Computational Linguistics and Speech Processing (ROCLING 2015)

2013

pdf
Chinese Spelling Check Evaluation at SIGHAN Bake-off 2013
Shih-Hung Wu | Chao-Lin Liu | Lung-Hao Lee
Proceedings of the Seventh SIGHAN Workshop on Chinese Language Processing

pdf
中英文的文字蘊涵與閱讀測驗的初步探索 (An Exploration of Textual Entailment and Reading Comprehension for Chinese and English) [In Chinese]
Wei-Jie Huang | Po-Cheng Lin | Chao-Lin Liu
Proceedings of the 25th Conference on Computational Linguistics and Speech Processing (ROCLING 2013)

2012

pdf
英文介系詞片語定位與英文介系詞推薦 (Attachment of English Prepositional Phrasesand Suggestions of English Prepositions) [In Chinese]
Chia-Chi Tsai | Chao-Lin Liu
Proceedings of the 24th Conference on Computational Linguistics and Speech Processing (ROCLING 2012)

pdf
應用平行語料建構中文斷詞組件 (Applications of Parallel Corpora for Chinese Segmentation) [In Chinese]
Jui-Ping Wang | Chao-Lin Liu
Proceedings of the 24th Conference on Computational Linguistics and Speech Processing (ROCLING 2012)

pdf bib
Effects of Combining Bilingual and Collocational Information on Translation of English and Chinese Verb-Noun Pairs
Yi-Hsuan Chuang | Chao-Lin Liu | Jing-Shin Chang
International Journal of Computational Linguistics & Chinese Language Processing, Volume 17, Number 3, September 2012

pdf bib
Applications of GPC Rules and Character Structures in Games for Learning Chinese Characters
Wei-Jie Huang | Chia-Ru Chou | Yu-Lin Tzeng | Chia-Ying Lee | Chao-Lin Liu
Proceedings of the ACL 2012 System Demonstrations

2011

pdf
英文技術文獻中一般動詞與其受詞之中文翻譯的語境效用 (Collocational Influences on the Chinese Translations of Non-Technical English Verbs and Their Objects in Technical Documents) [In Chinese]
Yi-Hsuan Chuang | Jui-Ping Wang | Chia-Chi Tsai | Chao-Lin Liu
Proceedings of the 23rd Conference on Computational Linguistics and Speech Processing (ROCLING 2011)

pdf
Some Chances and Challenges in Applying Language Technologies to Historical Studies in Chinese
Chao-Lin Liu | Guantao Jin | Qingfeng Liu | Wei-Yun Chiu | Yih-Soong Yu
International Journal of Computational Linguistics & Chinese Language Processing, Volume 16, Number 1-2, March/June 2011

pdf
Translating Common English and Chinese Verb-Noun Pairs in Technical Documents with Collocational and Bilingual Information
Yi-Hsuan Chuang | Chao-Lin Liu | Jing-Shin Chang
Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation

pdf
A Construction Grammar Approach to Prepositional Phrase Attachment: Semantic Feature Analysis of V NP1 into NP2 Construction
Liyin Chen | Siaw-Fong Chung | Chao-Lin Liu
Proceedings of the 25th Pacific Asia Conference on Language, Information and Computation

2010

pdf
Reducing the False Alarm Rate of Chinese Character Error Detection and Correction
Shih-Hung Wu | Yong-Zhi Chen | Ping-che Yang | Tsun Ku | Chao-Lin Liu
CIPS-SIGHAN Joint Conference on Chinese Language Processing

pdf
Visually and Phonologically Similar Characters in Incorrect Simplified Chinese Words
Chao-Lin Liu | Min-Hua Lai | Yi-Hsuan Chuang | Chia-Ying Lee
Coling 2010: Posters

pdf
以語文特徵為基之中學閱讀測驗短文分級 (Using Linguistic Features to Classify Texts for Reading Comprehension Tests at the High School Levels) [In Chinese]
Chao-Shainn Huang | Wei-Ti Kuo | Chia-Ling Li | Chia-Chi Tsai | Chao-Lin Liu
Proceedings of the 22nd Conference on Computational Linguistics and Speech Processing (ROCLING 2010)

pdf
以共現資訊為基礎增進中學英漢翻譯試題與解答之詞彙對列 (Using Co-Occurrence Information to Improve Chinese-English Word Alignment in Translation Test Items for High School Students) [In Chinese]
Chao-Shainn Huang | Yu-Chi Chang | Chao-Lin Liu | Yuan-Hsien Tseng
Proceedings of the 22nd Conference on Computational Linguistics and Speech Processing (ROCLING 2010)

pdf
中文短句之情緒分類 (Sentiment Classification of Short Chinese Sentences) [In Chinese]
Ying-Tse Sun | Chien-Liang Chen | Chun-Chieh Liu | Chao-Lin Liu | Von-Wun Soo
Proceedings of the 22nd Conference on Computational Linguistics and Speech Processing (ROCLING 2010)

pdf
Using Linguistic Features to Predict Readability of Short Essays for Senior High School Students in Taiwan
Wei-Ti Kuo | Chao-Shainn Huang | Chao-Lin Liu
International Journal of Computational Linguistics & Chinese Language Processing, Volume 15, Number 3-4, September/December 2010

2009

pdf
中英文專利文書之文句對列 (Sentence alignment of English and Chinese patent documents) [In Chinese]
Kan-Wen Tien | Yuen-Hsien Tseng | Chao-Lin Liu
Proceedings of the 21st Conference on Computational Linguistics and Speech Processing

pdf
電腦輔助句子重組試題編製 (Computer assisted test-item generation for sentence reconstruction) [In Chinese]
Chih-Bin Huang | Chao-Lin Liu | Wei-Ti Kuo | Ying-Tse Sun | Min-Hua Lai
Proceedings of the 21st Conference on Computational Linguistics and Speech Processing

pdf bib
專利雙語語料之中、英對照詞自動擷取 (Automatic Term Pair Extraction from Bilingual Patent Corpus) [In Chinese]
Yuen-Hsien Tseng | Chao-Lin Liu | Ze-Jing Chuang
ROCLING 2009 Poster Papers

bib
International Journal of Computational Linguistics & Chinese Language Processing, Volume 14, Number 2, June 2009-Special Issue on Computer Assisted Language Learning
Chao-Lin Liu | Zhao-Ming Gao
International Journal of Computational Linguistics & Chinese Language Processing, Volume 14, Number 2, June 2009-Special Issue on Computer Assisted Language Learning

pdf
Phonological and Logographic Influences on Errors in Written Chinese Words
Chao-Lin Liu | Kan-Wen Tien | Min-Hua Lai | Yi-Hsuan Chuang | Shih-Hung Wu
Proceedings of the 7th Workshop on Asian Language Resources (ALR7)

pdf
Capturing Errors in Written Chinese Words
Chao-Lin Liu | Kan-Wen Tien | Min-Hua Lai | Yi-Hsuan Chuang | Shih-Hung Wu
Proceedings of the ACL-IJCNLP 2009 Conference Short Papers

2008

pdf
Using Structural Information for Identifying Similar Chinese Characters
Chao-Lin Liu | Jen-Hsiang Lin
Proceedings of ACL-08: HLT, Short Papers

bib
Proceedings of the 20th Conference on Computational Linguistics and Speech Processing
Chao-Lin Liu | Berlin Chen
Proceedings of the 20th Conference on Computational Linguistics and Speech Processing

pdf
形音相近的易混淆漢字的搜尋與應用 (Identification and Applications of Visually Confusing Chinese Characters ) [In Chinese]
Chao-Lin Liu | Chih-Pin Huang | Jui-Yu Weng | Yi-Hsuan Chuang
Proceedings of the 20th Conference on Computational Linguistics and Speech Processing

pdf
電腦輔助中學程度漢英翻譯習作環境之建置 (Computer Assisted Learning of English-Chinese Translation for Middle Schoolers) [In Chinese]
Min Hua Lai | Chao-Lin Liu
ROCLING 2008 Poster Papers

pdf
以範例為基礎之英漢TIMSS詴題輔助翻譯 (Example Based Machine Translation of TIMSS Test Items) [In Chinese]
Chih-Chieh Chang | Chao-Lin Liu
ROCLING 2008 Poster Papers

pdf
電腦輔助推薦學術會議論文評審委員之初探 (An Exploration of Algorithmic Recommendation of Reviewers for Conference Manuscripts) [In Chinese]
Yu-Hsi Chen | Chao-Lin Liu
ROCLING 2008 Poster Papers

2007

pdf
以文件分類技術預測股價趨勢 (Predicting Trends of Stock Prices with Text Classification Techniques) [In Chinese]
Jiun-Da Chen | Tai-Ping Wang | Chao-Lin Liu
ROCLING 2007 Poster Papers

pdf
針對數學與科學教育領域之電腦輔助英中試題翻譯系統 (An Exploration of Computer Assisted Translation of Test Items for Mathematics and Sciences) [In Chinese]
Ming-Shin Lu | Zhao Ming Gao | Chao-Lin Liu | Chun-Yen Chang
ROCLING 2007 Poster Papers

2005

pdf
利用向量支撐機辨識中文基底名詞組的初步研究 (A Preliminary Study on Chinese Base NP Detection using SVM) [In Chinese]
Hsi-Wei Chang | Zhao Ming Gao | Chao-Lin Liu
Proceedings of the 17th Conference on Computational Linguistics and Speech Processing

pdf bib
Using Lexical Constraints to Enhance the Quality of Computer-Generated Multiple-Choice Cloze Items
Chao-Lin Liu | Chun-Hung Wang | Zhao-Ming Gao
International Journal of Computational Linguistics & Chinese Language Processing, Volume 10, Number 3, September 2005: Special Issue on Selected Papers from ROCLING XVI

pdf bib
Applications of Lexical Information for Algorithmically Composing Multiple-Choice Cloze Items
Chao-Lin Liu | Chun-Hung Wang | Zhao-Ming Gao | Shang-Ming Huang
Proceedings of the Second Workshop on Building Educational Applications Using NLP

2004

pdf
利用自然語言處理技術自動產生英文克漏詞試題之研究 (A Study on Natural Language Processing Aided Grneration of Multiple-Choice Cloze Items) [In Chinese]
Chun-Hung Wang | Chao-Lin Liu | Zhao Ming Gao
Proceedings of the 16th Conference on Computational Linguistics and Speech Processing

2002

bib
以構詞律與相似法為本的中文動詞自動分類研究 (A Hybrid Approach for Automatic Classification of Chinese Unknown Verbs) [In Chinese]
Hui-Hsin Tseng | Chao-Lin Liu | Zhao-Ming Gao | Keh-Jiann Chen
International Journal of Computational Linguistics & Chinese Language Processing, Volume 7, Number 1, February 2002: Special Issue on HowNet and Its Applications

2001

pdf
中文動詞自動分類研究 (Automatic Classification of Chinese Unknown Verbs) [In Chinese]
Hui-hsin Tseng | Chao-Lin Liu | Zhao Ming Gao | Keh-Jiann Chen
Proceedings of Research on Computational Linguistics Conference XIV

1990

pdf
The Semantic Score Approach to the Disambiguation of PP Attachment Problem
Chao-Lin Liu | Jing-Shin Chang | Keh-Yih Su
Proceedings of Rocling III Computational Linguistics Conference III

1989

pdf
A Quantitative Comparison Between an LR Parser and an ATN Interpreter
Chao-Lin Liu | Keh-Yih Su
Proceedings of Rocling II Computational Linguistics Conference II