Huyin H. Xie


2022

pdf
Construction of Segmentation and Part of Speech Annotation Model in Ancient Chinese
Longjie Jiang | Qinyu C. Chang | Huyin H. Xie | Zhuying Z. Xia
Proceedings of the Second Workshop on Language Technologies for Historical and Ancient Languages

Among the four civilizations in the world with the longest history, only Chinese civilization has been inherited and never interrupted for 5000 years. An important factor is that the Chinese nation has the fine tradition of sorting out classics. Recording history with words, inheriting culture through continuous collation of indigenous accounts, and maintaining the spread of Chinese civilization. In this competition, the siku-roberta model was introduced into the part-of-speech tagging task of ancient Chinese by using the Zuozhuan data set, and good prediction results were obtained.