基于大规模语料库的《古籍汉字分级字表》研究(The Formulation of The graded Chinese character list of ancient books Based on Large-scale Corpus)

Changwei Xu (许长伟), Minxuan Feng (冯敏萱), Bin Li (李斌), Yiguo Yuan (袁义国)

Abstract

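The Graded Chinese Character List of Ancient Books (《古籍汉字分级字表》) is a graded character list built from a large-scale corpus of ancient texts to help learners read ancient documents. It fills a gap in research on character lists for ancient books by grading characters according to their learning priority, and it currently contains 105 level-1 characters, 340 level-2 characters, and 555 level-3 characters. This paper describes the main basis and basic steps of the list's development, and verifies the soundness of its character selection by comparing it with the traditional literacy primers known as the "San Bai Qian" (三百千: the Three Character Classic, the Hundred Family Surnames, and the Thousand Character Classic) and with the List of Frequently Used Characters in Modern Chinese (《现代汉语常用字表》). The list helps learners master the most frequently used characters of ancient texts first and improve their ability to read ancient books, thereby supporting the inheritance and development of fine traditional Chinese culture.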

Anthology ID: 2021.ccl-1.70
Volume: Proceedings of the 20th Chinese National Conference on Computational Linguistics
Month: August
Year: 2021
Address: Huhhot, China
Editors: Sheng Li (李生), Maosong Sun (孙茂松), Yang Liu (刘洋), Hua Wu (吴华), Kang Liu (刘康), Wanxiang Che (车万翔), Shizhu He (何世柱), Gaoqi Rao (饶高琦)
Venue: CCL
Publisher: Chinese Information Processing Society of China
Pages: 781–791
Language: Chinese
URL: https://aclanthology.org/2021.ccl-1.70
Cite (ACL): Changwei Xu, Minxuan Feng, Bin Li, and Yiguo Yuan. 2021. 基于大规模语料库的《古籍汉字分级字表》研究(The Formulation of The graded Chinese character list of ancient books Based on Large-scale Corpus). In Proceedings of the 20th Chinese National Conference on Computational Linguistics, pages 781–791, Huhhot, China. Chinese Information Processing Society of China.
Cite (Informal): 基于大规模语料库的《古籍汉字分级字表》研究(The Formulation of The graded Chinese character list of ancient books Based on Large-scale Corpus) (Xu et al., CCL 2021)
PDF: https://preview.aclanthology.org/nschneid-patch-2/2021.ccl-1.70.pdf
