基于大规模语料库的《古籍汉字分级字表》研究(The Formulation of The graded Chinese character list of ancient books Based on Large-scale Corpus)
Changwei Xu (许长伟), Minxuan Feng (冯敏萱), Bin Li (李斌), Yiguo Yuan (袁义国)
Abstract
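The Graded Chinese Character List of Ancient Books (《古籍汉字分级字表》) is a graded character list built from a large-scale corpus of ancient-book texts to help learners read ancient documents. It fills a gap in research on character lists for ancient books and grades the characters of ancient books by learning priority; it currently contains 105 first-level characters, 340 second-level characters, and 555 third-level characters. This paper describes the main basis and basic steps for developing the list, and compares it with the traditional literacy primers known as the "San Bai Qian" (三百千: the Three Character Classic, the Hundred Family Surnames, and the Thousand Character Classic) and with the List of Frequently Used Characters in Modern Chinese (《现代汉语常用字表》) to verify that its character selection is reasonable. The list helps learners master the characters most commonly used in ancient texts first and improve their ability to read ancient books, thereby promoting the inheritance and development of fine traditional Chinese culture.
- Anthology ID: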
- 2021.ccl-1.70
- Volume:
- Proceedings of the 20th Chinese National Conference on Computational Linguistics
- Month:
- August
- Year:
- 2021
- Address:
- Huhhot, China
- Editors:
- Sheng Li (李生), Maosong Sun (孙茂松), Yang Liu (刘洋), Hua Wu (吴华), Kang Liu (刘康), Wanxiang Che (车万翔), Shizhu He (何世柱), Gaoqi Rao (饶高琦)
- Venue:
- CCL
- Publisher:
- Chinese Information Processing Society of China
- Pages:
- 781–791
- Language:
- Chinese
- URL:
- https://aclanthology.org/2021.ccl-1.70
- Cite (ACL):
- Changwei Xu, Minxuan Feng, Bin Li, and Yiguo Yuan. 2021. 基于大规模语料库的《古籍汉字分级字表》研究(The Formulation of The graded Chinese character list of ancient books Based on Large-scale Corpus). In Proceedings of the 20th Chinese National Conference on Computational Linguistics, pages 781–791, Huhhot, China. Chinese Information Processing Society of China.
- Cite (Informal):
- 基于大规模语料库的《古籍汉字分级字表》研究(The Formulation of The graded Chinese character list of ancient books Based on Large-scale Corpus) (Xu et al., CCL 2021)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/2021.ccl-1.70.pdf